D
D Cameron
I just started writing perl a couple of weeks ago when I walked on to a
project that was using perl to extract data from a raw data file. It has
been a dream for text processing, as advertised. But after the data is
extracted I want to do some statistical tests of the sequential analysis
variety, and I'm not sure if perl is the best way to go. I was hoping
some people here could offer some opinions.
The questions I want to answer are along the lines of showing that a
Markov chain is of order r, and not order r-1; looking at individual
cells of a contingency table to see which ones are significant; and
calculating the likelihood ratio chi-square (LRX^2).
The calculations involve a lot of matrix manipulation which I'm
currently doing with 2D arrays and lots of for(i;i<n;i++) loops. I've
searched around CPAN a bit but the Statistics and Algorithm namespaces
didn't offer what I was looking for. After getting great results from
Algorithm::Cluster I was hoping I'd find something similar for
sequential analysis.
My perl knowledge is limited, I'm only around chapter 4 of the Camel
book. The entertaining writing has me reading every page, but I was
quite impressed to find I could do a
%results{$picture}{$question}[$prev][$curr] structure without too many
complaints.
Basically I'm wondering:
-Does the problem sound appropriate for perl, or should I go looking
somewhere else?
-Am I missing something about perl that would simplify my code
significantly?
-Can anyone suggest packages that would be useful that I've overlooked?
In case my statistical jargon is unclear, I'll mention my reference here:
Gottman & Roy (1990) Sequential Analysis: A guide for behavioral
researchers. New York: Cambridge University Press.
project that was using perl to extract data from a raw data file. It has
been a dream for text processing, as advertised. But after the data is
extracted I want to do some statistical tests of the sequential analysis
variety, and I'm not sure if perl is the best way to go. I was hoping
some people here could offer some opinions.
The questions I want to answer are along the lines of showing that a
Markov chain is of order r, and not order r-1; looking at individual
cells of a contingency table to see which ones are significant; and
calculating the likelihood ratio chi-square (LRX^2).
The calculations involve a lot of matrix manipulation which I'm
currently doing with 2D arrays and lots of for(i;i<n;i++) loops. I've
searched around CPAN a bit but the Statistics and Algorithm namespaces
didn't offer what I was looking for. After getting great results from
Algorithm::Cluster I was hoping I'd find something similar for
sequential analysis.
My perl knowledge is limited, I'm only around chapter 4 of the Camel
book. The entertaining writing has me reading every page, but I was
quite impressed to find I could do a
%results{$picture}{$question}[$prev][$curr] structure without too many
complaints.
Basically I'm wondering:
-Does the problem sound appropriate for perl, or should I go looking
somewhere else?
-Am I missing something about perl that would simplify my code
significantly?
-Can anyone suggest packages that would be useful that I've overlooked?
In case my statistical jargon is unclear, I'll mention my reference here:
Gottman & Roy (1990) Sequential Analysis: A guide for behavioral
researchers. New York: Cambridge University Press.