Text::Statistics::Latin - Performs statistical analysis of corpora
Version 0.06
DESCRIPTION
Given a copus as input, Text::Statistics::Latin creates a seven column CSV file as output, with one line for each token per text. Names of input files need match the following pattern:
1 (1). txt', '1 (2). txt', ..., '1 (n).txt'
or
1 \(([1-9]|[1-9][0-9]+)\)\.txt
Columns store statistical information:
(1) number of word forms in document d; (2) number of tokens in d; (3) Id number of d, ie., n; (4) frequency of term t in d; (5) corpus frequency of t ; (6) document frequency of t (number of documents where t occurs at + least once); (7) t, UTF8 latin coded token-string delimited by C<< /[ -@]|[\[-` +]|[{-¿]|[ɐ-˩]|[ʹ-�]/ >> Main output file name is '1 (n + 5).txt' and it is stored in the s +ame directory as the corpus, together with residual files on each input file with . +txu and .txv ad hoc extensions. This code was written under CAPES BEX-09323-5
Example:
#!/usr/bin/perl use strict; use Text::CStatiBR; &Text::CStatiBR::CSTATIBR("5"); #5 files are analised. #Main output #file created is #1 (10).txt
&LATIN();
Rodrigo Panchiniak Fernandes, <fernandes at cpan.org>
<fernandes at cpan.org>
Please report any bugs or feature requests to bug-text-statistics-latin at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Text-Statistics-Latin. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.
bug-text-statistics-latin at rt.cpan.org
You can find documentation for this module with the perldoc command.
perldoc Text::Statistics::Latin
You can also look for information at:
AnnoCPAN: Annotated CPAN documentation
http://annocpan.org/dist/Text-Statistics-Latin
CPAN Ratings
http://cpanratings.perl.org/d/Text-Statistics-Latin
RT: CPAN's request tracker
http://rt.cpan.org/NoAuth/Bugs.html?Dist=Text-Statistics-Latin
Search CPAN
http://search.cpan.org/dist/Text-Statistics-Latin
Copyright 2007 Rodrigo Panchiniak Fernandes, all rights reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
Written under CAPES BEX-09323-5
1 POD Error
The following errors were encountered while parsing the POD:
Non-ASCII character seen before =encoding in '+]|[{-¿]|[ɐ-˩]|[ʹ-�]/'. Assuming UTF-8
To install Text::Statistics::Latin, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Text::Statistics::Latin
CPAN shell
perl -MCPAN -e shell install Text::Statistics::Latin
For more information on module installation, please visit the detailed CPAN module installation guide.