NAME

Text::Statistics::Arabic - Performs statistical corpora analysis

VERSION

Version 0.04

SYNOPSIS

Text::Statistics::Arabic creates a seven column CSV file output with one line each token per text given as input a corpus that files names follows ' 1 (1). txt', '1 (2). txt', ..., '1 (n).txt' or 1 \(([1-9]|[1-9][0-9]+)\)\.txt Columns stores statistical information: (1) number of word forms in document d; (2) number of tokens in d; (3) Id number of d, ie., n; (4) frequency of term t in d; (5) corpus frequency of t ; (6) document frequency of t (number of documents where t occurs at least once); (7) t, UTF8 latin coded token-string

Main output file name is '1 (n + 5).txt' and it is stored in the same directory as the corpus itself, toghether with residual files on each input file with .txu and .txv extensions.

Example:

use Text::Statistics::Arabic;
&arabic("4"); #3 (4-1) texts will be analised.

EXPORT

&arabic();

AUTHOR

Rodrigo Panchiniak Fernandes, <fernandes at cpan.org>

BUGS

Please report any bugs or feature requests to bug-text-statistics-latin at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Text-Statistics-Arabic. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.

SUPPORT

You can find documentation for this module with the perldoc command.

perldoc Text::Statistics::Arabic

You can also look for information at:

AnnoCPAN: Annotated CPAN documentation

http://annocpan.org/dist/Text-Statistics-Arabic
CPAN Ratings

http://cpanratings.perl.org/d/Text-Statistics-Arabic
RT: CPAN's request tracker

http://rt.cpan.org/NoAuth/Bugs.html?Dist=Text-Statistics-Arabic
Search CPAN

http://search.cpan.org/dist/Text-Statistics-Arabic

ACKNOWLEDGEMENTS

Alberto Manuel Brandão Simões

COPYRIGHT & LICENSE

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

This code was written under CAPES BEX-09323-5

1 POD Error

The following errors were encountered while parsing the POD:

Around line 372:: Non-ASCII character seen before =encoding in 'Brandão'. Assuming UTF-8

To install Text::Statistics::Arabic, copy and paste the appropriate command in to your terminal.

cpanm

cpanm Text::Statistics::Arabic

CPAN shell

perl -MCPAN -e shell
install Text::Statistics::Arabic

For more information on module installation, please visit the detailed CPAN module installation guide.

	Global
`s`	Focus search bar
`?`	Bring up this help dialog

	GitHub
`g` `p`	Go to pull requests
`g` `i`	go to github issues (only if github is preferred repository)

	POD
`g` `a`	Go to author
`g` `c`	Go to changes
`g` `i`	Go to issues
`g` `d`	Go to dist
`g` `r`	Go to repository/SCM
`g` `s`	Go to source
`g` `b`	Go to file browse

	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)