Text-Statistics-Devanagari version 0.04
==========================================
ABSTRACT
This module performs statistical analysis of corpora (1).
DESCRIPTION
Text::Statistics::Devanagari writes a seven-column CSV file with one line per token per text. The input is a corpus of UTF-8 encoded, Latin-script files whose names follow the pattern:
    '1 (1).txt', '1 (2).txt', ..., '1 (n).txt'
that is, names matching the regular expression:
    1 \(([1-9]|[1-9][0-9]+)\)\.txt
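As an illustration of the naming scheme, the following sketch checks file names against the regular expression above and extracts the document number n. The anchors ^ and $ and the helper name corpus_file_id are additions made for this example; they are not part of the module.

```perl
use strict;
use warnings;

# Pattern quoted from this README, anchored so the whole name must match.
my $corpus_name = qr/^1 \(([1-9]|[1-9][0-9]+)\)\.txt$/;

# Hypothetical helper: return the document number n for a corpus file
# name, or undef if the name does not follow the scheme.
sub corpus_file_id {
    my ($name) = @_;
    return $name =~ $corpus_name ? $1 : undef;
}

for my $name ('1 (1).txt', '1 (12).txt', '1 (0).txt', '2 (1).txt') {
    my $id = corpus_file_id($name);
    printf "%-12s %s\n", $name,
        defined $id ? "document $id" : "not a corpus file";
}
```

Note that the pattern rejects a document number of 0 and any leading zeros, so numbering is expected to start at '1 (1).txt'.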
The columns store the following statistical information:
(1) number of word forms in document d;
(2) number of tokens in d;
(3) ID number of d, i.e., n;
(4) frequency of term t in d;
(5) corpus frequency (2) of t;
(6) document frequency of t (number of documents where t occurs at least once);
(7) t, the token string in UTF-8 encoded Latin script.
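The seven columns can be illustrated with a minimal sketch that computes them for a small in-memory corpus. This is not the module's implementation: the function name corpus_rows, the whitespace tokenisation, the reading of "word forms" as distinct tokens, and the sample texts are all assumptions made for the example.

```perl
use strict;
use warnings;

# Hypothetical helper: build one row per (document, token) pair holding
# the seven columns described above.
# Assumption: tokens are whitespace-separated and "word forms" are the
# distinct tokens of a document; the module may define both differently.
sub corpus_rows {
    my (%docs) = @_;    # document ID => text
    my (%tf, %cf, %df, %tokens_in);

    for my $id (keys %docs) {
        for my $token (split ' ', $docs{$id}) {
            $tf{$id}{$token}++;    # (4) frequency of t in d
            $cf{$token}++;         # (5) corpus frequency of t
            $tokens_in{$id}++;     # (2) number of tokens in d
        }
        # (6) each document counts once per distinct token it contains
        $df{$_}++ for keys %{ $tf{$id} || {} };
    }

    my @rows;
    for my $id (sort { $a <=> $b } keys %docs) {
        my $forms = $tf{$id} || {};
        for my $token (sort keys %$forms) {
            push @rows, [
                scalar(keys %$forms),    # (1) word forms in d
                $tokens_in{$id},         # (2)
                $id,                     # (3)
                $forms->{$token},        # (4)
                $cf{$token},             # (5)
                $df{$token},             # (6)
                $token,                  # (7)
            ];
        }
    }
    return @rows;
}

# Made-up sample corpus of two documents.
my @rows = corpus_rows(1 => 'rama vanam gacchati', 2 => 'rama rama phalam');
print join(',', @$_), "\n" for @rows;
```

The difference between columns (5) and (6) shows up for a token repeated within one document: every occurrence adds to the corpus frequency, while the document frequency grows by at most one per document.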
The main output file is named '1 (n + 5).txt' and is stored in the same directory as
the corpus, together with residual files for each input file, which carry the ad hoc extensions .txu and .txv.
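A line of the main output file could be read back along the following lines. This is a sketch under assumptions: the fields are taken to be comma-separated and in the column order listed above, and the field names and the helper parse_row are invented for this example. Text::ParseWords is used because it is already a dependency of the module.

```perl
use strict;
use warnings;
use Text::ParseWords qw(quotewords);

# Assumed field names, one per column of the output file.
my @COLUMNS = qw(word_forms tokens doc_id term_freq corpus_freq doc_freq token);

# Hypothetical helper: split one output line into a hash keyed by column
# name. Returns undef when the line does not have exactly seven fields.
sub parse_row {
    my ($line) = @_;
    chomp $line;
    my @fields = quotewords(',', 0, $line);
    return unless @fields == @COLUMNS;
    my %row;
    @row{@COLUMNS} = @fields;
    return \%row;
}

# Made-up sample line in the assumed format.
my $row = parse_row("3,3,1,1,3,2,rama\n");
print "$row->{token}: tf=$row->{term_freq}, df=$row->{doc_freq}\n" if $row;
```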
Example:
    use Text::Statistics::Devanagari;
    &devanagari("4"); # 3 texts, i.e. (4 - 1), will be analysed.
INSTALLATION
To install this module, type the following:
   perl Makefile.PL
   make
   make test
   make install
DEPENDENCIES
This module requires these other modules and libraries:
        utf8
        Text::ParseWords
SEE ALSO
        http://search.cpan.org/~ambs/
        http://search.cpan.org/~tpederse/
         
REFERENCES
(1) BERBER-SARDINHA, Tony. Lingüística de Corpus. Manole, 2004.
(2) http://www-csli.stanford.edu/~schuetze/information-retrieval-book.html
COPYRIGHT AND LICENCE
Copyright (C) 2007 by Rodrigo Panchiniak Fernandes
This code was written under CAPES BEX-09323-5
This library is free software; you can redistribute it and/or modify
it under the same terms as Perl itself, either Perl version 5.8.8 or,
at your option, any later version of Perl 5 you may have available.
