### Search results for "module:Fsdb::Filter::dbcolstats"

###
Fsdb::Filter::dbcolstats - compute statistics on a fsdb column

Compute statistics over a COLUMN of data. Records containing non-numeric data are considered null do not contribute to the stats (with the "-a" option they are treated as zeros). Confidence intervals are a t-test (+/- (t_{a/2})*s/sqrt(n)) and assume ...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbmultistats - run dbcolstats over each group of inputs identified by some key

The input table is grouped by KeyField, then we compute a separate set of column statistics on ValueField for each group with a unique key. Assumptions and requirements are the same as dbmapreduce (this program is just a wrapper around that program):...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbrowcount - count the number of rows in an Fsdb stream

Count the number of rows and write out a new fsdb file with one column (n) and one value: the number of rows. This program is a strict subset of dbcolstats. Although there are other ways to get a count of rows ("dbcolstats", or "dbrowaccumulate -C 1"...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbpipeline - allow db commands to be assembled as pipelines in Perl

This module makes it easy to create pipelines in Perl using separate processes. (In the past we used to use perl threads.) By default (as with all Fsdb modules), input is from STDIN and output to STDOUT. Two helper functions, fromfile and tofile can ...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbcolhisto - compute a histogram over a column of Fsdb data

This program computes a histogram over a column of data. Records containing non-numeric data are considered null do not contribute to the stats (optionally they are treated as zeros). Defaults to 10 buckets over the exact range of data. Up to three p...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbmapreduce - reduce all input rows with the same key

Group input data by KeyField, then apply a function (the "reducer") to each group. The reduce function can be an external program given by ReduceCommand and ReduceArguments, or an Perl subroutine given in CodeFile or FilterCode. If a "--" appears bef...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbrvstatdiff - evaluate statistical differences between two random variables

Produce statistics on the difference of sets of random variables. If a hypothesized difference is given (with "-h"), to does a Student's t-test. Random variables are specified by: "m1c", "m2c" The column names of means of random variables. "sd1c", "s...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbcolscorrelate - find the coefficient of correlation over columns

Compute the coefficient of correlation over two (or more) columns. The output is one line of correlations. With exactly two columns, a new column *correlation* is created. With more than two columns, correlations are computed for each pairwise combin...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbcolstatscores - compute z-scores or t-scores for each value in a population

Compute statistics (z-score and optionally t-score) over a COLUMN of numbers. Creates new columns called "zscore", "tscore". T-scores are only computed if requested with the "-t" option, or if "--tmean" or "--tstddev" are explicitly specified (defaul...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbcolsregression - compute linear regression between two columns

Compute linear regression over "column1" and "column2". Outputs slope, intercept, and correlation coefficient....

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC###
Fsdb::Filter::dbcolmovingstats - compute moving statistics over a window of a column of data

Compute moving statistics over a COLUMN of data. Records containing non-numeric data are considered null do not contribute to the stats (optionally they are treated as zeros with "-a"). Statitics are computed over a WINDOW of samples of data. [In pro...

JOHNH/Fsdb-3.0 - 04 Apr 2022 22:44:17 UTC