The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

splitCollection.pl - Perl script for spliting a collection of XML documents

SYNOPSIS

splitCollection.pl [--help] [--man] [--rcfile=file] [--lang=lang] [--params=params] [--format=format]

OPTIONS AND ARGUMENTS

--help brief help message
--man full documentation
--verbose go into the verbose mode
--file <fileame>, --file <fileame> file to split
--dir <dirname>, -d <dirname> directory containing the output files (by default, the current directory)
--prefix <prefix>, -p <prefix> prefix of the output filenames
--base <number>, -b <number> start number of the output filenames
--size <size>, -s <size> number of documents per output file

DESCRIPTION

The script splits a document collection in ALVIS XML format into several files in the same format. The ALVIS XML format is the format used by the Ogmios platform to load file and record linguistic annotations.

Intput file is given by the option --file. Output files are stored in a directory specified with the option --dir. Each output file has the prefix indicated by the option --prefix.

EXAMPLE

SplitCollection.pl -f examples/twodocs.xml -d examples -p subcoll -b 1 -s 1

AUTHOR

Thierry Hamon, <thierry.hamon@limsi.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2012 Thierry Hamon

This is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.4 or, at your option, any later version of Perl 5 you may have available.