NAME
splitCollection.pl - Perl script for splitting a collection of XML documents
SYNOPSIS
splitCollection.pl [--help] [--man] [--verbose] [--file=filename] [--dir=dirname] [--prefix=prefix] [--base=number] [--size=size]
OPTIONS AND ARGUMENTS
- --help brief help message
- --man full documentation
- --verbose enable verbose mode
- --file <filename>, -f <filename> file to split
- --dir <dirname>, -d <dirname> directory containing the output files (by default, the current directory)
- --prefix <prefix>, -p <prefix> prefix of the output filenames
- --base <number>, -b <number> first number used when numbering the output files
- --size <size>, -s <size> number of documents per output file
DESCRIPTION
The script splits a document collection in ALVIS XML format into several files in the same format. The ALVIS XML format is the format used by the Ogmios platform to load files and record linguistic annotations.
The input file is given by the option --file. Output files are stored in the directory specified with the option --dir. Each output file name starts with the prefix indicated by the option --prefix.
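The way --size, --base and --prefix interact can be illustrated with a minimal sketch. It is not the script's actual code: the helper names plan_split and output_name are hypothetical, the documents are stood in for by plain strings rather than parsed ALVIS XML records, and the naming scheme <dir>/<prefix><number>.xml is an assumption made only for illustration.

```perl
use strict;
use warnings;

# Hypothetical helper: group documents into consecutive chunks of at most
# $size items. Returns a list of [ $number, \@docs ] pairs, numbered from $base.
sub plan_split {
    my ($docs, $size, $base) = @_;
    die "size must be a positive integer\n" unless $size > 0;
    my @plan;
    my $number = $base;
    my @queue  = @$docs;    # work on a copy so the caller's array is untouched
    while (@queue) {
        # splice removes (and returns) up to $size elements from the front
        push @plan, [ $number++, [ splice(@queue, 0, $size) ] ];
    }
    return @plan;
}

# Assumed naming scheme (not confirmed by the script): <dir>/<prefix><number>.xml
sub output_name {
    my ($dir, $prefix, $number) = @_;
    return "$dir/$prefix$number.xml";
}

# Three placeholder documents, two per output file, numbering starts at 1.
my @docs = ('doc-A', 'doc-B', 'doc-C');
for my $chunk (plan_split(\@docs, 2, 1)) {
    my ($number, $members) = @$chunk;
    printf "%s: %s\n", output_name('examples', 'subcoll', $number),
        join(', ', @$members);
}
```

Under these assumptions, the last output file simply holds whatever documents remain when the collection size is not a multiple of --size.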
EXAMPLE
splitCollection.pl -f examples/twodocs.xml -d examples -p subcoll -b 1 -s 1
AUTHOR
Thierry Hamon, <thierry.hamon@limsi.fr>
COPYRIGHT AND LICENSE
Copyright (C) 2012 Thierry Hamon
This is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.4 or, at your option, any later version of Perl 5 you may have available.