The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

DTA::TokWrap::Processor::tcfencode - DTA tokenizer wrappers: TEI->TCF encoding

SYNOPSIS

 use DTA::TokWrap::Processor::tcfencode;
 
 $enc = DTA::TokWrap::Processor::tcfencode->new(%opts);
 $doc_or_undef = $dec->tcfencode($doc);

DESCRIPTION

DTA::TokWrap::Processor::tcfencode provides an object-oriented DTA::TokWrap::Processor wrapper for encoding (serialized) TEI-XML as TCF ("Text Corpus Format", cf. http://weblicht.sfs.uni-tuebingen.de/weblichtwiki/index.php/The_TCF_Format) using DTA::TokWrap::Document objects.

Constants

@ISA

DTA::TokWrap::Processor::tcfencode inherits from DTA::TokWrap::Processor.

Constructors etc.

new
 $obj = $CLASS_OR_OBJECT->new(%args);

Constructor.

defaults
 %defaults = $CLASS->defaults();

Static class-dependent defaults.

Methods

tcfencode
 $doc_or_undef = $CLASS_OR_OBJECT->tcfencode($doc);

Converts the DTA::TokWrap::Document object to TCF, storing the result as an XML::LibXML::Document in $doc->{tcfdoc}.

Relevant %$doc keys:

 xmlfile => $xmlfile, ##-- (input) source TEI-XML file
 txtfile => $txtfile, ##-- (input) serialized text file
 xmldata => $xmldata, ##-- (input,alternate) source TXT-XML buffer
 txtdata => $txtdata, ##-- (input,alternate) serialized text data
 ##
 tcfdoc  => $tcfdoc,   ##-- (output) TCF output document
 tcfencode_stamp0 => $f, ##-- (output) timestamp of operation begin
 tcfencode_stamp  => $f, ##-- (output) timestamp of operation end
 tcfdoc_stamp   => $f, ##-- (output) timestamp of operation end

SEE ALSO

DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...

SEE ALSO

DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...

AUTHOR

Bryan Jurish <moocow@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2014-2018 by Bryan Jurish

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.24.1 or, at your option, any later version of Perl 5 you may have available.