NAME

msdoc - module to replace MS documents by their text contents

VERSION

Version 0.01

SYNOPSIS

optex command -Mmsdoc

DESCRIPTION

This module replaces each argument that ends with .docx, .pptx or .xlsx by a node representing the file's text content. The file itself is not altered.

For example, you can check the text difference between MS Word files like this:

    $ optex diff -Mmsdoc OLD.docx NEW.docx

If you have a symbolic link named diff pointing to optex, and the following settings in your ~/.optex.d/diff.rc:

    option default --msdoc
    option --msdoc -Mmsdoc $<move>

The following command then produces the same result.

    $ diff OLD.docx NEW.docx

Text data is extracted by the greple command with the -Mmsdoc module, and the above command is almost equivalent to the following bash command using process substitution.

    $ diff <(greple -Mmsdoc --dump OLD.docx) \
           <(greple -Mmsdoc --dump NEW.docx)
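
The process-substitution pattern above can be wrapped in a small bash function. This is only a sketch: diff_via is a hypothetical helper name, not part of optex or greple, and it assumes the converter command writes plain text to standard output.

```shell
#!/usr/bin/env bash
# diff_via: hypothetical helper, not provided by optex or greple.
# Usage: diff_via OLD NEW CONVERTER [CONVERTER-ARGS...]
# Runs the converter on each file and compares the two text streams.
diff_via() {
    local old=$1 new=$2
    shift 2
    # "$@" now holds the converter command and its options.
    # Each <(...) runs the converter and hands diff a readable path
    # (for example /dev/fd/63) in place of the original file name.
    diff <("$@" "$old") <("$@" "$new")
}
```

With greple and its msdoc module installed, the command from the text would then be written as:

    $ diff_via OLD.docx NEW.docx greple -Mmsdoc --dump

The original files are only read by the converter, so they are left untouched, as with the module itself.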

SEE ALSO

https://github.com/kaz-utashiro/optex-msdoc

It is possible to use other data conversion programs, such as pandoc or "Apache Tika". Feel free to modify this module. I'm reluctant to use them because they run quite slowly.

LICENSE

Copyright (C) Kazumasa Utashiro.

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

AUTHOR

Kazumasa Utashiro