msdoc - module to replace MS document by its text contents
Version 0.01
optex command -Mmsdoc
This module replaces argument which terminate with .docx, pptx or xlsx files by node representing its text information. File itself is not altered.
For examle, you can check the text difference between ms word files like this:
$ optex diff -Mmsdoc OLD.docx NEW.docx
If you have symbolic link named diff to optex, and following setting in your ~/.optex.d/diff.rc:
option default --msdoc option --msdoc -Mmsdoc $<move>
Next command simply produces the same result.
$ diff OLD.docx NEW.docx
Text data is extracted by greple command with -Mmsdoc module, and above command is almost equivalent to below bash command using process substitution.
$ diff <(greple -Mmsdoc --dump OLD.docx) \ <(greple -Mmsdoc --dump NEW.docx)
https://github.com/kaz-utashiro/optex-msdoc
It is possible to use other data conversion program, like pandoc or "Apache Tika". Feel to free to modify this module. I'm reluctant to use them, because they work quite leisurely.
Copyright (C) Kazumasa Utashiro.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
Kazumasa Utashiro
To install App::optex::msdoc, copy and paste the appropriate command in to your terminal.
cpanm
cpanm App::optex::msdoc
CPAN shell
perl -MCPAN -e shell install App::optex::msdoc
For more information on module installation, please visit the detailed CPAN module installation guide.