Changes for version 1.02

  • Removed use of defined on a hash.
  • Minor improvements to the documentation.
  • Fixed encoding and parsing errors.
    • Now uses HTML::Encoding to detect the encoding.


Script to update the CNN news article corpus.


Make a corpus of CNN documents for research.
Parse a CNN article for research.