The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Changes for version 1.02 - 2013-05-14

  • Make ISO3166.pm a UTF-8 file, to match its use of =encoding utf8.
  • Add scripts/test.nfc.pl.
  • In the FAQ, clarify use of NFC() rather than NFD().
  • Add scripts/get.statoids.pl in case we need their FIPS 10-4 codes one day.
  • Add scripts/get.fips.pages.pl and data/List_of_FIPS_region_codes_*.html.
  • Add scripts/populate.fips.pages.pl, data/wikipedia.fips.codes.txt and data/wikipedia.fips.mismatch.log.
  • Update the POD re the new files.
  • Update the pre-reqs.

Documentation

Export the SQLite database as CSV
Export the SQLite database as HTML
Get http://en.wikipedia.org/wiki/ISO_3166-2.html & the 3-letter code page
Get all country pages from https://en.wikipedia.org/wiki/List_of_FIPS_region_codes_(A-C) etc.
Get all country pages from http://statoids.com/.
Get http://en.wikipedia.org.wiki.ISO_3166-2.$code2.html
Get http://en.wikipedia.org.wiki.ISO_3166-2.$code2.html for all missing $code2
Parse en.wikipedia.org.wiki.ISO_3166-2.html
Parse data/List_of_FIPS_region_codes_A-C.html etc
Parse en.wikipedia.org.wiki.ISO_3166-2.$code2.html for all missing $code2
Parse en.wikipedia.org.wiki.ISO_3166-2.$code2.html
Report some stats about the SQLite database

Modules

Gently scrape Wikipedia for ISO3166-2 data
The interface to www.scraper.wikipedia.iso3166.sqlite
Create/drop tables in www.scraper.wikipedia.iso3166.sqlite
Download various pages from Wikipedia
Export www.scraper.wikipedia.iso3166.sqlite as CSV and HTML
Part of the interface to www.scraper.wikipedia.iso3166.sqlite