Changes for version 2.00 - 2016-05-10

  • Switch from HTML::TreeBuilder to Mojo::DOM for parsing Wikipedia pages, now that Wikipedia has made significant changes to its page formats.
  • Change 'utf8' to 'UTF-8'.
  • Add the subcountry_categories table to the database, and a corresponding method for reading it.
  • Add the subcountry_info table to the database, and a corresponding method for reading it.
  • Add subcountry_info table and data/subcountry.info.csv.
  • Switch from obsolete ISO_3166-2.3.html to ISO_3166-1.html.
  • Add a column called 'number' to the countries table, derived from ISO_3166-1.html.
  • Add method Import.check_downloads().
  • Fix a couple of typos in the names of programs listed in the Description.
  • Switch from File::Slurp to File::Slurper.
  • Add an FAQ about the way SQLite allocates primary keys, and hence how you can force them to start from 1.
  • Drop all FIPS-related support.
  • Add data/www.scraper.wikipedia.iso3166.schema.svg for the database schema.
  • Add scripts/build.database.sh.
  • Add scripts/find.db.pl (but only run it after installation!).
  • Add scripts/report.Australian.statistics.pl.
  • Fix check.downloads.pl to call the right method.
  • Remove scripts/test.nfc.pl.
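
The 'utf8' to 'UTF-8' change above is terse, so here is a minimal sketch of why the spelling matters, assuming the entry refers to Perl encoding names (the changelog does not say where the name is used). It uses only the core Encode module to show that the two spellings resolve to different encoding objects. The file name in the final comment is hypothetical.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# Ask Encode which encoding object each spelling resolves to.
# 'utf8' is Perl's lax variant; 'UTF-8' is the strict,
# standards-conformant encoding, which Encode names 'utf-8-strict'.
my %canonical = map { $_ => find_encoding($_)->name } qw(utf8 UTF-8);

printf "%-6s => %s\n", $_, $canonical{$_} for sort keys %canonical;

# Typical use when reading a downloaded page (hypothetical file name):
# open(my $fh, '<:encoding(UTF-8)', 'data/page.html') or die $!;
```

The strict form rejects byte sequences that are not valid UTF-8, which is usually what you want when reading pages fetched from the web.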

Documentation

Report missing and unexpected files in data/
Export the SQLite database as CSV
Export the SQLite database as HTML
Get http://en.wikipedia.org/wiki/ISO_3166-[12].html
Get http://en.wikipedia.org/wiki/ISO_3166-2.$code2.html
Get http://en.wikipedia.org/wiki/ISO_3166-2.$code2.html for all missing $code2
Parse en.wikipedia.org.wiki.ISO_3166-2.html
Parse en.wikipedia.org.wiki.ISO_3166-2.$code2.html for all missing $code2
Parse en.wikipedia.org.wiki.ISO_3166-2.$code2.html
Report some stats about the SQLite database
Report some stats about the SQLite database

Modules

Gently scrape Wikipedia for ISO3166-2 data
The interface to www.scraper.wikipedia.iso3166.sqlite
Create/drop tables in www.scraper.wikipedia.iso3166.sqlite
Download various pages from Wikipedia
Export www.scraper.wikipedia.iso3166.sqlite as CSV and HTML
Part of the interface to www.scraper.wikipedia.iso3166.sqlite