US TIGER address data

Convert the TIGER/Line dataset of the US Census Bureau to SQL files that can be imported by Nominatim. The tables created in the Nominatim database are separate from the OpenStreetMap tables and are queried separately at search time.

The dataset is updated once per year. Downloading is prone to be slow (it can take a full day), and converting the data can take hours as well.

Replace '2018' with the current year throughout.
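Since the year appears in both the download URL and the archive name used in the steps below, it can help to keep it in one variable. The following is a minimal sketch, not part of the shipped scripts: `TIGER_YEAR`, `tiger_edges_url` and `tiger_archive_name` are hypothetical names, and the sketch assumes that other years follow the same URL pattern as 2018.

```sh
#!/bin/sh
# Hypothetical helpers (not part of this repository).
# Assumption: other releases use the same URL layout as TIGER2018.
TIGER_YEAR="${TIGER_YEAR:-2018}"

# Print the EDGES download URL for the release year given as $1.
tiger_edges_url() {
    printf 'ftp://ftp2.census.gov/geo/tiger/TIGER%s/EDGES/\n' "$1"
}

# Print the name of the packaged archive for the release year given as $1.
tiger_archive_name() {
    printf 'tiger%s-nominatim-preprocessed.tar.gz\n' "$1"
}

tiger_edges_url "$TIGER_YEAR"
tiger_archive_name "$TIGER_YEAR"
```

Running it with `TIGER_YEAR=2019` set in the environment prints the URL and archive name for that year instead. Any paths inside `convert.sh` still need to be adjusted by hand.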

  1. Install the GDAL library with its Python bindings, and the unzip tool.

    Ubuntu:

    sudo apt-get install python-gdal unzip

    CentOS:

    sudo yum install gdal-python unzip

  2. Get the TIGER 2018 data. You will need the EDGES files (3,233 zip files, 11GB total).

    wget -r ftp://ftp2.census.gov/geo/tiger/TIGER2018/EDGES/
    
  3. Convert the data into SQL statements. Adjust the file paths in the scripts as needed.

    cd data-sources/us-tiger
    ./convert.sh

  4. Optional: package the created files

    tar -czf tiger2018-nominatim-preprocessed.tar.gz tiger
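Because the download in step 2 is slow and can be interrupted, it may be worth checking that all files arrived before starting the conversion. The following is a minimal sketch under stated assumptions: `count_zips` and `check_edges_download` are hypothetical helpers, not part of the conversion scripts, and the expected count of 3,233 is the 2018 figure quoted above and may differ in other years.

```sh
#!/bin/sh
# Hypothetical helpers (not part of this repository).

# Print the number of .zip files found below the directory given as $1.
count_zips() {
    find "$1" -type f -name '*.zip' | wc -l | tr -d ' '
}

# Compare the number of .zip files below $1 with the expected count $2.
# Returns 0 if they match, 1 otherwise.
check_edges_download() {
    dir=$1
    expected=$2
    found=$(count_zips "$dir")
    if [ "$found" -eq "$expected" ]; then
        echo "OK: $found zip files"
    else
        echo "Incomplete: found $found of $expected zip files" >&2
        return 1
    fi
}
```

Assuming wget's default host-prefixed directory layout, the check for the 2018 data would be `check_edges_download ftp2.census.gov/geo/tiger/TIGER2018/EDGES 3233`. This only counts files and does not detect truncated archives. Running `unzip -tq` on each file is one way to test archive integrity as well.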