Skip to content

Commit

Permalink
Merge pull request #127 from easherma/patch-1
Browse files Browse the repository at this point in the history
Update README.md
  • Loading branch information
orangejulius authored Jun 21, 2016
2 parents 4c15418 + 96ca5bc commit 1e63764
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -52,6 +52,8 @@ inserting this duplicate data into Pelias, the importer can optionally run all
records through the [openvenues](https://github.com/openvenues) [address
deduplicator](https://github.com/openvenues/address_deduper).

**Note:** The address deduper is a Python/Flask app and defaults to running on port 5000. It must be running in order for deduplication to take place!

The address deduplicator will filter out records that are both nearby, and have
nearly identical values. The deduplicator knows to expand common abbreviations,
so `123 Main Street` and `123 Main St` would be matched as duplicates.
Expand All @@ -60,6 +62,8 @@ Like admin lookup, this process slows down the importer quite a bit, and
requires a large amount of disk space (it uses a LevelDB store to track which
addresses it has seen), so it defaults to off.



## Configuration
This importer can be configured in [pelias-config](https://github.com/pelias/config), in the `imports.openaddresses`
hash. A sample configuration file might look like:
Expand Down

0 comments on commit 1e63764

Please sign in to comment.