close
The Wayback Machine - https://web.archive.org/web/20201222155023/https://github.com/tonytw1/nominatim-ac
Skip to content
master
Go to file
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
sql
 
 
src
 
 
 
 
 
 
 
 
 
 
 
 

README.md

An experimental Open Street Map (OSM) / Nominatim place name auto-complete service.

Indexes the contents of a Nominatim 3.1 database into Elasticsearch version 6.

End points

/search

Parameter Description
q The user query. ie. London
tag Restrict results to a given OSM tag.
lat / lon / radius Restrict results to a given geocircle
country Restrict results to a given country code
callback The name of an optional JSONP callback to wrap the results in

Motivation

Allowing users to tag content with just a latitude/longitude point loses a lot of the context. Was the user referring to a country, a city or a specific building when they applied the tag?

Can OSM data be used to provide a user friendly auto complete place name service which preserves context by assigning a repeatable location id to each result?

Background

Many existing location lookup services resolve to a point location only. The Google Maps geocoding service provided some context information, but lacked persistant ids which can be advertised and referred back to at a later date (it now does).

Some services such as Twitter and Instagram exposed their own in-house location ids but data ids referred to wasn't available to 3rd party developers.

OSM does provide publicly accessible ids for locations. Resolving the fairly basic OSM schema into a hierarchy is a fairly complicated task. This has been tackled by the Nominatim project (http://wiki.openstreetmap.org/wiki/Nominatim).

This means we can infer things like 'if this content is tagged with that street, then it must also be relevant to this city and this country'. OSM ids are public so we can advertise them and be confident that other applications can make use of that information.

The Nominatim terms and conditions state that it shouldn't be used as an auto complete service. This is fair. Using Nominatim in this manner would result in a lot of expensive database queries.

Overview

This code base dumps out the contents of a populated Nominatim postgres instance into Elasticsearch and exposes it as a JSON web service.

This service is able to quickly respond to key stokes as the Nominatim output has been pre-rendered and indexed and does not need to be calculated in real time. The Elasticsearch index can be distributed to smaller machines than those needed to run a whole planet Nominatim instance.

The Nominatim place name and OSM id is made available in the JSON returned to clients. The calling application can now persist the OSM id of the selected result for future reference.

Implementation

Java / Spring Boot and Elasticsearch.

Reads from a populated Nominatim 3.4 postgres database and indexes into an Elasticsearch 6.8 index.

An example install containing the whole planet data set is available at https://nominatim-ac.eelpieconsulting.co.uk

Also see

photon (http://photon.komoot.de/) takes a similar approach and is production ready.

Nominatim AC will be maintained to further investigate the use of search 'profiles' to improve the user experience. ie. selectively excluding things like post boxes and bus stops.

Local development

This is a Spring Boot project with a Gradle build.

Configuration is in the file named application.properties.

The tests are expecting to see a Postgres Nominatim 3.4 schema containing a Sep 2019 Great Britain import on localhost port 6432 (which is where nominatim-docker would be).

Start Elasticsearch:

docker-compose -f docker-compose/docker-compose.yml up

Create Elasticsearch index:

bash elasticsearch/index.bash 
{"acknowledged":true,"shards_acknowledged":true,"index":"nominatimac"}

bash elasticsearch/mappings.bash 
{"acknowledged":true}

Build locally with:

./gradlew clean build -x test

Start locally

mv build/libs/nominatim-ac-0.1.0.jar .
java -jar nominatim-ac-0.1.0.jar 

Build index

curl http://localhost:8080/import

Installation

The Elasticsearch index is populated by reading from the Postgres database of a locally running Nominatim instance.

Installing a whole world Nominatim instances is a fairly large undertaking. From experience a full world Nominatim instance requires a machine with at least 32GB of RAM available. I struggled to build a consumer machine which could complete a full install. A full install was eventually successful using a HP DL360 server with 32Gb of RAM (single Xeon processor).

The initial Nominatim import of planet.osm took approximately 6 weeks to complete and consumed around 1TB of disk. The indexing the Nominatim postgres database into Elasticsearch takes around 48 hours. The Elasticsearch index is topped up every 5 minutes to capture updates feed into Nominatim via Osmosis (as per Nominatims' install instructions).

Mediagis have published a Nominatim Docker build which can be used for local development (https://github.com/mediagis/nominatim-docker).

About

Elasticsearch backed place name autocomplete based on Open Street Map Nominatim data

Topics

Resources

Releases

No releases published

Packages

No packages published
You can’t perform that action at this time.