Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-12-26 06:22:13 +03:00

Author	SHA1	Message	Date
Sarah Hoffmann	62b7670e0c	for postcodes use rank_search as base rank for finding addresses The rank_address reflects the position in the address which is usually lower than what one would expect for a postcode area.	2024-02-28 14:40:36 +01:00
Sarah Hoffmann	019a68a4bb	Merge pull request #3345 from lonvia/simplify-large-geometries Simplify very large polygons that are not used in addresses	2024-02-28 12:06:49 +01:00
Sarah Hoffmann	36b1660121	add support for new middle table format of osm2pgsql Functions are adapted according to the format detected from the osm2pgsql property table.	2024-02-27 18:18:19 +01:00
Sarah Hoffmann	56201feb28	simplify very large polygons non used in addresses Polygons with rank_address = 0 are only used in search and (rarely) for reverse lookup. Geometries do not need to be precise for that because topology does not matter. OSM has some very large polygons of natural features with sizes of more than 10MB. Simplify these polygons to keep the database and indexes smaller.	2024-02-27 10:16:18 +01:00
Sarah Hoffmann	4c19762e33	extratags should become null when empty Removing the artifical entries in the extratags may lead to an empty hstore. Set it to null in that case. Fixes #3055.	2024-02-08 10:21:48 +01:00
Sarah Hoffmann	9627352ee4	search postcodes for highway areas around the area So far the code would only accept postcodes that are inside the area. Fixes #3304.	2024-01-26 18:14:11 +01:00
Sarah Hoffmann	a873f260cf	fix merging of linked names into unnamed boundaries The NULL value of the boundaries' name field was erasing all content when used in SQL operations.	2023-06-30 22:14:11 +02:00
Sarah Hoffmann	645ea5a057	use information from tokenizer to determine street vs. place address So far the SQL logic used the information from the address field to determine if an address is attached to a street or place. This changes the logic to use the information provided in the token_info. This allows sanitizers to enforce a certain parenting without changing the visible address information.	2023-06-30 11:08:25 +02:00
Sarah Hoffmann	d574ceb598	restrict place rank inheritance to address items Place tags must have no influence on street- or POI-level objects.	2023-02-17 16:25:26 +01:00
Sarah Hoffmann	922352e215	do not assign postcodes to long linear features This avoids a postcode in particular for waterway features and long natural featues like ridges and valleys. Fixes #2915.	2022-12-10 14:53:08 +01:00
Sarah Hoffmann	4f05a03d13	handle associatedStreet relations with multiple streets When a associatedStreet relation has multiple street members always take the closest one. Avoid geometry operations for the frequent case that there is only one street.	2022-11-16 17:25:51 +01:00
Sarah Hoffmann	abf349fb0d	simplify use of secondary importance The values in the raster are already normalized between 0 and 2**16, so a simple conversion to [0, 1] will do. Check for existance of secondary_importance table statically when creating the SQL function. For that to work importance tables need to be created before the functions.	2022-10-01 11:01:49 +02:00
Tareq Al-Ahdal	0ab0f0ea44	Integrated OSM views into importance computation	2022-10-01 11:01:49 +02:00
Sarah Hoffmann	f017e1e9a1	make sure indexes are used	2022-09-25 14:09:45 +02:00
Sarah Hoffmann	33ba6896a8	further split up the big geometry index Adds partial indexes for all geometry queries used during import. A full index is not necessary anymore at that point. Still create the index afterwards for use in queries. Also adds documentation for all indexes on where they are used.	2022-09-21 16:21:41 +02:00
Sarah Hoffmann	aef014a47d	add indexes for lookup of addressable areas The generic geometry index has become to slow for that purpose.	2022-09-18 16:57:12 +02:00
Sarah Hoffmann	487e81fe3c	more invalidations when boundary changes rank When a boundary or place changes its address rank, all places where it participates as address need to be potentially reindexed. Also use the computed rank when testing place nodes against boundaries. Boundaries are computed earlier. Fixes #2794.	2022-08-12 09:48:46 +02:00
Sarah Hoffmann	b19c90b9a6	export centroid to tokenizer May come in handy when developping sanitizers for an area smaller than country size.	2022-07-31 22:10:58 +02:00
Kian-Meng Ang	f5e52e748f	docs: fix typos	2022-07-20 22:05:31 +08:00
Sarah Hoffmann	b7704833e4	icu: switch postcodes to using the pre-formatted one	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	f833cc80df	use default ranks when reorganising rank_address When shifting address ranks, the evaluation is always done against unshifted address ranks on import because the objects we compare against have not been indexed yet. This changes for updates when the object have been touched in the meantime. To ensure consistent behaviour across imports and updates, always use the unshifted address ranks.	2022-06-16 11:20:23 +02:00
Sarah Hoffmann	df0142678a	improve address ordering with mixes of place and admin areas Resolves a couple of situations where a mixed use of places areas and administrative boundaries would result in a hierarchy that did not properly respect the contains relation.	2022-06-16 10:44:16 +02:00
Sarah Hoffmann	15cf7dd416	add testcase for #2551 This test proves that places that are linked need to be reindexed.	2022-06-05 21:39:17 +02:00
Sarah Hoffmann	2c05fc858a	fix rank inheritance from linked places When taking over the address rank from a linked place, it needs to be the originally computed rank, not the one that might have been adjusted in the meantime. The adjustment was made under the assumption that the node is not linked.	2022-06-05 19:38:14 +02:00
Sarah Hoffmann	1d203fdb3c	fix bug with keeping linking on updates When moving the finding of linked places to the precomputation stage, it was also moved before the statement where the linked_place_id was removed from the linkee. The result was that the current linkee was excluded when looking for a linked place on updates because it was still linked to the boundary to be updated. Fixed by allowing to either keep the linkage or change to an unlinked place.	2022-05-23 10:55:10 +02:00
Sarah Hoffmann	08672cdf0a	explicit cast for osm_type parameter in SQL needed Otherwise PostgreSQL won't correctly pick up the index condition.	2022-05-02 14:12:17 +02:00
Sarah Hoffmann	372874e89a	accept any OSM type in street member of associatedStreet This is needed for pedestrian areas mapped as multipolygons and consequently as relations. The lookup in placex guarantees that the referenced OSM object is indeed a street. Fixes #2669.	2022-05-02 09:48:51 +02:00
Sarah Hoffmann	3c68b12176	keep inherited address parts after indexing The inherited housenumber is needed for display output. We can't take the one from the housenumber field because it is already normalized. Remove the inherited address only when reindexing. Fixes #2683.	2022-04-28 21:38:00 +02:00
Sarah Hoffmann	784dad866f	change distance computation between place and address part Instead of computing the distance to the centroid of the area compute the distance of the area to the centroid of the feature. This means we give preference to the area that covers the centroid. It's still a heuristics but one that is a bit less random.	2022-04-22 14:32:09 +02:00
Sarah Hoffmann	524dc64ab7	make sure outputs take into account linked place names	2022-03-16 21:44:52 +01:00
Sarah Hoffmann	42cd021d04	save differing linked polace names in extra fields This keeps the names tracable and ensures that all names are searchable when they differ. Do not keep names when they are exactly the same to save some space. Linked names are cleaned out before relinking.	2022-03-16 16:38:52 +01:00
Sarah Hoffmann	15beeef6ce	do not expand records in select list An expression of the form 'SELECT (func()).*' will be expanded by Postgresql _before_ execution with the result that the function will be called as many times as there are fields in the record. This is not what we want. The function call needs to go into the FROM clause instead.	2022-03-01 09:34:32 +01:00
Sarah Hoffmann	a9e3329c39	country_name: use separate columns for names from OSM This allows us to distinguish between base names and imported ones and consiquently removing imported ones if necessary.	2022-02-23 09:23:06 +01:00
Sarah Hoffmann	fea4dbba50	inherit tags from interpolation not parent Nodes on an interpolation now only get the address tags of interpolations and then compute their own parent from that. They no longer inherit the parent directly.	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	9f64c34f1a	optimize indexes for interpolation lines Do not index 'inactive' rows (with startnumber is null) where possible.	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	638ed15ada	improve handling von updates on nodes in interpolations Use the same update mechanism as for updates on the interpolations themselves. Updates must solely happen in place_insert as this is the place where actual changes of the data happen.	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	c3788d765e	add consistent SPDX copyright headers	2022-01-03 16:23:58 +01:00
Sarah Hoffmann	be65c8303f	export more data for the tokenizer name preparation Adds class, type, country and rank to the exported information and removes the rather odd hack for countries. Whether a place represents a country boundary can now be computed by the tokenizer.	2021-09-29 11:54:14 +02:00
Sarah Hoffmann	59fe74ddf6	move name matching into tokenizer module Instead of requesting the match tokens from the tokenizer when looking for parent streets/places and address parts, hand in the saved tokens and ask if they match. This gives the tokenizer more freedom to decide how name matching should be done.	2021-09-27 11:36:19 +02:00
Sarah Hoffmann	28ee3d0949	move linking of places to the preparation stage Linked places may bring in extra names. These names need to be processed by the tokenizer. That means that the linking needs to be done before the data is handed to the tokenizer. Move finding the linked place into the preparation stage and update the name fields. Everything else is still done in the indexing stage.	2021-08-20 22:44:17 +02:00
Sarah Hoffmann	f74dc38766	always compute guessed postcode for POIs from centroid When guessing postcodes from the area, only postcodes within that area are accepted. For POIs that is usually not what we want as the postcode would have to be within a house for example. Fixes #2301.	2021-05-26 11:15:13 +02:00
Sarah Hoffmann	0da481f207	remove debug code	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	d75a235c1f	use address tokens in SQL	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	ffc2d82b0e	move postcode normalization into tokenizer	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	d8ed1bfc60	move houseunumber handling to tokenizer Normalization and token computation are now done in the tokenizer. The tokenizer keeps a cache to the hundred most used house numbers to keep the numbers of calls to the database low.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	d711f5a81e	move name token creation into tokenizer Name tokens are now handed in via token_info and used from there. Also moves the generic search name insertion function back to placex_triggers.sql.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	1b1ed820c3	introduce index for finding surrounding buildings	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	9397bf54b8	introduce external processing in indexer Indexing is now split into three parts: first a preparation step that collects the necessary information from the database and returns it to Python. In a second step the data is transformed within Python as necessary and then returned to the database through the usual UPDATE which now not only sets the indexed_status but also other fields. The third step comprises the address computation which is still done inside the update trigger in the database. The second processing step doesn't do anything useful yet.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	e7266b52ae	simplify name matching between boundary and place node Instead of normalising the names simply compare them in lower case. This removes the dependency on the tokenizer for linking boundaries and nodes. When looking up the linked places by place type also allow that one name is simply contained in the other. This catches the frequent case where one of the names has an addendum (e.g. Newport vs. City of Newport). Drops the special index for the name lookup and insted relies on a slightly extended version of the geometry index used for reverse lookup. Saves around 100MB on a planet.	2021-04-14 17:52:59 +02:00
Sarah Hoffmann	6cbef84cad	use new transliteration in initial housenumber word computation The new create_housenumber_id() function splits housenumber lists correctly. Otherwise there is no difference.	2021-04-04 15:26:47 +02:00

1 2

54 Commits