Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-11-23 05:35:13 +03:00

Author	SHA1	Message	Date
Kian-Meng Ang	f5e52e748f	docs: fix typos	2022-07-20 22:05:31 +08:00
Sarah Hoffmann	e6775e713c	add typing information to DB properties	2022-07-18 09:47:57 +02:00
Sarah Hoffmann	bc63f10057	fix syntax error with tablespaces	2022-06-30 09:19:16 +02:00
Sarah Hoffmann	0f00f4968c	fix up BDD tests for postcode changes Includes smaller code fixes found by the tests.	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	37b2c6a830	port legacy tokenizer to new postcode handling Also documents the changes to the SQL functions of the tokenizer.	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	b7704833e4	icu: switch postcodes to using the pre-formatted one	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	ca7b46511d	introduce and use analyzer for postcodes	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	f833cc80df	use default ranks when reorganising rank_address When shifting address ranks, the evaluation is always done against unshifted address ranks on import because the objects we compare against have not been indexed yet. This changes for updates when the object have been touched in the meantime. To ensure consistent behaviour across imports and updates, always use the unshifted address ranks.	2022-06-16 11:20:23 +02:00
Sarah Hoffmann	df0142678a	improve address ordering with mixes of place and admin areas Resolves a couple of situations where a mixed use of places areas and administrative boundaries would result in a hierarchy that did not properly respect the contains relation.	2022-06-16 10:44:16 +02:00
Sarah Hoffmann	15cf7dd416	add testcase for #2551 This test proves that places that are linked need to be reindexed.	2022-06-05 21:39:17 +02:00
Sarah Hoffmann	2c05fc858a	fix rank inheritance from linked places When taking over the address rank from a linked place, it needs to be the originally computed rank, not the one that might have been adjusted in the meantime. The adjustment was made under the assumption that the node is not linked.	2022-06-05 19:38:14 +02:00
Sarah Hoffmann	bd0e157b91	fix order when searching for addr:* components When matching addr:* components the preference was given to matches that do not intersect with the place.	2022-05-31 16:57:37 +02:00
Sarah Hoffmann	1d203fdb3c	fix bug with keeping linking on updates When moving the finding of linked places to the precomputation stage, it was also moved before the statement where the linked_place_id was removed from the linkee. The result was that the current linkee was excluded when looking for a linked place on updates because it was still linked to the boundary to be updated. Fixed by allowing to either keep the linkage or change to an unlinked place.	2022-05-23 10:55:10 +02:00
Sarah Hoffmann	739fe1c2c4	no longer allow fuzzy assignment of country The fallback country boundaries already contain a sufficiently large part of the water area, so there is no need to extend the country assignment even more. Features outside countries should not show a country in their address.	2022-05-11 11:54:25 +02:00
Sarah Hoffmann	08672cdf0a	explicit cast for osm_type parameter in SQL needed Otherwise PostgreSQL won't correctly pick up the index condition.	2022-05-02 14:12:17 +02:00
Sarah Hoffmann	372874e89a	accept any OSM type in street member of associatedStreet This is needed for pedestrian areas mapped as multipolygons and consequently as relations. The lookup in placex guarantees that the referenced OSM object is indeed a street. Fixes #2669.	2022-05-02 09:48:51 +02:00
Sarah Hoffmann	3c68b12176	keep inherited address parts after indexing The inherited housenumber is needed for display output. We can't take the one from the housenumber field because it is already normalized. Remove the inherited address only when reindexing. Fixes #2683.	2022-04-28 21:38:00 +02:00
Sarah Hoffmann	a515761193	further tweaking of address distance For point features, keep using the distance to centroid. For area features, add a tie breaker for the case where the center point falls on the boundary.	2022-04-22 14:32:19 +02:00
Sarah Hoffmann	784dad866f	change distance computation between place and address part Instead of computing the distance to the centroid of the area compute the distance of the area to the centroid of the feature. This means we give preference to the area that covers the centroid. It's still a heuristics but one that is a bit less random.	2022-04-22 14:32:09 +02:00
Tareq Al-Ahdal	943e5fe699	Revert the removal of new line at the end of the file	2022-03-18 06:07:48 +08:00
Tareq Al-Ahdal	83b4b8d9c1	reattach 'name:' prefix to keys	2022-03-18 05:46:23 +08:00
Tareq Al-Ahdal	90ac15748e	fix comment	2022-03-18 02:38:04 +08:00
Tareq Al-Ahdal	6be2077d92	Merge branch 'master' into country-names-yaml-configuration	2022-03-18 02:36:12 +08:00
Tareq Al-Ahdal	456d439e97	Reformatting of country keys	2022-03-18 02:23:11 +08:00
Sarah Hoffmann	524dc64ab7	make sure outputs take into account linked place names	2022-03-16 21:44:52 +01:00
Sarah Hoffmann	42cd021d04	save differing linked polace names in extra fields This keeps the names tracable and ensures that all names are searchable when they differ. Do not keep names when they are exactly the same to save some space. Linked names are cleaned out before relinking.	2022-03-16 16:38:52 +01:00
Sarah Hoffmann	15beeef6ce	do not expand records in select list An expression of the form 'SELECT (func()).*' will be expanded by Postgresql _before_ execution with the result that the function will be called as many times as there are fields in the record. This is not what we want. The function call needs to go into the FROM clause instead.	2022-03-01 09:34:32 +01:00
Sarah Hoffmann	a6903651fc	add framework for analysing housenumbers This lays the groundwork for adding variants for housenumbers. When analysis is enabled, then the 'word' field in the word table is used as usual, so that variants can be created. There will be only one analyser allowed which must have the fixed name '@housenumber'.	2022-03-01 09:34:32 +01:00
Sarah Hoffmann	a9e3329c39	country_name: use separate columns for names from OSM This allows us to distinguish between base names and imported ones and consiquently removing imported ones if necessary.	2022-02-23 09:23:06 +01:00
Sarah Hoffmann	85d65a2fd2	create idx_place_interpolations for import already It is needed to look up if a node is part of an interpolation. Fixes #2608.	2022-02-18 11:11:22 +01:00
Sarah Hoffmann	6b9fea6f1a	disable debug message in interpolation processing	2022-02-07 23:30:25 +01:00
Sarah Hoffmann	fbc8884693	restrict change propagation to interpolation lines Also means that Postgresql will use the right index for the query.	2022-01-28 11:05:37 +01:00
Sarah Hoffmann	64abc90d30	use new tiger step column for queries	2022-01-27 14:08:08 +01:00
Sarah Hoffmann	788505095e	add step column to tiger data table This replaces the interpolationtype column.	2022-01-27 11:54:12 +01:00
Sarah Hoffmann	4b28b4fed4	adapt BDD tests for new interpolation style	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	fea4dbba50	inherit tags from interpolation not parent Nodes on an interpolation now only get the address tags of interpolations and then compute their own parent from that. They no longer inherit the parent directly.	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	9f64c34f1a	optimize indexes for interpolation lines Do not index 'inactive' rows (with startnumber is null) where possible.	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	638ed15ada	improve handling von updates on nodes in interpolations Use the same update mechanism as for updates on the interpolations themselves. Updates must solely happen in place_insert as this is the place where actual changes of the data happen.	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	c0d8b95f67	update interpolations instead of deleting and recreating	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	b44493e7f2	reorganise place_insert trigger Code cleanup and formatting as well as minor improvements, in particular removal of unnecessary code.	2022-01-24 09:12:50 +01:00
Sarah Hoffmann	c3788d765e	add consistent SPDX copyright headers	2022-01-03 16:23:58 +01:00
Sarah Hoffmann	5e435b41ba	ICU: matching any street name will do again	2021-12-06 14:26:08 +01:00
Sarah Hoffmann	b1d490ea53	add index for Tiger housenumber queries	2021-11-24 11:10:20 +01:00
Sarah Hoffmann	85797acf1e	ICU: add an index over word_ids Needed for keyword lookup in the details response.	2021-10-25 21:33:27 +02:00
Sarah Hoffmann	e8e2502e2f	make word recount a tokenizer-specific function	2021-10-19 11:21:16 +02:00
Sarah Hoffmann	3649487f5e	use SP-GIST index for building index where available Point-in-polygon queries are much faster with a SP-GIST geometry index, so use that for the index used to check if a housenumber is inside a building. Only available with Postgis 3. There is an automatic fallback to GIST for Postgis 2.	2021-10-10 21:55:38 +02:00
Sarah Hoffmann	be65c8303f	export more data for the tokenizer name preparation Adds class, type, country and rank to the exported information and removes the rather odd hack for countries. Whether a place represents a country boundary can now be computed by the tokenizer.	2021-09-29 11:54:14 +02:00
Sarah Hoffmann	40f9d52ad8	Merge pull request #2454 from lonvia/sort-out-token-assignment-in-sql ICU tokenizer: switch match method to using partial terms	2021-09-28 09:45:15 +02:00
Sarah Hoffmann	bd7c7ddad0	icu tokenizer: switch to matching against partial names When matching address parts from addr:* tags against place names, the address names where so far converted to full names and compared those to the place names. This can become problematic with the new ICU tokenizer once we introduce creation of different variants depending on the place name context. It wouldn't be clear which variant to produce to get a match, so we would have to create all of them. To work around this issue, switch to using the partial terms for matching. This introduces a larger fuzziness between matches but that shouldn't be a problem because matching is always geographically restricted. The search terms created for address parts have a different problem: they are already created before we even know if they are going to be used. This can lead to spurious entries in the word table, which slows down searching. This problem can also be circumvented by using only partial terms for the search terms. In terms of searching that means that the address terms would not get the full-word boost, but given that the case where an address part does not exist as an OSM object should be the exception, this is likely acceptable.	2021-09-27 11:36:19 +02:00
Sarah Hoffmann	59fe74ddf6	move name matching into tokenizer module Instead of requesting the match tokens from the tokenizer when looking for parent streets/places and address parts, hand in the saved tokens and ask if they match. This gives the tokenizer more freedom to decide how name matching should be done.	2021-09-27 11:36:19 +02:00

1 2 3

105 Commits