Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-12-25 05:52:32 +03:00

Author	SHA1	Message	Date
miku0	67706cec4e	add @fail-legacy	2023-07-27 07:33:53 +00:00
miku0	0722495434	add japanese sanitizer	2023-07-26 07:54:58 +00:00
Sarah Hoffmann	a873f260cf	fix merging of linked names into unnamed boundaries The NULL value of the boundaries' name field was erasing all content when used in SQL operations.	2023-06-30 22:14:11 +02:00
Sarah Hoffmann	2d05ff0190	slightly adapt postcode tests	2023-06-22 16:51:59 +02:00
Sarah Hoffmann	8f299838f7	fix various failing BDD tests	2023-05-26 15:08:48 +02:00
Sarah Hoffmann	60c1301fca	fix a number of corner cases with interpolation splitting Snapping a line to a point before splitting was meant to ensure that the split point is really on the line. However, ST_Snap() does not always behave well for this case. It may shorten the interpolation line in some cases with the result that two points housenumbers suddenly fall on the same point. It might also shorten the line down to a single point which then makes ST_Split() crash. Switch to a combination of ST_LineLocatePoint and ST_LineSubString instead, which guarantees to keep the original geometry. Explicitly handle the corner cases, where the split point falls on the beginning or end of the line.	2023-04-06 16:54:00 +02:00
Sarah Hoffmann	3f2296e3ea	bdd: extend reverse API tests for format checks Reorganise the API reverse tests and extend the checks for the output format, testing for all expected fields.	2023-03-09 20:20:50 +01:00
Sarah Hoffmann	01010e443f	bdd: remove special case for osm_type field The fuzzy field check hide cover formatting errors. Use 'osm' when only caring about the conent.	2023-03-09 17:44:34 +01:00
Sarah Hoffmann	d574ceb598	restrict place rank inheritance to address items Place tags must have no influence on street- or POI-level objects.	2023-02-17 16:25:26 +01:00
Sarah Hoffmann	929a13d4cd	remove comma as name separator Commas are most of the time used as a part of a name, not to separate multiple names. See also #2950.	2023-01-22 22:29:36 +01:00
Sarah Hoffmann	c9ff7d2130	drop illegal values for addr:interpolation on update	2022-11-18 17:26:56 +01:00
Sarah Hoffmann	4f05a03d13	handle associatedStreet relations with multiple streets When a associatedStreet relation has multiple street members always take the closest one. Avoid geometry operations for the frequent case that there is only one street.	2022-11-16 17:25:51 +01:00
Sarah Hoffmann	dddfa3a075	ignore irrelevant extra tags on address interpolations When deciding if an address interpolation has address information, only look for addr:street and addr:place. If they are not there go looking for the address on the address nodes. Ignores irrelevant tags like addr:inclusion. Fixes #2797.	2022-08-13 14:07:06 +02:00
Sarah Hoffmann	487e81fe3c	more invalidations when boundary changes rank When a boundary or place changes its address rank, all places where it participates as address need to be potentially reindexed. Also use the computed rank when testing place nodes against boundaries. Boundaries are computed earlier. Fixes #2794.	2022-08-12 09:48:46 +02:00
Sarah Hoffmann	3dd7410bb7	bdd: correctly skip postcode tests for legacy	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	6eb9044353	adapt search algorithm to new postcode format in word	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	0f00f4968c	fix up BDD tests for postcode changes Includes smaller code fixes found by the tests.	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	8080625747	remove postcodes from countries that don't have them The postcodes will only be removed as a 'computed postcode' they are still searchable for the given object.	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	6c58a4c46c	bdd: move query tests from scene to grid description	2022-06-17 11:54:18 +02:00
Sarah Hoffmann	00d8df6fc3	bdd: move update tests from scenes to grid descriptions	2022-06-17 11:54:18 +02:00
Sarah Hoffmann	02068aec7f	bdd: move import tests from scenes to grid descriptions	2022-06-17 11:54:18 +02:00
Sarah Hoffmann	a2b486a5b0	bdd: allow to set an origin of the grid	2022-06-17 11:54:18 +02:00
Sarah Hoffmann	df0142678a	improve address ordering with mixes of place and admin areas Resolves a couple of situations where a mixed use of places areas and administrative boundaries would result in a hierarchy that did not properly respect the contains relation.	2022-06-16 10:44:16 +02:00
Sarah Hoffmann	15cf7dd416	add testcase for #2551 This test proves that places that are linked need to be reindexed.	2022-06-05 21:39:17 +02:00
Sarah Hoffmann	bd0e157b91	fix order when searching for addr:* components When matching addr:* components the preference was given to matches that do not intersect with the place.	2022-05-31 16:57:37 +02:00
Sarah Hoffmann	1d203fdb3c	fix bug with keeping linking on updates When moving the finding of linked places to the precomputation stage, it was also moved before the statement where the linked_place_id was removed from the linkee. The result was that the current linkee was excluded when looking for a linked place on updates because it was still linked to the boundary to be updated. Fixed by allowing to either keep the linkage or change to an unlinked place.	2022-05-23 10:55:10 +02:00
Sarah Hoffmann	372874e89a	accept any OSM type in street member of associatedStreet This is needed for pedestrian areas mapped as multipolygons and consequently as relations. The lookup in placex guarantees that the referenced OSM object is indeed a street. Fixes #2669.	2022-05-02 09:48:51 +02:00
Sarah Hoffmann	3c68b12176	keep inherited address parts after indexing The inherited housenumber is needed for display output. We can't take the one from the housenumber field because it is already normalized. Remove the inherited address only when reindexing. Fixes #2683.	2022-04-28 21:38:00 +02:00
Sarah Hoffmann	36a1560117	add migration to mark internal country names	2022-03-31 15:55:20 +02:00
Sarah Hoffmann	e133476c35	merge linked names correctly into namedetails Convert the '_place_' entries back to normal entries before returning them in the 'namedetails' section. If the name field is duplicated, kept the '_place_' notation. This preserves the previous behaviour before _place_ names were introduces but adds the additional names from the linked place for reference.	2022-03-17 11:02:02 +01:00
Sarah Hoffmann	524dc64ab7	make sure outputs take into account linked place names	2022-03-16 21:44:52 +01:00
Sarah Hoffmann	42cd021d04	save differing linked polace names in extra fields This keeps the names tracable and ensures that all names are searchable when they differ. Do not keep names when they are exactly the same to save some space. Linked names are cleaned out before relinking.	2022-03-16 16:38:52 +01:00
Sarah Hoffmann	ef98a85b05	correctly handle single-point interpolations in reverse Lookup in location_property_osmline needs to be special cased for startnumber = endnumber. Also adds tests for the case. Fixes #2680.	2022-03-16 11:19:09 +01:00
Sarah Hoffmann	89e1446131	bdd: disable some housenumber tests for legacy Optional spaces in housenumbers are not supported by legacy tokenizer, so disable those tests.	2022-03-01 09:34:32 +01:00
Sarah Hoffmann	f03a05f6bb	add new analyser for houenumbers This analyser makes spaces optional.	2022-03-01 09:34:32 +01:00
Sarah Hoffmann	1d82569f6d	add tests for country updates	2022-02-24 16:18:49 +01:00
Sarah Hoffmann	f74228830d	bdd: run full import on tests This uncovered a couple of outdated/wrong tests which have been fixed, too.	2022-02-24 14:27:51 +01:00
Sarah Hoffmann	0e11ca9b76	add test that interpolations are found by odd/even	2022-02-10 11:23:51 +01:00
Sarah Hoffmann	a79a3210e6	implement is-a-name option for housenumbers	2022-02-07 09:27:11 +01:00
Sarah Hoffmann	6b89624f33	adapt frontend to new interpolation table layout	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	4b28b4fed4	adapt BDD tests for new interpolation style	2022-01-27 11:14:55 +01:00
Sarah Hoffmann	206ee87188	factor out housenumber splitting into sanitizer	2022-01-19 17:27:50 +01:00
Sarah Hoffmann	b453b0ea95	introduce mutation variants to generic token analyser Mutations are regular-expression-based replacements that are applied after variants have been computed. They are meant to be used for variations on character level. Add spelling variations for German umlauts.	2022-01-18 11:09:21 +01:00
Sarah Hoffmann	f9b56a8581	correctly match abbreviated addr:street This only works when addr:street is abbreviated and the street name isn't. It does not work the other way around.	2021-12-08 21:58:43 +01:00
Sarah Hoffmann	5e435b41ba	ICU: matching any street name will do again	2021-12-06 14:26:08 +01:00
Sarah Hoffmann	80e0a3cce4	change default rank for highway objects to 30 The highway key is being used more and more for non-ways these days. This clashes with Nominatim's assumption that essentially everything that has a highway tag can be used as the street part of the address. Change the default rank of highway objects to 30 to avoid this. Only the known values for streets keep the rank 26 and are now listed explicitly.	2021-11-24 22:10:40 +01:00
Sarah Hoffmann	1722fc537f	bdd: add tests for non-latin scripts	2021-10-26 17:29:03 +02:00
Sarah Hoffmann	97a10ec218	apply variants by languages Adds a tagger for names by language so that the analyzer of that language is used. Thus variants are now only applied to names in the specific language and only tag name tags, no longer to reference-like tags.	2021-10-06 11:09:54 +02:00
Sarah Hoffmann	40f9d52ad8	Merge pull request #2454 from lonvia/sort-out-token-assignment-in-sql ICU tokenizer: switch match method to using partial terms	2021-09-28 09:45:15 +02:00
Sarah Hoffmann	bd7c7ddad0	icu tokenizer: switch to matching against partial names When matching address parts from addr:* tags against place names, the address names where so far converted to full names and compared those to the place names. This can become problematic with the new ICU tokenizer once we introduce creation of different variants depending on the place name context. It wouldn't be clear which variant to produce to get a match, so we would have to create all of them. To work around this issue, switch to using the partial terms for matching. This introduces a larger fuzziness between matches but that shouldn't be a problem because matching is always geographically restricted. The search terms created for address parts have a different problem: they are already created before we even know if they are going to be used. This can lead to spurious entries in the word table, which slows down searching. This problem can also be circumvented by using only partial terms for the search terms. In terms of searching that means that the address terms would not get the full-word boost, but given that the case where an address part does not exist as an OSM object should be the exception, this is likely acceptable.	2021-09-27 11:36:19 +02:00

1 2 3 4

161 Commits