Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-11-27 00:49:55 +03:00

Author	SHA1	Message	Date
Sarah Hoffmann	5cf573c340	use a typed record for place info in get_addressdata Older versions of Postgresql cannot handle an untyped record for INTO. Fixes #2100.	2020-12-12 14:46:34 +01:00
Sarah Hoffmann	65d8770b28	update country_names from OSM data Update names in the coutry_names table on the fly from incomming OSM country data. Adding a small sanity check that the country must be an OSM relation and within the area where we expect the country to be.	2020-12-09 11:38:19 +01:00
Sarah Hoffmann	e20defeebd	avoid contains operator for geometries Postgis keeps messing up use of index in some circumstances.	2020-12-02 22:18:27 +01:00
Sarah Hoffmann	987d60ccda	place nodes can only be linked once against boundaries If a place node is already linked against a boundary, it should not be used for linking again. It is usually a sign of a mapping error, when there are multiple boundary candidates. This change just avoids inconsistent data in the database, it does not guarantee that the linking is against the more correct boundary.	2020-12-02 15:31:02 +01:00
Sarah Hoffmann	7d520bf448	also add explicit cast for varchar	2020-12-01 22:15:51 +01:00
Sarah Hoffmann	63544db8f9	null entries need to be typed	2020-12-01 14:54:42 +01:00
Sarah Hoffmann	7295cad715	compute address parts for rank 30 objects on the fly Rank 30 objects usually use the address parts of their parent. When the parent has address parts that are areas but not marked as isaddress, then the parent might go through multiple administrative areas. In that case recheck if the right area has been choosen for the object in question instead of relying on isaddress. Note that we really only have to do the recomputation in the case of 'isarea = True and isaddress = False' which hopefully keeps the number of additional geometric operations we have to do to a minimum. There is one more special case to be taken into account here: a street may go through two administrative areas and a house along that street is placed in one of the area while the addr:* tags says it belongs to the other. In that case we must not switch the isaddress to the one it is situated. To avoid that recheck the address names against the name of the ara. That is not perfect but should cover most cases. Fixes #328.	2020-12-01 11:58:25 +01:00
Sarah Hoffmann	ff85da0a31	cleanup get_addressdata Save location data in a ROW instead of using separate varaibles for each value.	2020-11-30 22:54:36 +01:00
Sarah Hoffmann	2db751700e	restrict size of features that get a full address search It would be nice to always compute addresses for rank 0 objects over the complete geometry, so that they can be found via all the admin boundaries that they intersect. However, there are a couple of extramely large boundaries in OSM (like timezones) where this results in thousands of possible address candidates that need to be checked. Fall back to getting the address of the centroid for them.	2020-11-26 11:53:58 +01:00
Sarah Hoffmann	cc1af99dbd	filter postcodes by search rank when adding to address list The post codes are the last part that does not fit the new address ranking scheme. In particular, the search rank is still relevant for choosing if a postcode should be included into the address terms. Filter out irrelevant postcodes in getNearFeatures() already, to avoid having to check for geometry relation.	2020-11-25 21:01:33 +01:00
Sarah Hoffmann	22800d7d59	Search housenumbers with unknown address parts by housenumber term House numbers need special handling because they may appear after the street term. That means we canot just use them as the main name for searches where the address has its own search term entries. Doing this right now, we are able to find '40, Main St, Town' but not 'Main St 40, Town'. This switches to using the housenumber token as the name term instead. House number tokens can get special handling when building the search query that covers the case where they come after the street. The main disadvantage is that this once more increases the numbers of possible search interpretation of which we have already too many. no penalty for housenumber searches	2020-11-25 11:36:10 +01:00
Sarah Hoffmann	f21853ea9d	Merge pull request #2071 from lonvia/fix-more-ranks Search rank 30 must always go with address rank 30	2020-11-24 21:45:30 +01:00
Sarah Hoffmann	b4b50eef15	search rank 30 must always go with address rank 30	2020-11-24 17:57:28 +01:00
Sarah Hoffmann	a9ad390b9e	move unlisted places to address rank 25 Unlisted places are derived from addr:place and as such are still places not streets.	2020-11-24 17:54:00 +01:00
Sarah Hoffmann	a4f1e40b72	do not create POI search terms on reverse-only Fixes #2067.	2020-11-23 19:55:36 +01:00
Sarah Hoffmann	49083c2597	Merge pull request #2058 from lonvia/split-address-words Split addr:* tags into words before adding to the search index	2020-11-18 08:58:17 +01:00
Sarah Hoffmann	ffb2c93ba3	POIs with unknown addr:place must add parent name to address The previous behaviour was a left-over from a former version where such POIs parented to the street. Now that they parent to places, it should be included.	2020-11-17 19:44:43 +01:00
Sarah Hoffmann	30a6b6bdac	split addr: tags into words before adding to the search index Address parts are only matched by single partial words. If the addr: names are not split, then multi-word names cannot be found.	2020-11-17 18:03:33 +01:00
Sarah Hoffmann	9ede048769	disallow linking for postcode areas	2020-11-17 10:53:26 +01:00
Sarah Hoffmann	6b60f0ab03	use bool_or(ST_Intersects) instead of ST_Intersects(ST_Collect) ST_Intersects segfaults on geometry collections for certain versions of Postgis 3.	2020-11-16 15:28:01 +01:00
Sarah Hoffmann	aa9923bf07	fix typo	2020-11-16 15:28:01 +01:00
Sarah Hoffmann	9160cce6d8	remove unused columns in search_name_* and use right index We only need the address rank these days, so get rid of search rank. Also switch indexes to work on address rank.	2020-11-16 15:28:01 +01:00
Sarah Hoffmann	7324431b12	get additional addresses for rank 30 objects get_addressdata() now also checks if the place itself has entries in the place_addressline table and merges them into the results. Also restrict checking for address tag places to cases where the name cannot be found in the parent's address search terms. Looking up all address tags is just too slow.	2020-11-16 15:28:01 +01:00
Sarah Hoffmann	021f2bef4c	get address terms from address tags for rank 30 For rank 30 objects add extra elements into the place_addressline table.	2020-11-16 15:28:01 +01:00
Sarah Hoffmann	c7472662a6	lookup places for address tags for rank < 30 While previously the content of addr:* tags was only added to the list of address search keywords, we now really look up the matching place. This has the advantage that we pull in all potential translations from the place, just like all the other address terms that are looked up by neighbourhood search. If no place can be found for a given name, the content of the addr:* tag is still added to the search keywords as before.	2020-11-16 15:28:01 +01:00
Sarah Hoffmann	fa574ae9fd	use different area estimates for large countries	2020-11-02 14:21:30 +01:00
Sarah Hoffmann	0f5615b618	guess a base address level for address rank 0 objects The guess is based on the area and mainly avoids odd addresses for very large or small objects.	2020-11-02 11:42:10 +01:00
Sarah Hoffmann	95f83b90d2	minor fixes for geometry compuation during boundary ranking Go back to using centroid when determining if one admin level is within another. There are cases where boundaries are slightly misaligned due to mapping errors (not using the same ways in the relations). Only declare boundaries the same when they have the same wikidata tag _and_ have exactly the same geometry. This works around tagging errors with the wikidata tag, which happen because of automated edits to the wikidata tag.	2020-10-28 10:49:26 +01:00
Sarah Hoffmann	7a16909219	detect and remove admin boundary duplicates The Polish community maps admin boundaries that span multiple levels by duplicating the boundary relations. Detect this situation by looking out for matching wikidata tags. The higher ranked duplicates are then thrown out from the address pool by setting their address rank to 0.	2020-10-28 10:49:26 +01:00
Sarah Hoffmann	bf4d75458c	add explicit bbox contains check Now that the containment check uses ST_Relate, we need to add a separate bbox contains check to ensure that Postgis does the efficient check first. Note that we still cannot get rid of the overlap(&&) check because then Postgis will use the wrong indexes.	2020-10-19 10:39:01 +02:00
Sarah Hoffmann	1064a9264e	revert to && comparison for geometries Postgis 3 picks the wrong index when using ~ or @.	2020-10-16 09:49:48 +02:00
Sarah Hoffmann	acfa7bec9c	use computed centroid for location_area_large The new address computation assumes that the centroid is inside the area. Therefore we cannot use the centroid function. Use the pre-computed centroid instead which has already been corrected to be inside the area.	2020-10-15 17:30:52 +02:00
Sarah Hoffmann	62b94e838b	correctly set from area column in place_addressline This was always set to true which brings us to the question if it is even still needed.	2020-10-15 12:06:53 +02:00
Sarah Hoffmann	5236e7a03e	fix use of geometry operators @ is contained by while ~ is contains.	2020-10-15 12:06:18 +02:00
Sarah Hoffmann	7e9412a044	demote admin boundaries for place areas Also demote the address rank of an admin boundary when there is a place area of higher rank that completely contains the area. This catches the case where city boundaries do not exactly align with administrative units (see for example Moscow).	2020-10-14 11:33:47 +02:00
Sarah Hoffmann	e47c19beb9	exclude rank 25 when computing addresses of streets Address rank 25 is used for squares which are address-wise on the same level as streets.	2020-10-13 22:36:17 +02:00
Sarah Hoffmann	2fe3c654fc	overhaul address computation This is a complete rewrite of the selection of address parts to be inserted into the place_addressline table. The new algorithm selects for each rank: * the boundary overlapping with the addressee and contained in the already selected boundaries of lower rank, or failing that * the place node closest to the addressee that is contained in the already selected boundaries and in the influence radius of already selected place nodes of lower rank Place nodes that are not contained in already selected boundaries of lower rank are completely thrown away. All other candidates are added as non-address parts.	2020-10-13 22:10:07 +02:00
Sarah Hoffmann	5ec48c66cb	move ordering out of getNearFeatures The two places where the function is called have different ordering requirement.	2020-10-13 15:24:54 +02:00
Sarah Hoffmann	887ae7fcab	increase radius of influence around city nodes The current radius does not cover cities with more than a million inhabitants well.	2020-10-12 14:17:37 +02:00
Sarah Hoffmann	ff47f6f65d	when linking always check against original address rank	2020-10-11 12:29:49 +02:00
Sarah Hoffmann	b04463bb2d	demote place nodes in admin areas If a place node of city rank and above finds itself in an administrative boundary of the same address rank, then increase the address rank by 2. This catches the rather frequent case where city suburbs are tagged for historical reasons as towns or villages.	2020-10-11 12:04:53 +02:00
Sarah Hoffmann	5391bb6cf7	update to latest osm2pgsql version The latest version of osm2pgsql no longer creates indexes on the members of planet_osm_rels. So we do that ourselves. Given that we only need to access associated street relations, the index can become quite a bit smaller.	2020-10-05 17:11:13 +02:00
Sarah Hoffmann	40bc1752c2	remove unused idx_placex_geometry_reverse_lookupPoint The index has been unused ever since the query using it was changed two years ago. Given that it has not been missed much, drop it completely here.	2020-09-30 09:21:35 +02:00
Sarah Hoffmann	f8694da3c9	Remove more rank_search usage from address computation Fixes #1904.	2020-09-25 17:50:36 +02:00
Sarah Hoffmann	6625e93be6	Merge pull request #1975 from lonvia/simplify-parent-assignment-for-unlisted-places Use closest containing place area for parent of unlisted addr:place	2020-09-23 19:10:42 +02:00
Sarah Hoffmann	d9325dc11a	use rank_address when invalidating containing objects Only rank_address is now relevant for determining if a place could be part of an address.	2020-09-23 17:44:31 +02:00
Sarah Hoffmann	d3ca9dd3f7	remove ST_Covers check when also testing for ST_Intersects Using both is slightly problematic because they have different ways to use the index. Newer versions of Postgis exhibit a query planner issue when both functions appear together. As ST_Intersects includes ST_Covers, simply remove the latter.	2020-09-23 17:44:31 +02:00
Sarah Hoffmann	e552f6bce5	use closest containing place for unlisted addr:place We can't use getNearFeatures() to determine the parent of a place with an unlisted addr:place because this function returns place nodes that are potentially outside the area of interest. Doing the complete address computation is too expensive, so simply use the area with the largest rank that contains the feature instead.	2020-09-23 17:33:42 +02:00
Sarah Hoffmann	c84e7e72f1	add unknown addr:place to address output When a POI has no addr:street but an addr:place that is not contained in the name list of the parent place, then remember this situation and merge the content of addr:place into the address output. We don't need to care about translations in this case because it is obvious that no object with translations exists if the parent isn't the object named in addr:place.	2020-09-23 11:55:18 +02:00
Sarah Hoffmann	f2ff351da4	Merge pull request #1971 from lonvia/drop-support-for-isin Drop support for is_in tag	2020-09-23 09:20:35 +02:00

1 2 3 4 5 ...

476 Commits