Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-11-27 00:49:55 +03:00

Author	SHA1	Message	Date
Sarah Hoffmann	d562f11298	slightly increase radius to look for postcodes	2021-09-24 23:56:42 +02:00
Sarah Hoffmann	86ea077092	merge marking rare name with adding name token Only name tokens can be rare, so this should be the same function.	2021-07-18 16:52:37 +02:00
Sarah Hoffmann	5d6aabc457	add documentation for public interface of SearchDescription	2021-07-18 16:10:42 +02:00
Sarah Hoffmann	a48ebd9b47	move SearchDescription building into tokens Moving the logic for extending the SearchDescription into the token classes splits up the code and makes it more readable. More importantly: it allows tokenizer to define custom token classes in the future.	2021-07-17 20:24:33 +02:00
Sarah Hoffmann	3cd85eaaf1	remove Token from explicit input for SearchDescription extension The token string is only required by the PartialToken type, so it can simply save the token string internally. No need to pass it to every type. Also moves the check for multi-word partials to the token loader code in the tokenizer. Multi-word partials can only happen with the legacy tokenizer and when the database was loaded with an older version of Nominatim. No need to keep the check for everybody.	2021-07-17 18:18:31 +02:00
Sarah Hoffmann	ec3f6c9c42	factor out query position Moves token and phrase position and phrase type into a separate class that is handed in when assembling the search description. This drastically reduces the number of parameters for the function to extend the search descriptions and gives us more flexibility in the future for more complex positional analysis.	2021-07-15 14:12:59 +02:00
Sarah Hoffmann	143ff14466	remove special status of partial tokens Full-word tokens are no longer marked by a space at the beginning of the token. Use the new Partial token category instead. This removes a couple of special casing, we don't really need. The word table still has the space for compatibility reasons, so the tokenizer code needs to get rid of it when loading the tokens.	2021-07-14 22:17:17 +02:00
Sarah Hoffmann	500c61685b	remove unused variables As reported by sonarqube.	2021-07-09 16:36:42 +02:00
Sarah Hoffmann	63755c31ff	remove penalty for full words in address Now that mutli-word partials no longer exist, multi-word full words need to be used to search in addresses and therefore no longer should have a penalty. Also changes the condition when a full word is included into the address. It is no longer relevant if an equivalent partial exists but only if the term consists of more than one word.	2021-06-26 11:37:15 +02:00
Sarah Hoffmann	161f5f5cee	adjust penalty for housenumber-in-name searches When searching for house numbers in the name (for place-only terms) then the same penalties need to apply as for the regular house number search. Change the code to first compute the penalties and then create the new search variants.	2021-06-26 11:37:15 +02:00
Sarah Hoffmann	fe11d3cbbd	do not return POIs when dropping house number in query We've previously added searching through rank 30 in a house number search to enable searches for house number+name. This had the unintended side effect that rank 30 objects are also returned in s search that dropped the house number from the query. This is wrong because POIs cannot function as a parent to a house number. This fix drops all rank 30 objects from the results for a house number search if they do not match the requested house number.	2021-06-17 14:21:20 +02:00
Sarah Hoffmann	02f6afa51b	always ignore multi term partials in search Partial terms should only ever consist of one word. Ignore any other, they are a leftover from inefficient word index builts.	2021-05-23 22:13:03 +02:00
Sarah Hoffmann	185d369404	remove support for AUX housenumber tables These tables have never been actively maintained and the code is completely untested. With the upcomming changes, it is unlikely that the code remains usable. This removes the aux tables and all code that references them.	2021-04-30 10:08:29 +02:00
Sarah Hoffmann	16a66b5326	move transliteration of housenumbers into indexing Housenumbers are now saved in transliterated form in the housenumber column. This saves the transliteration step during lookup.	2021-04-04 15:26:47 +02:00
Sarah Hoffmann	e05dee6df5	allow sorting by housenumbers for rare street names Usually we don't narrow down search results by house number when only a street name is given because there may be a lot of rows to cross check when the street name is very frequent. However, when it is known to be rare, the housenumber check may be done anyway. Fixes #2238.	2021-03-29 12:06:51 +02:00
Sarah Hoffmann	6dd2b9c2ec	do not mix partial names with other words As soon as a housenumber, postcode, etc. appear, the name term must obviously be closed and no further partial terms can be appended.	2021-03-11 22:44:49 +01:00
Sarah Hoffmann	3fbe4511f9	make linter happy	2021-03-11 21:14:23 +01:00
Sarah Hoffmann	3933fc3ad3	avoid multi-term partials in names Names are either full words or single-word partial names. Searching for multi-word partials yields exactly the same result as with full words.	2021-03-11 20:42:37 +01:00
Sarah Hoffmann	00b05e2394	higher penalty for special searches Adds a general higher penalty for special search term and an additional one if the term is anywhere but the beginning or the end. Also housenumbers and special searches together are less likely.	2021-03-11 20:37:51 +01:00
Sarah Hoffmann	d5e8c5e975	do not mix partial and full name terms If NameNonSearch already contains a partial term, then a full term must not be added to the Name list anymore.	2021-03-11 20:22:54 +01:00
Sarah Hoffmann	478dfb0639	add one-rank penalty for using partial search Ensures that full matches are preferred over partial ones even when the full word consists of only one term.	2021-03-11 17:52:44 +01:00
Sarah Hoffmann	182f5f5d7b	give preference to full words in address, too Full word terms are already preferred for the name part. Adding only one-word partials to the address, makes it impossible to give a similar preference for the address part. Each term adds a rank penalty. The problem here is that we interpret the query forwards and backwards. Having different penalty systems for name and address means that the same term ends up with different penalties and that often leads to interpretations of the wrong direction being in the way.	2021-03-11 15:03:36 +01:00
Sarah Hoffmann	8eb85f1340	increase penalty for places without housenumber Results where the housenumber was dropped are an unlikely result when they refer to something other than a street. Therefore increase their result rank so that other matches are tried first before choosing them as a result. Improves #2167.	2021-02-16 17:47:06 +01:00
Sarah Hoffmann	db3ced17bb	rename lib to lib-php	2021-02-09 11:52:07 +01:00

24 Commits