Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-12-29 16:04:07 +03:00

Author	SHA1	Message	Date
Sarah Hoffmann	823502a40a	use grapheme_stripos instead of stripos in PHP code The stripos() does not handle non-ASCII correctly.	2022-12-11 13:55:27 +01:00
Sarah Hoffmann	e0c184e097	fix base number of returned results The intent was to always search for at least 10 results. Improves on #882.	2022-08-09 13:53:20 +02:00
Kian-Meng Ang	f5e52e748f	docs: fix typos	2022-07-20 22:05:31 +08:00
Sarah Hoffmann	161d83af5b	fix handling of zero importance To avoid importance becoming zero and cancelling out other weights, `df008d99f5` introduced a minimum value for importance. That broke importances for interpolated addresses, which are less than zero. Instead of setting a minimum, set zero importances to a very small value. Fixes #2753.	2022-06-29 17:54:30 +02:00
Sarah Hoffmann	f9889f81d6	swap order of query interpretation A forward interpretation of the form 'street, city, country' is much more frequent than the reverse form 'country, city, street'. Thus swap the order of interpretations so that the forward order comes first.	2022-01-05 15:21:14 +01:00
Sarah Hoffmann	c3788d765e	add consistent SPDX copyright headers	2022-01-03 16:23:58 +01:00
Sarah Hoffmann	f00b8dd1c3	move special hack for US states to legacy tokenizer The hack for IL, AL and LA is only needed because these abbreviations are removed by the legacy tokenizer as a stop word. There is no need to keep the hack for future tokenizers. Move it therefore to the token extraction function.	2021-08-17 14:28:55 +02:00
Sarah Hoffmann	0fb8eade13	remove country restriction from tokenizer Restricting tokens due to the search context is better done in the generic search part instead of repeating the same test in every tokenizer implementation.	2021-08-16 11:41:54 +02:00
Sarah Hoffmann	cca912af4e	make all Token members private	2021-07-18 22:54:55 +02:00
Sarah Hoffmann	b14ce959d9	factor out check if a token fits current search Saves allocating an empty array.	2021-07-17 22:01:35 +02:00
Sarah Hoffmann	a48ebd9b47	move SearchDescription building into tokens Moving the logic for extending the SearchDescription into the token classes splits up the code and makes it more readable. More importantly: it allows tokenizer to define custom token classes in the future.	2021-07-17 20:24:33 +02:00
Sarah Hoffmann	3cd85eaaf1	remove Token from explicit input for SearchDescription extension The token string is only required by the PartialToken type, so it can simply save the token string internally. No need to pass it to every type. Also moves the check for multi-word partials to the token loader code in the tokenizer. Multi-word partials can only happen with the legacy tokenizer and when the database was loaded with an older version of Nominatim. No need to keep the check for everybody.	2021-07-17 18:18:31 +02:00
Sarah Hoffmann	ec3f6c9c42	factor out query position Moves token and phrase position and phrase type into a separate class that is handed in when assembling the search description. This drastically reduces the number of parameters for the function to extend the search descriptions and gives us more flexibility in the future for more complex positional analysis.	2021-07-15 14:12:59 +02:00
Sarah Hoffmann	143ff14466	remove special status of partial tokens Full-word tokens are no longer marked by a space at the beginning of the token. Use the new Partial token category instead. This removes a couple of special cases we don't really need. The word table still has the space for compatibility reasons, so the tokenizer code needs to get rid of it when loading the tokens.	2021-07-14 22:17:17 +02:00
Sarah Hoffmann	1e40d65aa9	remove dead code	2021-07-11 23:22:53 +02:00
Sarah Hoffmann	d933ead2b5	remove unnecessarily nested ifs Found by Sonarqube.	2021-07-11 19:11:37 +02:00
Sarah Hoffmann	27af9b102c	always use brackets on if statements This adds brackets around all one-line if statements that did not have them yet.	2021-07-10 17:04:46 +02:00
Sarah Hoffmann	500c61685b	remove unused variables As reported by sonarqube.	2021-07-09 16:36:42 +02:00
Sarah Hoffmann	63755c31ff	remove penalty for full words in address Now that multi-word partials no longer exist, multi-word full words need to be used to search in addresses and therefore no longer should have a penalty. Also changes the condition when a full word is included into the address. It is no longer relevant if an equivalent partial exists but only if the term consists of more than one word.	2021-06-26 11:37:15 +02:00
Sarah Hoffmann	044bb6afa5	move tokenization in query into tokenizer	2021-04-30 17:41:08 +02:00
Sarah Hoffmann	3eb4d88057	boilerplate for PHP code of tokenizer This adds an installation step for PHP code for the tokenizer. The PHP code is split in two parts. The updateable code is found in lib-php. The tokenizer installs an additional script in the project directory which then includes the code from lib-php and defines all settings that are static to the database. The website code then always includes the PHP from the project directory.	2021-04-30 11:31:52 +02:00
Sarah Hoffmann	185d369404	remove support for AUX housenumber tables These tables have never been actively maintained and the code is completely untested. With the upcoming changes, it is unlikely that the code remains usable. This removes the aux tables and all code that references them.	2021-04-30 10:08:29 +02:00
Sarah Hoffmann	1db468b6c3	remove special handling for reversed queries in getGroupedSearches getGroupedSearches is guaranteed not to be called with reversed structured queries, so there is no need to have special exclusion code.	2021-04-08 10:35:14 +02:00
Sarah Hoffmann	534de5ba81	remove reverseInPlan option from Geocode Disabling query reversal is no longer possible in the configuration, so there is no need to keep this as an option. Reversal is automatically disabled for structured search only.	2021-04-08 10:19:27 +02:00
Sarah Hoffmann	f498e40208	fix result splitting for last search group When we are in the final iteration of the search groups, it is not possible to further delay the results. Unconditionally use the results with the best rank instead.	2021-03-11 17:14:46 +01:00
Sarah Hoffmann	8eb85f1340	increase penalty for places without housenumber Results where the housenumber was dropped are an unlikely result when they refer to something other than a street. Therefore increase their result rank so that other matches are tried first before choosing them as a result. Improves #2167.	2021-02-16 17:47:06 +01:00
Sarah Hoffmann	db3ced17bb	rename lib to lib-php	2021-02-09 11:52:07 +01:00

27 Commits