This lays the groundwork for adding variants for housenumbers.
When analysis is enabled, the 'word' field in the word table
is used as usual, so that variants can be created. Only one
analyser is allowed and it must have the fixed name
'@housenumber'.
The name query already looks for the existence of housenumbers and
may as well retrieve them. This saves up to three additional lookups.
It also means that we can lift the restriction of checking for the
existence of housenumbers for simple queries only.
A forward interpretation of the form 'street, city, country' is
much more frequent than the reverse form 'country, city, street'.
Thus swap the order of interpretations so that the forward order
comes first.
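A minimal sketch of that ordering (function and variable names are
made up for illustration):

    def address_interpretations(parts):
        """Yield possible readings of a comma-separated query,
        the more frequent forward form first."""
        yield list(parts)                     # 'street, city, country'
        if len(parts) > 1:
            yield list(reversed(parts))       # 'country, city, street'

    list(address_interpretations(['Rue de Rivoli', 'Paris', 'France']))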
Queries with a housenumber need to rank streets that have
the requested housenumber attached more highly. We already do that for
ordinary housenumber objects and for interpolations. This
adds support for Tiger housenumbers as well.
Fixes #2501.
House numbers of the form '9 bis' are common in France. So
be a bit more lenient before adding penalties to house numbers
with letters in them.
Fixes #2527.
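A minimal sketch of such a leniency check (the exact pattern and the
penalty values are illustrative only):

    import re

    def housenumber_penalty(hnr):
        """No penalty for plain numbers or the common French forms
        like '9 bis' or '9b'; a small penalty otherwise."""
        if re.fullmatch(r'\d+\s*(bis|ter|quater|[a-z])?', hnr.strip(),
                        re.IGNORECASE):
            return 0.0
        return 0.5

    housenumber_penalty('9 bis')   # -> 0.0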
When ordering results by the fact that they have a housenumber,
also take cases into account where the housenumber is on the
place itself. This may happen when the search includes the name
of the place and the housenumber or for addr:place addresses
where the place is unlisted.
The fairly complex WHERE condition of idx_placex_geometry_placenode
won't always be matched by the query planner if the condition
part doesn't appear verbatim in the query.
Fixes #2480.
Running the warm-up search requests requires querying
the most frequent words. This must be done via the tokenizer
to honor the different formats of the word table.
The additional penalty for special terms with operator None
should only apply to near searches. To reduce the number
of searches produced, restrict the None operator to
appear only in conjunction with the name.
The new icu tokenizer is now no longer compatible with the old
legacy tokenizer in terms of data structures. Therefore there
is also no longer a need to refer to the legacy tokenizer in the
name.
The hack for IL, AL and LA is only needed because these abbreviations
are removed by the legacy tokenizer as a stop word. There is no need
to keep the hack for future tokenizers. Move it therefore to the
token extraction function.
This separates the logic of creating word sets from the Phrase
class. A tokenizer may now derive the word sets any way it
likes. The SimpleWordList class provides a standard implementation
for splitting phrases on spaces.
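A minimal sketch of what such a class could look like (the real
implementation may do more, e.g. build multi-word combinations):

    class SimpleWordList:
        """Derive the word list of a phrase by splitting on spaces."""

        def __init__(self, phrase):
            self.words = phrase.split()

        def __iter__(self):
            return iter(self.words)

    list(SimpleWordList('rue de la paix'))   # -> ['rue', 'de', 'la', 'paix']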
Restricting tokens due to the search context is better done in
the generic search part instead of repeating the same test in
every tokenizer implementation.
Postgresql is very bad at creating statistics for jsonb
columns. The result is that the query planner tends to
use JIT for queries with a WHERE clause on 'info' even when
there is an index.
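One possible mitigation, sketched with psycopg2 (the database name is
an assumption), is to switch off JIT for the session:

    import psycopg2

    conn = psycopg2.connect('dbname=nominatim')
    with conn.cursor() as cur:
        # without usable statistics for the jsonb column, the planner
        # would otherwise fire up JIT compilation for simple lookups
        cur.execute('SET jit = off')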
The BDD tests cannot make assumptions about the structure of the
word table anymore because it depends on the tokenizer. Use more
abstract descriptions instead that ask for specific kinds of
tokens.
Moving the logic for extending the SearchDescription into the
token classes splits up the code and makes it more readable.
More importantly: it allows tokenizers to define custom token
classes in the future.
The token string is only required by the PartialToken type, so
it can simply save the token string internally. No need to pass
it to every type.
Also moves the check for multi-word partials to the token loader
code in the tokenizer. Multi-word partials can only happen with
the legacy tokenizer and when the database was loaded with an
older version of Nominatim. No need to keep the check for
everybody.
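A minimal sketch of the idea (class and method names such as
add_partial and add_full_word are assumptions):

    class PartialToken:
        """Partial word token; the only type that keeps the raw word."""

        def __init__(self, token_id, word):
            self.token = token_id
            self.word = word

        def extend_search(self, search, position):
            # each token type extends the SearchDescription itself
            search.add_partial(self.token, self.word, position)


    class FullWordToken:
        """Full word token; no word string needed."""

        def __init__(self, token_id):
            self.token = token_id

        def extend_search(self, search, position):
            search.add_full_word(self.token, position)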
Moves token and phrase position and phrase type into a separate
class that is handed in when assembling the search description.
This drastically reduces the number of parameters for the function
to extend the search descriptions and gives us more flexibility
in the future for more complex positional analysis.
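A minimal sketch of such a class (field names are assumptions):

    from dataclasses import dataclass

    @dataclass
    class TokenPosition:
        """Positional context handed in when assembling a
        SearchDescription."""
        token_pos: int
        phrase_pos: int
        phrase_type: str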
Full-word tokens are no longer marked by a space at the
beginning of the token. Use the new Partial token category
instead. This removes a couple of special cases that we don't
really need.
The word table still has the space for compatibility reasons,
so the tokenizer code needs to get rid of it when loading the
tokens.
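A minimal sketch of stripping that space when loading tokens:

    def clean_word_token(word_token):
        """Drop the compatibility space still stored in the word table."""
        return word_token[1:] if word_token.startswith(' ') else word_token

    clean_word_token(' main street')   # -> 'main street'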
This adds precomputation of abbreviated terms for names and removes
abbreviation of terms in the query. Basic import works but still
needs some thorough testing as well as speed improvements during
import.
New dependency for python library datrie.
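A minimal sketch of the datrie usage (the abbreviation data shown is
example data only):

    import string
    import datrie

    # precompute abbreviation variants into a trie at import time
    # instead of expanding terms in every query
    abbreviations = datrie.Trie(string.ascii_lowercase + ' ')
    abbreviations['street'] = ['st']
    abbreviations['saint'] = ['st', 'ste']

    abbreviations.get('street')   # -> ['st']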
Now that multi-word partials no longer exist, multi-word full
words need to be used to search in addresses and therefore no
longer should have a penalty.
Also changes the condition under which a full word is included in
the address: it is no longer relevant whether an equivalent partial
exists, only whether the term consists of more than one word.
When searching for house numbers in the name (for place-only
terms), the same penalties need to apply as for the
regular house number search.
Change the code to first compute the penalties and then create
the new search variants.
We've previously added searching through rank 30 in a house
number search to enable searches for house number+name.
This had the unintended side effect that rank 30 objects
are also returned in a search that dropped the house number
from the query. This is wrong because POIs cannot function
as a parent to a house number.
This fix drops all rank 30 objects from the results for a
house number search if they do not match the requested house
number.
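A minimal sketch of the filter (attribute names are assumptions):

    def filter_housenumber_results(results, requested_hnr):
        """Keep rank 30 (POI) results only when they carry the
        requested house number themselves."""
        return [r for r in results
                if r.rank_address < 30 or r.housenumber == requested_hnr]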
When name and address are empty, the keywords field in the response
of the details API would be an array because that is what PHP's
json_encode defaults to with empty array(). This default can only
be changed globally per json_encode call and that might cause
unintended collateral damage. Work around the issue by making
name and address an empty array instead of keywords.
Fixes #2329.
This adds an installation step for PHP code for the tokenizer. The
PHP code is split in two parts. The updateable code is found in
lib-php. The tokenizer installs an additional script in the
project directory which then includes the code from lib-php and
defines all settings that are static to the database. The website
code then always includes the PHP from the project directory.
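A minimal sketch of writing such a stub (file name, constant name and
value are assumptions for illustration):

    from pathlib import Path

    def install_php_stub(project_dir: Path, phplib_dir: Path) -> None:
        """Write the per-project tokenizer script that pins the settings
        which are static to the database and includes the lib-php code."""
        (project_dir / 'tokenizer.php').write_text(
            "<?php\n"
            "@define('CONST_Max_Word_Frequency', 10000);\n"
            f"require_once('{phplib_dir}/tokenizer/tokenizer.php');\n")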
These tables have never been actively maintained and the code is
completely untested. With the upcoming changes, it is unlikely
that the code remains usable.
This removes the aux tables and all code that references them.
Disabling query reversal is no longer possible in the configuration,
so there is no need to keep this as an option. Reversal is
automatically disabled for structured search only.
Usually we don't narrow down search results by house number when
only a street name is given because there may be a lot of rows
to cross check when the street name is very frequent. However,
when it is known to be rare, the housenumber check may be done
anyway.
Fixes #2238.
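A minimal sketch of the rarity check (the threshold is illustrative):

    MAX_STREET_FREQUENCY = 100   # illustrative threshold

    def needs_housenumber_check(street_word_count):
        """Cross-check house numbers on a street-only search only when
        the street name is rare enough to keep the row checks cheap."""
        return street_word_count < MAX_STREET_FREQUENCY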
Always look up the closest housenumber before looking up
interpolations. This ensures that closer housenumbers are
preferred over interpolations.
Fixes #2214.
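A minimal sketch of the preference (attribute names are assumptions):

    def pick_address_point(housenumber_hit, interpolation_hit):
        """Prefer a real house number over an interpolated one when it
        is at least as close to the query point."""
        if housenumber_hit is None:
            return interpolation_hit
        if interpolation_hit is None:
            return housenumber_hit
        return (housenumber_hit
                if housenumber_hit.distance <= interpolation_hit.distance
                else interpolation_hit)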