Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-11-23 05:35:13 +03:00

Author	SHA1	Message	Date
Sarah Hoffmann	80ea13437d	move postcode matcher in a separate file	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	4885fdf0f9	add class for online centroid computation	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	18864afa8a	postcodes: introduce a default pattern for countries without postcodes	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	9172696324	postcodes: add support for optional spaces	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	baee6f3de0	postcodes: strip leading country codes	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	28ab2f6048	add postcodes patterns without optional spaces	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	90d4d339db	initial postcode cleaner for simple patterns Moves postcodes that are either in countries without a postcode system or don't correspond to the local pattern for postcodes into a field for a normal address part. Makes them searchable but not as a special address. This has two consequences: they are no longer a skippable part of the address and the postcodes cannot be searched on their own.	2022-06-23 23:42:31 +02:00
Sarah Hoffmann	cbb4749996	change indexing order for interpolations Interpolations are now indexed after rank 30 objects. The housenumber nodes no longer need information from the interpolations while the interpolations can make use of precomputed postcodes.	2022-06-02 15:16:46 +02:00
Sarah Hoffmann	46689df668	custom comparison for SpecialPhrase Duplicate elimination only works when a custom hash/equal function is implemented that is based on the members.	2022-05-30 16:30:41 +02:00
Sarah Hoffmann	e828d0d3f7	move quoting hack to wiki loader The bad quotes around the type for special phrases specifically occur in the Wiki pages, so they should be removed by the loader and not in the generic SpecialPhrase object.	2022-05-30 14:40:33 +02:00
Sarah Hoffmann	cce0e5ea38	convert special phrase loaders to generators Generators simplify the code quite a bit compared to the previous Iterator approach.	2022-05-30 14:12:46 +02:00
Sarah Hoffmann	042e314589	remove the language parameter in the SPWikiLoader Languages must always be configured through config or environment. Also use monkeypatched environment in tests.	2022-05-30 10:26:20 +02:00
Sarah Hoffmann	61d813bfef	add get_str_list() for config Converts a config value written as a comma-separated list into a Python list of strings.	2022-05-29 13:53:50 +02:00
Sarah Hoffmann	adeebec32a	switch tests to ICU tokenizer as default	2022-05-10 14:54:50 +02:00
Sarah Hoffmann	ed6fda6968	Merge pull request #2702 from lonvia/move-country-names-into-includes Clean up country name settings	2022-05-10 09:21:16 +02:00
Marc Tobias	821dabb138	add git commit hash to --version output	2022-05-09 23:56:13 +02:00
Sarah Hoffmann	9d468f6da0	support arbitrary prefixes in country name list This means we can now get rid of the last special cases for names.	2022-05-09 11:55:26 +02:00
Marc Tobias	0de83c4a51	fix typos of name Nominatim	2022-05-05 01:04:47 +02:00
Marc Tobias	a79ab41782	new nominatim --version CLI argument	2022-05-04 01:33:25 +02:00
Sarah Hoffmann	4f59644cc2	add tests for new data invalidation functions	2022-04-14 14:52:13 +02:00
Sarah Hoffmann	fd4ab3f262	Merge pull request #2629 from tareqpi/country-names-yaml-configuration Move default country names into yaml configuration	2022-04-04 09:04:25 +02:00
Tareq Al-Ahdal	e9f979b67b	'read_config' is no longer a fixture add 'read_config' to test cases that need it	2022-04-01 22:52:17 +08:00
Tareq Al-Ahdal	a323b8f63a	test for loading special characters from country_settings.yaml	2022-04-01 21:58:57 +08:00
Tareq Al-Ahdal	9411c14fd2	fix reset country info before loading custom data	2022-04-01 21:55:34 +08:00
Tareq Al-Ahdal	8525e7542f	custom country config loads correctly	2022-04-01 21:46:56 +08:00
Sarah Hoffmann	de18cd1523	add test for new table_has_column function	2022-03-31 15:55:20 +02:00
Tareq Al-Ahdal	b5f311d6bc	separate unit test function into three functions	2022-03-30 22:06:59 +08:00
Tareq Al-Ahdal	9db13aac72	Added unit tests for loading country info from yaml file	2022-03-25 22:22:44 +08:00
Sarah Hoffmann	a0ed80d821	restore the tokenizer directory when missing Automatically repopulate the tokenizer/ directory with the PHP stub and the postgresql module when the directory is missing. This allows switching working directories and, in particular, running the service from a different machine than the one where it was installed. Users still need to make sure that .env files are set up correctly or they will shoot themselves in the foot. See #2515.	2022-03-20 11:31:42 +01:00
Sarah Hoffmann	0a9f971e44	add tests for new analyzed housenumbers	2022-03-01 09:34:32 +01:00
Sarah Hoffmann	837d44391c	move generation of normalized token form to analyzer This gives the analyzer more flexibility in choosing the normalized form. In particular, an analyzer creating different variants can choose the variant that will be used as the canonical form.	2022-03-01 09:34:32 +01:00
Sarah Hoffmann	a6b4e8ff67	add tests for housenumber-as-name feature	2022-02-07 11:45:12 +01:00
Sarah Hoffmann	38c3ef3da0	add tests for get_string_list() Renaming test file for sanitizer config because pytest requires unique names for test files.	2022-02-07 11:22:24 +01:00
Sarah Hoffmann	610f2cc254	sanitizer: move helpers into a configuration class	2022-02-07 10:48:00 +01:00
Sarah Hoffmann	c170d323d9	add tests for cleaning housenumbers	2022-01-20 23:47:20 +01:00
Sarah Hoffmann	d09db09849	adapt ICU tests to new housenumber sanitizer Restrict tests to making sure that handing in multiple housenumbers works.	2022-01-20 16:05:49 +01:00
Sarah Hoffmann	3741afa6dc	generalize filter-kind parameter for sanitizers Now behaves the same for tag_analyzer_by_language and clean_housenumbers. Adds tests.	2022-01-20 15:42:42 +01:00
Sarah Hoffmann	560a006892	add pytest config We are using custom marks now which need to be registered to avoid warnings.	2022-01-20 15:38:02 +01:00
Sarah Hoffmann	4774e45218	clean_housenumbers: make kinds and delimiters configurable Also adds unit tests for various options.	2022-01-20 12:07:12 +01:00
Sarah Hoffmann	b453b0ea95	introduce mutation variants to generic token analyser Mutations are regular-expression-based replacements that are applied after variants have been computed. They are meant to be used for variations on character level. Add spelling variations for German umlauts.	2022-01-18 11:09:21 +01:00
Sarah Hoffmann	c3788d765e	add consistent SPDX copyright headers	2022-01-03 16:23:58 +01:00
Sarah Hoffmann	7f7d2fd5b3	skip most addr: tags with suffixes Only one addr: tag can be processed currently, so make sure it is the one without suffixes to not get odd data. addr:street is the exception because it uses a different matching mechanism.	2021-12-06 14:55:10 +01:00
Sarah Hoffmann	44cfce1ca4	revert to using full names for street name matching Using partial names turned out to not work well because there are often similarly named streets next to each other. It also prevents us from being able to take into account all addr:street:* tags. This change gets all the full term tokens for the addr:street tags from the DB. As they are used for matching only, we can assume that the term must already be there or there will be no match. This avoids creating unused full name tags.	2021-12-06 11:38:38 +01:00
Sarah Hoffmann	5a9fb6eaf7	specify text type in test SQL Older versions of postgres fail otherwise.	2021-12-03 13:56:23 +01:00
Sarah Hoffmann	54d35ddfe9	split cli tests by subcommand and extend coverage	2021-12-02 23:45:48 +01:00
Sarah Hoffmann	14a78f55cd	more unit tests for tokenizers	2021-12-02 15:46:36 +01:00
Sarah Hoffmann	7617a9316e	extend API unit tests	2021-12-01 20:48:29 +01:00
Sarah Hoffmann	a52ed366e4	add tests for migration	2021-12-01 20:27:40 +01:00
Sarah Hoffmann	7be164e2a5	more testing for refresh functions	2021-12-01 14:58:54 +01:00
Sarah Hoffmann	a24f25c0d8	more tests for exec utilities	2021-12-01 14:23:51 +01:00
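
Commit 46689df668 above notes that duplicate elimination only works when a custom hash/equal function based on the members is implemented. The following is a minimal Python sketch of that general technique, not the repository's code: the class name `Phrase`, its field names, and the helper `deduplicate` are hypothetical and chosen only for illustration.

```python
class Phrase:
    """Hypothetical stand-in for a special phrase; field names are assumptions."""

    def __init__(self, label, p_class, p_type, operator):
        self.label = label
        self.p_class = p_class
        self.p_type = p_type
        self.operator = operator

    def _key(self):
        # All members that define the identity of a phrase.
        return (self.label, self.p_class, self.p_type, self.operator)

    def __eq__(self, other):
        if not isinstance(other, Phrase):
            return NotImplemented
        return self._key() == other._key()

    def __hash__(self):
        # Must be consistent with __eq__: equal objects hash equally.
        return hash(self._key())


def deduplicate(phrases):
    """Return the set of distinct phrases (hypothetical helper)."""
    return set(phrases)
```

Without `__eq__` and `__hash__`, Python compares and hashes objects by identity, so two phrases with identical members would both remain in the set.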


235 Commits
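
Commit 61d813bfef in the log above describes converting a config value written as a comma-separated list into a Python list of strings. A rough sketch of such a conversion might look as follows. The function name `split_config_list` is hypothetical, and trimming whitespace, dropping empty items, and returning an empty list for an empty value are assumptions rather than the documented behaviour of `get_str_list()`.

```python
def split_config_list(value):
    """Split a comma-separated config value into a list of strings.

    Hypothetical helper: surrounding whitespace is trimmed and empty
    items are dropped; both choices are assumptions for this sketch.
    """
    if not value:
        return []
    return [part.strip() for part in value.split(',') if part.strip()]
```

A value such as `"en, de ,fr"` would then yield the three strings `en`, `de` and `fr`.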