Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-12-26 14:36:23 +03:00

Author	SHA1	Message	Date
Sarah Hoffmann	fc995ea6b9	move database check for module to tokenizer	2021-04-30 17:41:08 +02:00
Sarah Hoffmann	be6262c6ce	move status test to tokenizer The availability of the module is now tested by the tokenizer.	2021-04-30 17:41:08 +02:00
Sarah Hoffmann	893490f94e	add more tests for legacy tokenizer	2021-04-30 17:41:08 +02:00
Sarah Hoffmann	044bb6afa5	move tokenization in query into tokenizer	2021-04-30 17:41:08 +02:00
Sarah Hoffmann	3eb4d88057	boilerplate for PHP code of tokenizer This adds an installation step for PHP code for the tokenizer. The PHP code is split in two parts. The updateable code is found in lib-php. The tokenizer installs an additional script in the project directory which then includes the code from lib-php and defines all settings that are static to the database. The website code then always includes the PHP from the project directory.	2021-04-30 11:31:52 +02:00
Sarah Hoffmann	23fd1d032a	tests for legacy tokenizer	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	7cb7cf848d	move amenity creation to tokenizer The BDD tests still use the old-style amenity creation scripts because we don't have simple means to import a hand-crafted test file of special phrases right now.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	bef300305e	move default country name creation to tokenizer The new function is also used when a country is updated. All SQL functions related to country names have been removed.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	ffc2d82b0e	move postcode normalization into tokenizer	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	fa2bc60468	introduce name analyzer The name analyzer is the actual workhorse of the tokenizer. It is instantiated on a per-thread basis and provides all functions for analysing names and queries.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	e1c5673ac3	require tokenizer for indexer	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	9397bf54b8	introduce external processing in indexer Indexing is now split into three parts: first a preparation step that collects the necessary information from the database and returns it to Python. In a second step the data is transformed within Python as necessary and then returned to the database through the usual UPDATE which now not only sets the indexed_status but also other fields. The third step comprises the address computation which is still done inside the update trigger in the database. The second processing step doesn't do anything useful yet.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	fbbdd31399	move word table and normalisation SQL into tokenizer Creating and populating the word table is now the responsibility of the tokenizer. The get_maxwordfreq() function has been replaced with a simple template parameter to the SQL during function installation. The number is taken from the parameter list in the database to ensure that it is not changed after installation.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	296a66558f	move module installation to legacy tokenizer	2021-04-30 11:29:57 +02:00
Sarah Hoffmann	af968d4903	introduce tokenizer modules This adds the boilerplate for selecting configurable tokenizers. A tokenizer can be chosen at import time and will then install itself such that it is fixed for the given database import even when the software itself is updated. The legacy tokenizer implements Nominatim's traditional algorithms.	2021-04-30 11:29:57 +02:00
Sarah Hoffmann	185d369404	remove support for AUX housenumber tables These tables have never been actively maintained and the code is completely untested. With the upcoming changes, it is unlikely that the code remains usable. This removes the aux tables and all code that references them.	2021-04-30 10:08:29 +02:00
Sarah Hoffmann	46e8c6b112	Merge pull request #2291 from AntoJvlt/special-phrases-statistics Special phrases statistics	2021-04-27 11:57:05 +02:00
Sarah Hoffmann	1fd483643b	add tests for different scripts	2021-04-26 23:01:06 +02:00
AntoJvlt	1b68152fb2	reorganization of folder/file for the special phrases importer	2021-04-25 17:57:42 +02:00
Sarah Hoffmann	9685c68e30	replace usages of fromisoformat() with strptime() fromisoformat was only introduced with Python 3.7 while we still support Python 3.5. Fixes #2292.	2021-04-23 22:50:08 +02:00
Sarah Hoffmann	788baafa26	bdd tests: fix place-dependent ranking tests The ranks of places may differ for some countries. Force the place nodes in the test on null island which always uses the default ranking.	2021-04-22 17:31:00 +02:00
Sarah Hoffmann	50b6d7298c	factor out async connection handling into separate class Also adds a test for reconnecting regularly while indexing.	2021-04-20 14:08:37 +02:00
Sarah Hoffmann	b88b952f56	simplify token precomputation Rename function to reflect that it is only used for precomputation. The token IDs are not really needed, so don't bother to compute the array of tokens.	2021-04-19 17:24:19 +02:00
Darkshredder	1f898405a6	Fix: tiger-data tarfile test	2021-04-19 16:02:52 +05:30
Sarah Hoffmann	79d55357e8	simplify sql and website creation functions	2021-04-19 10:53:30 +02:00
Sarah Hoffmann	4fa6c0ad53	simplify constructor for SQL preprocessor Use sql path from config.	2021-04-19 10:26:25 +02:00
Sarah Hoffmann	8f63f9516b	simplify interface for adding tiger data Also simplifies tests using existing fixtures.	2021-04-19 10:26:25 +02:00
AntoJvlt	b2ae715699	Only log a warning if a wrong input is detected on the wiki while importing special phrases	2021-04-17 20:19:39 +02:00
AntoJvlt	ec859e41c6	Cleaned tests and add database cleaning tests on test_import_from_wiki	2021-04-17 19:23:33 +02:00
Sarah Hoffmann	2ca11ccc6b	add tests for continuing import	2021-04-17 11:10:36 +02:00
Sarah Hoffmann	0f11e311c4	add test for new postcode import function	2021-04-16 16:11:20 +02:00
Sarah Hoffmann	c64193f839	Merge pull request #2263 from AntoJvlt/special-phrases-autoupdate Implemented auto update of special phrases while importing them	2021-04-15 10:13:25 +02:00
Darkshredder	49ee7505ed	Fix: Removed error if endstatement is wrong and improved tests	2021-04-13 15:44:12 +05:30
AntoJvlt	ae2b2cb9a5	Tests added for the auto update of special phrases during import	2021-04-12 14:35:29 +02:00
Sarah Hoffmann	16a66b5326	move transliteration of housenumbers into indexing Housenumbers are now saved in transliterated form in the housenumber column. This saves the transliteration step during lookup.	2021-04-04 15:26:47 +02:00
Sarah Hoffmann	3590e76a1c	tests for finding non-ascii housenumbers	2021-04-04 15:26:47 +02:00
Darkshredder	0f9df32d11	Added Test for TokenSpecialTerm	2021-04-02 04:49:05 +05:30
AntoJvlt	e82de99e5a	Cleaned tests of exceptions and fix phrase_settings.json test file name.	2021-03-29 22:07:29 +02:00
Sarah Hoffmann	09b2510219	Merge pull request #2228 from AntoJvlt/import-special-phrases-porting-python Import special phrases porting python	2021-03-29 09:49:35 +02:00
AntoJvlt	57ce75eb67	Change command 'import-special-phrases --from-wiki' to 'special-phrases --import-from-wiki'.	2021-03-26 02:22:38 +01:00
AntoJvlt	cde9389e75	Errors fixes, Cleaning code, Improvement and addition of tests	2021-03-26 01:53:33 +01:00
AntoJvlt	2c19bd5ea3	Encapsulation of tools/special_phrases.py into SpecialPhrasesImporter class and add new tests.	2021-03-25 21:13:57 +01:00
AntoJvlt	ff34198569	Code cleaning, tests simplification and use of python3-icu package	2021-03-23 23:56:39 +01:00
AntoJvlt	1ce8b530cd	Introduction of PyICU for transliteration in python. Reversed changes in normalization.sql.	2021-03-23 23:34:16 +01:00
AntoJvlt	9d1c23e4f5	Updated specialphrases_testdb.sql	2021-03-20 19:17:03 +01:00
AntoJvlt	17cb59efbd	Ported functions for the import of special phrases from php to python. - the command is now --import-special-phrases - the output is not an sql file anymore, data are directly imported to the database. - the little part on the documentation (section data import) has been modified.	2021-03-20 19:11:50 +01:00
Sarah Hoffmann	118befd7d7	bdd tests: make indexing less verbose Do not print progress info for indexing when there is an error in the BDD tests.	2021-03-20 10:39:29 +01:00
Sarah Hoffmann	0d9fe6e49c	Merge pull request #2219 from lonvia/bdd-test-remove-php BDD tests: run all setup via nominatim Python library	2021-03-17 11:40:34 +01:00
Sarah Hoffmann	ebae3553e0	bdd: run all setup via nominatim Python library Drops all calls to PHP utility functions. nominatim cli functions are used where possible, to stay as close to the final code as possible with the tests. By removing the PHP calls, the test code now only uses osm2pgsql and the database module from the build directory.	2021-03-16 22:20:41 +01:00
Sarah Hoffmann	4d7c5ec089	reverse: do not prefer interpolations over closer housenumbers Always look up the closest housenumber before looking up interpolations. This ensures that closer housenumbers are preferred over interpolations. Fixes #2214.	2021-03-15 10:50:04 +01:00

430 Commits