Nominatim

mirror of https://github.com/osm-search/Nominatim.git synced 2024-11-29 16:42:23 +03:00

Author	SHA1	Message	Date
Sarah Hoffmann	a2ee58d8a1	only run analyze on indexing when work was done This speeds up processing when continuing indexing after it was interrupted.	2022-09-28 10:22:54 +02:00
Kian-Meng Ang	f5e52e748f	docs: fix typos	2022-07-20 22:05:31 +08:00
Sarah Hoffmann	5617bffe2f	add type annotations for indexer	2022-07-18 09:47:57 +02:00
Sarah Hoffmann	cbb4749996	change indexing order for interpolations Interpolations are now indexed after rank 30 objects. The housenumber nodes no longer need information from the interpolations while the interpolations can make use of precomputed postcodes.	2022-06-02 15:16:46 +02:00
Sarah Hoffmann	c3788d765e	add consistent SPDX copyright headers	2022-01-03 16:23:58 +01:00
Sarah Hoffmann	c1fa70639b	add new replication mode catch-up This mode gets updates until the server reports no new diffs anymore. Also adds additional indexing, when the main indexing step left a couple of objects to process. This happens only when the next update is expected to be more than 40min away.	2021-10-20 22:05:15 +02:00
Sarah Hoffmann	cf98cff2a1	more formatting fixes Found by flake8.	2021-07-12 17:45:42 +02:00
Sarah Hoffmann	568316f07c	simplify analyse function	2021-07-12 14:47:50 +02:00
Sarah Hoffmann	b9a09129fa	move WorkerPool into db module The pool is independent of the indexer and may also be used by other parts of the software.	2021-05-13 17:11:17 +02:00
Sarah Hoffmann	20891abe1c	indexer: fetch extra place data asynchronously The indexer now fetches any extra data besides the place_id asynchronously while processing the places from the last batch. This also means that more places are now fetched at once.	2021-04-30 17:41:08 +02:00
Sarah Hoffmann	6ce6f62b8e	fetch place info asynchronously	2021-04-30 17:41:08 +02:00
Sarah Hoffmann	602728895e	indexer: fetch ids in batches	2021-04-30 17:41:08 +02:00
Sarah Hoffmann	ffc2d82b0e	move postcode normalization into tokenizer	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	d8ed1bfc60	move houseunumber handling to tokenizer Normalization and token computation are now done in the tokenizer. The tokenizer keeps a cache to the hundred most used house numbers to keep the numbers of calls to the database low.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	d711f5a81e	move name token creation into tokenizer Name tokens are now handed in via token_info and used from there. Also moves the generic search name insertion function back to placex_triggers.sql.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	fa2bc60468	introduce name analyzer The name analyzer is the actual work horse of the tokenizer. It is instantiated on a thread-base and provides all functions for analysing names and queries.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	e1c5673ac3	require tokeinzer for indexer	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	9397bf54b8	introduce external processing in indexer Indexing is now split into three parts: first a preparation step that collects the necessary information from the database and returns it to Python. In a second step the data is transformed within Python as necessary and then returned to the database through the usual UPDATE which now not only sets the indexed_status but also other fields. The third step comprises the address computation which is still done inside the update trigger in the database. The second processing step doesn't do anything useful yet.	2021-04-30 11:30:51 +02:00
Sarah Hoffmann	f7e4aa51d3	indexer: reset query counter Reset the counter for queries after the asynchronous connections have been reopened.	2021-04-21 10:33:45 +02:00
Sarah Hoffmann	50b6d7298c	factor out async connection handling into separate class Also adds a test for reconnecting regularly while indexing.	2021-04-20 14:08:37 +02:00
Sarah Hoffmann	26a81654a8	indexer: make self.conn function-local Also switches to our internal connect function which gives us a cursor with a sclar() function.	2021-04-20 14:08:37 +02:00
Sarah Hoffmann	6430371d7d	make index() function private	2021-04-20 14:08:37 +02:00
Sarah Hoffmann	18705b3f18	move analyse function into indexinf function	2021-04-20 14:08:37 +02:00
Sarah Hoffmann	c6bd2bb7fb	indexer: move runner into separate file	2021-04-20 14:08:37 +02:00
Sarah Hoffmann	76b1885595	use absolute imports in Python code Relative imports are no longer officially recommended.	2021-04-16 14:20:09 +02:00
Sarah Hoffmann	dd301cf5ac	indexer: ANALYSE must be run outside transactions	2021-03-04 11:06:33 +01:00
Sarah Hoffmann	15b5906790	move setup function to python There are still back-calls to PHP for some of the sub-steps. These needs some larger refactoring to be moved to Python.	2021-02-26 15:02:39 +01:00
Sarah Hoffmann	3ee8d9fa75	properly close connections of indexer after use	2021-02-26 12:10:54 +01:00
Sarah Hoffmann	3c186f8030	add a function for the intial indexing run Also moves postcodes to fully parallel indexing.	2021-02-25 18:42:54 +01:00
Sarah Hoffmann	8c02786820	add tests for indexer	2021-01-20 21:30:27 +01:00
Sarah Hoffmann	504922ffbe	remove old nominatim.py in favour of 'nominatim index' The PHP scripts need to know the position of the nominatim tool in order to call it. This is handed in as environment variable, so it can be set by the Python script.	2021-01-18 15:43:27 +01:00
Sarah Hoffmann	c77877a934	implementaion of 'nominatim index'	2021-01-18 15:43:27 +01:00
Sarah Hoffmann	27977411e9	move indexing function into its own Python module This makes it mow a standard function of our new Python library instead of a stand-alone program.	2021-01-18 15:43:27 +01:00

33 Commits