it's useful to limit per process mem usage to prevent
oom killer because we can't save if we get killed.
overhaul diskpagecache to just use rdbcache. much simpler
and faster, but disabled for now until debugged more.
reduce min files to merge for crawlbot collections so
they stay more tightly merged to conserve fds and mem.
improved logDebugDisk msgs.
overhauled File.cpp fd pool. now it is way faster and
doesn't use any extra mem. much simpler too. although
could be sped up a little by using a linked list, but
probably is not significant enough to warrant doing right now.
increase mem ptr table from 3M to 8M slots. should really make
dynamic though. fix core from null msg20s[0]->m_r.
only call attemptMergeAll once every 60 seconds really.
do not attempt merge if already merging.
added url discovered time to gbssdocs so we know when
we first found a url. also added to new urls.csv.
fixed spiderdb list deduping so as not to discard
the oldest spider request any more so we keep our
discovered time in tact.
was causing data corruption in reads and writes.
go to urgent shutdown mode if on 10th try so gb
will actually exit. do not startup if there is
critical data corruption.