mwells
4e485b6649
increase dolebuf cache time from 2 to 5 mins
...
for better performance. cache empty dolebufs
if winner tree list was not from cache, so
in case we have a huge spiderdb scan list
of urls we aren't spidering we can cache it,
like twitter.com e.g. do not call strstr
in getUrlFilterNum2() for .css? or /print/
since it was taking way too much cpu time.
2015-02-21 15:17:28 -07:00
Matt
4c19453ea9
working with -m32 for basic testing.
...
compiles for 64-bit.
2014-11-12 11:38:37 -08:00
Matt
96b8197ad3
now it compiles with -m32
2014-11-10 14:45:11 -08:00
Matt Wells
e7dd8f7956
replace long long with int64_t
2014-10-30 13:36:39 -06:00
mwells
74385939c6
put dropped requests in bold red
2014-09-04 11:01:49 -07:00
Matt Wells
b265e8d027
change bad master link to admin link
2014-07-22 10:42:30 -07:00
Matt Wells
acd05aa740
fix a few minor bugs.
...
/master/->/admin/ and crawl type mismatch.
2014-03-16 10:34:58 -07:00
Matt Wells
7cd746f567
fix msge0 msg0 overload in sockets table
...
when all diffbot replies timed out at once
at released thousands of spiders.
2014-01-22 20:34:55 -08:00
Matt Wells
8a9b1f7a19
added diffbot retry rules.
...
added maxTotalSpiders parm for
all colls to follow.
tried to fix msg 0x00 socket jam up.
2014-01-22 19:57:38 -08:00
Matt Wells
33c5d9c07f
a lot of times rdb tree has invalid collection
...
numbers in it so fix our counting algo in case
the collection rec no longer exists!
2014-01-21 19:01:44 -08:00
Matt Wells
089d7f34a0
more spiderdb spider request fixes
2014-01-19 18:00:56 -08:00
Matt Wells
b755b4d581
formatting fixes
2014-01-19 00:57:20 -08:00
Matt Wells
fe3a879758
formatting changes
2014-01-19 00:38:02 -08:00
Matt Wells
f6e560c1f4
Initial file population.
2013-08-02 13:12:24 -07:00