mwells
467e70bd98
improvements for thumbnail generator.
2014-05-11 08:44:38 -07:00
mwells
b49e4ab05f
fix core
2014-05-10 12:57:11 -07:00
Matt Wells
e70f760d87
use gbstatus: and gbstatusmsg: field operators
2014-05-09 18:10:38 -07:00
Matt Wells
b1cd0cac86
indexing spider replies now working.
use type:status to see them or
gbstatus:success or gbstatus:tcp or gbstatus:0.
2014-05-09 18:07:38 -07:00
Matt Wells
eb49094343
try to start indexing spider replies
as regular search results in the index so
you can query on those. get histograms of
spider status msgs, etc. ability to turn
that and images on/off.
2014-05-09 11:18:24 -07:00
Matt Wells
e21e0a404c
fixed bug for product title extraction.
titledb-saved.dat tree loop corruption bug.
no main coll bug.
put the ajax widget on spider status page so you can
see spider going in realtime. will give customers
a good idea of the spider moving along.
more widget fixes, to use new base64 thumbs, etc.
2014-04-28 13:30:24 -07:00
Matt Wells
de4a0a13a8
more thumbnail generation updates
2014-04-27 11:05:30 -07:00
mwells
ca0843aa8b
more bool query fixes.
2014-03-20 10:03:25 -07:00
mwells
cfbec626e8
more righteous fixes for bool queries
2014-03-19 13:51:32 -07:00
mwells
ab3368b5a0
more bool fixes. not operator support.
2014-03-19 09:38:45 -07:00
mwells
1bb91149d6
more bool fixes
2014-03-18 14:42:50 -07:00
mwells
652892dc10
more bool fixes
2014-03-18 14:37:59 -07:00
mwells
f392826b1e
nested bool query fixes
2014-03-18 14:08:59 -07:00
Matt Wells
3b97682cc3
more bool query fixes
2014-03-18 10:44:56 -07:00
mwells
cb483c42ea
more fixes for bool searching before
using a slightly different and simpler approach
2014-03-13 16:00:23 -07:00
mwells
7812f5c746
more bool fixes. still needs a little more work
2014-03-13 13:54:23 -07:00
mwells
3b2d981dff
more fixes for new boolean logic.
2014-03-13 13:09:33 -07:00
Matt Wells
fb0123ad53
nothing
2014-03-13 11:27:28 -07:00
Matt Wells
fbd1bcd349
initial attempt at new boolean query logic.
supports unlimited # of boolean query terms.
already docid phased from phasing logic already there
but could be phased more to save more mem and speed up
a little more.
2014-03-12 20:53:44 -07:00
Matt Wells
573a04bccd
fix bug in gbminint.
2014-02-06 21:36:47 -08:00
Matt Wells
2d4af1aefe
index numbers as integers too, not just floats
so we can sort by spider date without losing
128 seconds of resolution.
2014-02-06 20:57:54 -08:00
Matt Wells
f9dbd64056
get streaming time sliced results working
2014-02-06 14:25:44 -08:00
Matt Wells
d3b498a057
time slice checkpoint
2014-02-03 19:17:58 -08:00
Matt Wells
10f4443974
quite a few fixes to the quota system, cleanups etc.
2014-01-18 16:23:13 -08:00
Matt Wells
f9d0a02dbe
test and get gbparenturl: query working.
2014-01-18 09:28:58 -08:00
Matt Wells
fbcd6b8afd
display json objects that are not in arrays
in csv. show csv header. how to deal
with heterogeneous object lists?
index spiderdate: for gbsortby:spiderdate.
added gbrevsortby: support.
2013-11-12 13:51:52 -08:00
Matt Wells
09f28b2f26
now we index all numbers that have field names
(so can't just be a number in the body) but it
can be in a meta tag or json item. then use
like gbsortby:products.offerPrice to sort the
search results (json objects) by that.
2013-11-08 16:16:13 -08:00
Matt Wells
8c9d5d824b
support for gbcontenthash:xxxxx for doing
exact match deduping. highest site rank
page wins, on ties, lowest docid wins.
2013-11-04 13:47:13 -08:00
Matt Wells
fe97e08281
move from groups to shards. got rid of annoying
groupid bit mask thing.
2013-10-04 16:18:56 -07:00
Matt Wells
c77453348f
Merge branch 'master' into diffbot
Conflicts:
SearchInput.cpp
XmlDoc.cpp
2013-09-18 09:23:48 -07:00
Matt Wells
834128a076
Fixed heap breaches caused by our built-in
electric fence code from death queries.
Use HTTP/1.0 not 1.1 since we disabled keep-alive
support a long time ago.
2013-08-10 09:51:14 -07:00
mwells
651b899453
oops, wrong sign direction.
2013-08-09 22:14:13 -06:00
Matt Wells
9b94e0feac
fix core from huge death query.
2013-08-09 21:05:38 -07:00
Matt Wells
f6e560c1f4
Initial file population.
2013-08-02 13:12:24 -07:00