Commit Graph

34 Commits

Author SHA1 Message Date
mwells
467e70bd98 improvements for thumbnail generator. 2014-05-11 08:44:38 -07:00
mwells
b49e4ab05f fix core 2014-05-10 12:57:11 -07:00
Matt Wells
e70f760d87 use gbstatus: and gbstatusmsg: field operators 2014-05-09 18:10:38 -07:00
Matt Wells
b1cd0cac86 indexing spider replies now working.
use type:status to see them or
gbstatus:success or gbstatus:tcp or gbstatus:0.
2014-05-09 18:07:38 -07:00
Matt Wells
eb49094343 try to start indexing spider replies
as regular search results in the index so
you can query on those. get histograms of
spider status msgs, etc. ability to turn
that and images on/off.
2014-05-09 11:18:24 -07:00
Matt Wells
e21e0a404c fixed bug for product title extraction.
titledb-saved.dat tree loop corruption bug.
no main coll bug.
put the ajax widget on spider status page so you can
see spider going in realtime. will give customers
a good idea of the spider moving along.
more widget fixes, to use new base64 thumbs, etc.
2014-04-28 13:30:24 -07:00
Matt Wells
de4a0a13a8 more thumbnail generation updates 2014-04-27 11:05:30 -07:00
mwells
ca0843aa8b more bool query fixes. 2014-03-20 10:03:25 -07:00
mwells
cfbec626e8 more righteous fixes for bool queries 2014-03-19 13:51:32 -07:00
mwells
ab3368b5a0 more bool fixes. not operator support. 2014-03-19 09:38:45 -07:00
mwells
1bb91149d6 more bool fixes 2014-03-18 14:42:50 -07:00
mwells
652892dc10 more bool fixes 2014-03-18 14:37:59 -07:00
mwells
f392826b1e nested bool query fixes 2014-03-18 14:08:59 -07:00
Matt Wells
3b97682cc3 more bool query fixes 2014-03-18 10:44:56 -07:00
mwells
cb483c42ea more fixes for bool searching before
using a slightly different and simpler approach
2014-03-13 16:00:23 -07:00
mwells
7812f5c746 more bool fixes. still needs a little more work 2014-03-13 13:54:23 -07:00
mwells
3b2d981dff more fixes for new boolean logic. 2014-03-13 13:09:33 -07:00
Matt Wells
fb0123ad53 nothing 2014-03-13 11:27:28 -07:00
Matt Wells
fbd1bcd349 initial attempt at new boolean query logic.
supports unlimited # of boolean query terms.
already docid phased from phasing logic already there
but could be phased more to save more mem and speed up
a little more.
2014-03-12 20:53:44 -07:00
Matt Wells
573a04bccd fix bug in gbminint. 2014-02-06 21:36:47 -08:00
Matt Wells
2d4af1aefe index numbers as integers too, not just floats
so we can sort by spider date without losing
128 seconds of resolution.
2014-02-06 20:57:54 -08:00
Matt Wells
f9dbd64056 get streaming time sliced results working 2014-02-06 14:25:44 -08:00
Matt Wells
d3b498a057 time slice checkpoint 2014-02-03 19:17:58 -08:00
Matt Wells
10f4443974 quite a few fixes to the quota system, cleanups etc. 2014-01-18 16:23:13 -08:00
Matt Wells
f9d0a02dbe test and get gbparenturl: query working. 2014-01-18 09:28:58 -08:00
Matt Wells
fbcd6b8afd display json objects that are not in arrays
in csv. show csv header. how to deal
with heterogenous object lists?
index spiderdate: for gbsortby:spiderdate.
added gbrevsortby: support.
2013-11-12 13:51:52 -08:00
Matt Wells
09f28b2f26 now we index all numbers that have field names
(so can't just be a number in the body) but it
can be in a meta tag or json item. then use
like gbsortby:products.offerPrice to sort the
search results (json objects) by that.
2013-11-08 16:16:13 -08:00
Matt Wells
8c9d5d824b support for gbcontenthash:xxxxx for doing
exact match deduping. highest site rank
page wins, on ties, lowest docid wins.
2013-11-04 13:47:13 -08:00
Matt Wells
fe97e08281 move from groups to shards. got rid of annoying
groupid bit mask thing.
2013-10-04 16:18:56 -07:00
Matt Wells
c77453348f Merge branch 'master' into diffbot
Conflicts:
	SearchInput.cpp
	XmlDoc.cpp
2013-09-18 09:23:48 -07:00
Matt Wells
834128a076 Fixed heap breaches caused by our built-in
electric fence code from death queries.
Use HTTP/1.0 not 1.1 since we disabled keep-alive
support a long time ago.
2013-08-10 09:51:14 -07:00
mwells
651b899453 oops, wrong sign direction. 2013-08-09 22:14:13 -06:00
Matt Wells
9b94e0feac fix core from huge death query. 2013-08-09 21:05:38 -07:00
Matt Wells
f6e560c1f4 Initial file population. 2013-08-02 13:12:24 -07:00