Commit Graph

1199 Commits

Author SHA1 Message Date
Matt Wells
67202f3731 Merge branch 'diffbot' into diffbot-testing 2014-03-20 15:39:03 -07:00
Matt Wells
99bd9319fd temp hack to reduce network comm
between trinity and neo
2014-03-20 15:42:34 -07:00
Matt Wells
5ed19026d9 temp debug comments 2014-03-20 15:33:37 -07:00
Matt Wells
b8d0e95035 Merge branch 'diffbot' into diffbot-testing 2014-03-20 10:26:55 -07:00
mwells
ca0843aa8b more bool query fixes. 2014-03-20 10:03:25 -07:00
mwells
cfbec626e8 more righteous fixes for bool queries 2014-03-19 13:51:32 -07:00
mwells
ab3368b5a0 more bool fixes. not operator support. 2014-03-19 09:38:45 -07:00
mwells
1bb91149d6 more bool fixes 2014-03-18 14:42:50 -07:00
mwells
652892dc10 more bool fixes 2014-03-18 14:37:59 -07:00
mwells
f392826b1e nested bool query fixes 2014-03-18 14:08:59 -07:00
mwells
b7d80fd02d more bool query fixes 2014-03-18 13:41:36 -07:00
mwells
b31eaee9fd simple bool queries work 2014-03-18 12:07:29 -07:00
Matt Wells
d4302e3301 fix core 2014-03-18 11:12:50 -07:00
Matt Wells
3b97682cc3 more bool query fixes 2014-03-18 10:44:56 -07:00
Matt Wells
6e23d37e47 Merge branch 'diffbot' into diffbot-testing 2014-03-17 17:27:28 -07:00
mwells
54cc8088fb more bool query fixes. hopefully this will do it,
but still can do some optimizations for speed.
2014-03-17 17:00:08 -07:00
Matt Wells
9d3c35ad17 nothing 2014-03-17 13:53:19 -07:00
Matt Wells
acd05aa740 fix a few minor bugs.
/master/->/admin/ and crawl type mismatch.
2014-03-16 10:34:58 -07:00
Matt Wells
edbd61b0c5 thread fixes. if pthread_create fails then
keep thread queue and just return. will try to
relaunch later. do not count delete keys towards
shard rebalance count.
2014-03-15 20:07:02 -07:00
Matt Wells
5ca411e3e2 tuning the rebalance loop 2014-03-15 14:56:11 -07:00
Matt Wells
86147fe22c tight merge during rebalance to save
disk space, so neg recs annihilate pos recs.
2014-03-14 23:37:30 -07:00
Matt Wells
6c704f6fdf Merge branch 'diffbot' of github.com:gigablast/open-source-search-engine into diffbot 2014-03-14 22:16:40 -07:00
Matt Wells
e37eebd76f when rebalancing wait for merge to complete before scanning
more
2014-03-14 22:16:25 -07:00
Matt Wells
82ac3fab6c merge fixes 2014-03-14 22:15:08 -07:00
Matt Wells
df46a6fc1d Merge branch 'diffbot' of github.com:gigablast/open-source-search-engine into diffbot-matt 2014-03-14 19:32:10 -07:00
Matt Wells
1f162ce7b2 update localhosts.conf too 2014-03-14 19:20:23 -07:00
Matt Wells
553aefdb55 keep files tightly merged when doing rebalanced
to avoid running out of disk space
2014-03-14 19:19:41 -07:00
mwells
cb483c42ea more fixes for bool searching before
using a slightly different and simpler approach
2014-03-13 16:00:23 -07:00
mwells
7812f5c746 more bool fixes. still needs a little more work 2014-03-13 13:54:23 -07:00
mwells
3b2d981dff more fixes for new boolean logic. 2014-03-13 13:09:33 -07:00
Matt Wells
fb0123ad53 nothing 2014-03-13 11:27:28 -07:00
Matt Wells
9acb7ef0f4 fix core &token= core 2014-03-13 07:57:06 -07:00
Matt Wells
018258bcaa Merge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing 2014-03-12 20:55:21 -07:00
Matt Wells
fbd1bcd349 initial attempt at new boolean query logic.
supports unlimited # of boolean query terms.
already docid phased from phasing logic already there
but could be phased more to save more mem and speed up
a little more.
2014-03-12 20:53:44 -07:00
Matt Wells
3e7243c6ce fix add url core 2014-03-12 08:28:42 -07:00
Matt Wells
312438a32b Merge branch 'diffbot-dan' into diffbot-testing 2014-03-11 17:02:59 -07:00
Matt Wells
84784d8d76 minor fixups 2014-03-11 17:02:24 -07:00
Daniel Steinberg
2331b4673d Defect #2099: throw an error a crawl request was made with a name that already existed for bulk request (or the other way around) 2014-03-11 16:21:58 -07:00
Matt Wells
8445e53c61 fix query reindex some more 2014-03-11 14:46:49 -07:00
Matt Wells
c4b38a5c72 fix a few cores from previous code updates 2014-03-11 09:36:33 -07:00
Matt Wells
5c2e78e5fa Merge branch 'diffbot' into diffbot-testing 2014-03-10 20:26:30 -07:00
Matt Wells
483f3c5bae fix core 2014-03-10 18:17:28 -07:00
Daniel Steinberg
f9fdc96563 no use in newline separating the list of urls if they're going to be read back in and need to be space separated 2014-03-10 15:22:43 -07:00
Daniel Steinberg
e293d465a3 snprintf instead of sprintf 2014-03-10 14:03:28 -07:00
Daniel Steinberg
41e3988fbc not a conf file 2014-03-10 13:57:13 -07:00
Daniel Steinberg
4a7bf5d4d0 Story #2040: store raw URL submissions for customer bulk jobs 2014-03-10 13:50:30 -07:00
Matt Wells
bfcb7082f4 fix bug from nuking doledb on a new collection. 2014-03-10 13:48:00 -07:00
Matt Wells
bd4484db3c Merge branch 'testing' into diffbot-testing 2014-03-10 12:08:23 -07:00
Matt Wells
662b6d4b32 doc updates 2014-03-09 20:43:49 -07:00
Matt Wells
90ff2c2a25 update example site lists 2014-03-09 20:35:45 -07:00