Commit Graph

1523 Commits

Author SHA1 Message Date
Matt Wells
8477ef72f8 support gbmin gbmax gbminint gbmaxint range query
terms properly, when generating the docidvotebuf.
fixes boolean queries using them as well.
2014-06-05 14:47:45 -07:00
Matt Wells
780fd43aae timestamp bug fix 2014-06-04 15:50:26 -07:00
Matt Wells
546d135007 fix boolean queries to do the on-demand
mini merges of the termlists. should fix
gbmin:offerprice:100 AND (text:lord OR text:helicopter)
2014-06-04 14:33:54 -07:00
Matt Wells
d98cf4b2b0 try to prevent slamming diffbot backend
with bulk jobs consisting of hundreds of
different domains/ips.
2014-06-04 12:37:49 -07:00
Matt Wells
4298e4e752 sanity checks for debugging duplicate
titledb file bug.
2014-06-04 12:15:12 -07:00
Matt Wells
b7d9002a05 fix log bug 2014-06-04 10:57:25 -07:00
Matt Wells
8b74bd855b Merge branch 'master' into diffbot-testing 2014-06-04 09:37:55 -07:00
Matt Wells
fcc8bc85cc update bulk job restart 2014-06-04 09:36:26 -07:00
Matt Wells
e2ca303fe2 doc updates 2014-06-04 07:38:40 -07:00
mwells
a734240474 minor date change in documentation. 2014-06-04 07:26:46 -07:00
mwells
beba94013e remove clustermaintenance documentation. seemed pretty
obsolete.
2014-06-04 07:26:10 -07:00
mwells
3fd973a53e documentation updates for scaling the cluster 2014-06-04 07:17:34 -07:00
Matt Wells
ec1b66aff5 Merge branch 'master' into diffbot-testing 2014-06-03 20:50:59 -07:00
Matt Wells
585e6a357f parm documentation update for url filters 2014-06-03 20:50:22 -07:00
Matt Wells
b534ac5812 do not print completed time if spidering is going on 2014-06-03 20:30:10 -07:00
Matt Wells
694b19e053 Merge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing 2014-06-03 18:07:51 -07:00
Matt Wells
07cf2f1129 fix core 2014-06-03 18:07:35 -07:00
Matt Wells
50468293e7 fix bool expressions with only one operand.
i.e. double parens bug.
2014-06-03 14:46:28 -07:00
Matt Wells
bf70823260 take out <moreResultsFollow> for &stream=1
for now. maybe add back in later but would be
at end of the reply.
2014-06-03 14:09:24 -07:00
mwells
d23032241d fix mem leak when downloading images is turned on. 2014-06-03 13:26:56 -07:00
Matt Wells
da677eb8a4 fix for searching for query pipe operator in quotes. 2014-06-03 13:08:35 -07:00
mwells
ba2329808b fix siteListIsEmpty bug causing spider to
spider the whole internet when it shouldn't
2014-06-03 11:37:31 -07:00
Matt Wells
c3a823c99d fix relative url bug when relative url starts with ? 2014-06-03 10:54:50 -07:00
Matt Wells
536b43e19f Merge branch 'master' into diffbot-testing 2014-06-03 10:17:00 -07:00
mwells
51bb653bb3 fix stack smash core. 2014-06-01 10:42:49 -07:00
mwells
918f43f80e still searched for stripped words even if
has a synset. fixed query lang
detector in SearchInput.cpp.
2014-06-01 10:18:24 -07:00
Matt Wells
d15f5d3ce7 when user searches for a word without the
accent marks, we now also search for the same
word but with the proper accent marks.
2014-06-01 09:37:00 -07:00
Matt Wells
6f704d3d6a fix wiktionary-based generation code so we can map
a word with accents stripped to the
word with the accents in place.
2014-06-01 06:33:16 -07:00
mwells
f16414b774 fix stripAccentMarks() to use libiconv stuff
so all languages are now supported.
2014-05-31 08:14:39 -07:00
mwells
5f16013a9e add support for stripping accent marks from greek letters. 2014-05-30 20:09:37 -07:00
Matt Wells
509ae2fed8 remove limitations on # of search results requested.
we are more of a back-end service so that can be
handled by a middle or front layer.
2014-05-29 21:32:24 -07:00
Matt Wells
132aabf589 Merge branch 'diffbot-dan' into diffbot-testing 2014-05-29 10:28:38 -07:00
Daniel Steinberg
79b2d4859b printCrawlDetailsInJson signature without version 2014-05-28 10:41:32 -07:00
Daniel Steinberg
1fae88b739 check version less than 99 2014-05-28 10:30:26 -07:00
Matt Wells
a970c12f65 Merge branch 'diffbot' into diffbot-testing 2014-05-28 09:59:39 -07:00
Matt Wells
bc5b126f2a Merge branch 'diffbot' 2014-05-28 09:15:48 -07:00
Matt Wells
662a8a33d0 emergency core fix 2014-05-28 09:29:54 -07:00
mwells
b3dcca6356 added make master-rpm 2014-05-28 07:48:02 -07:00
Matt Wells
d8aa79c90d Merge branch 'master' into diffbot-testing 2014-05-28 07:41:45 -07:00
mwells
9b985fc233 Merge branch 'testing' 2014-05-28 07:36:45 -07:00
mwells
17e1fbc16c fix getPitPosLL() error causing
lang detection to screw up.
2014-05-28 07:35:05 -07:00
Daniel Steinberg
c06f9fde36 gigablast now has a notion of version based on the request 2014-05-27 20:11:12 -07:00
Daniel Steinberg
7448e8a1ff don't use "expand" for mode= requests or non-analyze requests 2014-05-26 20:38:44 -07:00
mwells
da328a8d2f turn off spider reply indexing by default
until we stop indexing simple words in url
2014-05-26 13:22:43 -06:00
mwells
068a299339 udpate documentation 2014-05-26 13:09:57 -06:00
Matt Wells
2d4fb483b2 disambiguate error msg 2014-05-26 10:46:10 -07:00
mwells
8149e99965 added developer.html warning msg. 2014-05-26 11:41:12 -06:00
mwells
58a2c04e30 more admin.html updates. 2014-05-26 11:39:26 -06:00
mwells
cea69c35cb make sure all subsections of admin.html
have a last updated time or a warning if
documentation is old.
2014-05-26 11:31:54 -06:00
mwells
c89c1f1471 Merge branch 'master' into testing
Conflicts:
	html/admin.html
2014-05-26 11:25:33 -06:00