Matt Wells
|
8477ef72f8
|
support gbmin gbmax gbminint gbmaxint range query
terms properly, when generating the docidvotebuf.
fixes boolean queries using them as well.
|
2014-06-05 14:47:45 -07:00 |
|
Matt Wells
|
780fd43aae
|
timestamp bug fix
|
2014-06-04 15:50:26 -07:00 |
|
Matt Wells
|
546d135007
|
fix boolean queries to do the on-demand
mini merges of the termlists. should fix
gbmin:offerprice:100 AND (text:lord OR text:helicopter)
|
2014-06-04 14:33:54 -07:00 |
|
Matt Wells
|
d98cf4b2b0
|
try to prevent slamming diffbot backend
with bulk jobs consisting of hundreds of
different domains/ips.
|
2014-06-04 12:37:49 -07:00 |
|
Matt Wells
|
4298e4e752
|
sanity checks for debugging duplicate
titledb file bug.
|
2014-06-04 12:15:12 -07:00 |
|
Matt Wells
|
b7d9002a05
|
fix log bug
|
2014-06-04 10:57:25 -07:00 |
|
Matt Wells
|
8b74bd855b
|
Merge branch 'master' into diffbot-testing
|
2014-06-04 09:37:55 -07:00 |
|
Matt Wells
|
fcc8bc85cc
|
update bulk job restart
|
2014-06-04 09:36:26 -07:00 |
|
Matt Wells
|
e2ca303fe2
|
doc updates
|
2014-06-04 07:38:40 -07:00 |
|
mwells
|
a734240474
|
minor date change in documentation.
|
2014-06-04 07:26:46 -07:00 |
|
mwells
|
beba94013e
|
remove clustermaintenance documentation. seemed pretty
obsolete.
|
2014-06-04 07:26:10 -07:00 |
|
mwells
|
3fd973a53e
|
documentation updates for scaling the cluster
|
2014-06-04 07:17:34 -07:00 |
|
Matt Wells
|
ec1b66aff5
|
Merge branch 'master' into diffbot-testing
|
2014-06-03 20:50:59 -07:00 |
|
Matt Wells
|
585e6a357f
|
parm documentation update for url filters
|
2014-06-03 20:50:22 -07:00 |
|
Matt Wells
|
b534ac5812
|
do not print completed time if spidering is going on
|
2014-06-03 20:30:10 -07:00 |
|
Matt Wells
|
694b19e053
|
Merge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing
|
2014-06-03 18:07:51 -07:00 |
|
Matt Wells
|
07cf2f1129
|
fix core
|
2014-06-03 18:07:35 -07:00 |
|
Matt Wells
|
50468293e7
|
fix bool expressions with only one operand.
i.e. double parens bug.
|
2014-06-03 14:46:28 -07:00 |
|
Matt Wells
|
bf70823260
|
take out <moreResultsFollow> for &stream=1
for now. maybe add back in later but would be
at end of the reply.
|
2014-06-03 14:09:24 -07:00 |
|
mwells
|
d23032241d
|
fix mem leak when downloading images is turned on.
|
2014-06-03 13:26:56 -07:00 |
|
Matt Wells
|
da677eb8a4
|
fix for searching for query pipe operator in quotes.
|
2014-06-03 13:08:35 -07:00 |
|
mwells
|
ba2329808b
|
fix siteListIsEmpty bug causing spider to
spider the whole internet when it shouldn't
|
2014-06-03 11:37:31 -07:00 |
|
Matt Wells
|
c3a823c99d
|
fix relative url bug when relative url starts with ?
|
2014-06-03 10:54:50 -07:00 |
|
Matt Wells
|
536b43e19f
|
Merge branch 'master' into diffbot-testing
|
2014-06-03 10:17:00 -07:00 |
|
mwells
|
51bb653bb3
|
fix stack smash core.
|
2014-06-01 10:42:49 -07:00 |
|
mwells
|
918f43f80e
|
still searched for stripped words even if
has a synset. fixed query lang
detector in SearchInput.cpp.
|
2014-06-01 10:18:24 -07:00 |
|
Matt Wells
|
d15f5d3ce7
|
when user searches for a word without the
accent marks, we now also search for the same
word but with the proper accent marks.
|
2014-06-01 09:37:00 -07:00 |
|
Matt Wells
|
6f704d3d6a
|
fix wiktionary-based generation code so we can map
a word with accents stripped to the
word with the accents in place.
|
2014-06-01 06:33:16 -07:00 |
|
mwells
|
f16414b774
|
fix stripAccentMarks() to use libiconv stuff
so all languages are now supported.
|
2014-05-31 08:14:39 -07:00 |
|
mwells
|
5f16013a9e
|
add support for stripping accent marks from greek letters.
|
2014-05-30 20:09:37 -07:00 |
|
Matt Wells
|
509ae2fed8
|
remove limitations on # of search results requested.
we are more of a back-end service so that can be
handled by a middle or front layer.
|
2014-05-29 21:32:24 -07:00 |
|
Matt Wells
|
132aabf589
|
Merge branch 'diffbot-dan' into diffbot-testing
|
2014-05-29 10:28:38 -07:00 |
|
Daniel Steinberg
|
79b2d4859b
|
printCrawlDetailsInJson signature without version
|
2014-05-28 10:41:32 -07:00 |
|
Daniel Steinberg
|
1fae88b739
|
check version less than 99
|
2014-05-28 10:30:26 -07:00 |
|
Matt Wells
|
a970c12f65
|
Merge branch 'diffbot' into diffbot-testing
|
2014-05-28 09:59:39 -07:00 |
|
Matt Wells
|
bc5b126f2a
|
Merge branch 'diffbot'
|
2014-05-28 09:15:48 -07:00 |
|
Matt Wells
|
662a8a33d0
|
emergency core fix
|
2014-05-28 09:29:54 -07:00 |
|
mwells
|
b3dcca6356
|
added make master-rpm
|
2014-05-28 07:48:02 -07:00 |
|
Matt Wells
|
d8aa79c90d
|
Merge branch 'master' into diffbot-testing
|
2014-05-28 07:41:45 -07:00 |
|
mwells
|
9b985fc233
|
Merge branch 'testing'
|
2014-05-28 07:36:45 -07:00 |
|
mwells
|
17e1fbc16c
|
fix getPitPosLL() error causing
lang detection to screw up.
|
2014-05-28 07:35:05 -07:00 |
|
Daniel Steinberg
|
c06f9fde36
|
gigablast now has a notion of version based on the request
|
2014-05-27 20:11:12 -07:00 |
|
Daniel Steinberg
|
7448e8a1ff
|
don't use "expand" for mode= requests or non-analyze requests
|
2014-05-26 20:38:44 -07:00 |
|
mwells
|
da328a8d2f
|
turn off spider reply indexing by default
until we stop indexing simple words in url
|
2014-05-26 13:22:43 -06:00 |
|
mwells
|
068a299339
|
udpate documentation
|
2014-05-26 13:09:57 -06:00 |
|
Matt Wells
|
2d4fb483b2
|
disambiguate error msg
|
2014-05-26 10:46:10 -07:00 |
|
mwells
|
8149e99965
|
added developer.html warning msg.
|
2014-05-26 11:41:12 -06:00 |
|
mwells
|
58a2c04e30
|
more admin.html updates.
|
2014-05-26 11:39:26 -06:00 |
|
mwells
|
cea69c35cb
|
make sure all subsections of admin.html
have a last updated time or a warning if
documentation is old.
|
2014-05-26 11:31:54 -06:00 |
|
mwells
|
c89c1f1471
|
Merge branch 'master' into testing
Conflicts:
html/admin.html
|
2014-05-26 11:25:33 -06:00 |
|