30 Commits

Author SHA1 Message Date
Matt
5b92b5f6d5 now term freqs are almost exact for qatest123.
sometimes an off by 1 bug. we should really call
msg5 to get the list w/o thread and get a truly
exact term freq for qatest123 for consistency.
that would be in Posdb.cpp::getTermFreq()
2014-11-25 15:54:15 -07:00
Matt
4e8a42e024 text replacements for bad int32_t substitutions 2014-11-17 18:24:38 -08:00
Matt
4c19453ea9 working with -m32 for basic testing.
compiles for 64-bit.
2014-11-12 11:38:37 -08:00
Matt
96b8197ad3 now it compiles with -m32 2014-11-10 14:45:11 -08:00
Matt Wells
e7dd8f7956 replace long long with int64_t 2014-10-30 13:36:39 -06:00
Matt Wells
b13f3d24d7 replaced unsigned long long with uint64_t 2014-10-30 13:30:39 -06:00
mwells
103138aa81 various bug fixes. more qa tests. 2014-09-24 20:03:16 -07:00
mwells
9aa0847316 added langw and langwieght to control weight
received by a result's score if its language matches
that of query, or is "unknown".
2014-09-21 18:47:30 -07:00
mwells
312b39c059 lookup facet values to get their text representations. 2014-07-29 16:17:18 -07:00
mwells
c314e61968 make sectiondb stats just a special case of facets 2014-06-17 16:39:02 -06:00
mwells
d71922168e facetize the sectiondb stuff 2014-06-16 20:40:35 -07:00
mwells
20c4ac4205 got it marking up html now with sectiondb stats.
seems to work ok.
2014-06-12 14:42:08 -07:00
Matt Wells
8477ef72f8 support gbmin gbmax gbminint gbmaxint range query
terms properly, when generating the docidvotebuf.
fixes boolean queries using them as well.
2014-06-05 14:47:45 -07:00
Matt Wells
c729b51ae5 fixed exact # search results hit count
when using min/max/sort operators.
2014-05-20 13:45:00 -07:00
Matt Wells
2f331d55e5 widget updates 2014-05-06 10:47:57 -07:00
mwells
ca0843aa8b more bool query fixes. 2014-03-20 10:03:25 -07:00
mwells
cfbec626e8 more righteous fixes for bool queries 2014-03-19 13:51:32 -07:00
mwells
54cc8088fb more bool query fixes. hopefully this will do it,
but still can do some optimizations for speed.
2014-03-17 17:00:08 -07:00
mwells
cb483c42ea more fixes for bool searching before
using a slightly different and simpler approach
2014-03-13 16:00:23 -07:00
Matt Wells
fbd1bcd349 initial attempt at new boolean query logic.
supports unlimited # of boolean query terms.
already docid phased from phasing logic already there
but could be phased more to save more mem and speed up
a little more.
2014-03-12 20:53:44 -07:00
Matt Wells
27e8e810d2 use collnum instead of coll string.
more stable since resetting collections
keeps string the same but changes the collnum.
2014-03-06 15:48:11 -08:00
Matt Wells
2d4af1aefe index numbers as integers too, not just floats
so we can sort by spider date without losing
128 seconds of resolution.
2014-02-06 20:57:54 -08:00
Matt Wells
d3b498a057 time slice checkpoint 2014-02-03 19:17:58 -08:00
Matt Wells
8a49e87a61 got code with shard rebalancing compiling.
now we store a "sharded by termid" bit in posdb
key for checksums, etc keys that are not sharded
by docid. save having to do disk seeks on every
host in the cluster to do a dup check, etc.
2014-01-11 16:08:42 -08:00
Matt Wells
364216ff16 fixed bugs in sort by prices, etc. 2013-11-11 18:58:45 -08:00
Matt Wells
09f28b2f26 now we index all numbers that have field names
(so can't just be a number in the body) but it
can be in a meta tag or json item. then use
like gbsortby:products.offerPrice to sort the
search results (json objects) by that.
2013-11-08 16:16:13 -08:00
Matt Wells
991e2f30f7 speed up whitelist hashtable like 20x
using hashtable key magic.
2013-09-15 21:10:53 -07:00
Matt Wells
928dc36a03 get "&site=abc.com+xyz.com"... working to restrict
search results to specified sites. tested a little.
2013-09-15 20:16:48 -07:00
mwells
107037c6a2 new &sites=xyz.com+abc.com+... functionality compiles ok. 2013-09-15 18:14:32 -06:00
Matt Wells
f6e560c1f4 Initial file population. 2013-08-02 13:12:24 -07:00
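
Note on commit 2d4af1aefe: the "128 seconds of resolution" figure is what one would expect if numeric fields were previously stored as 32-bit IEEE 754 floats (an assumption here; the log only says "floats"). A 32-bit float has a 24-bit significand, so for Unix timestamps between 2^30 and 2^31 (roughly 2004 through 2038) adjacent representable values are 2^(30-23) = 128 apart. The sketch below illustrates that arithmetic only; the helper names `float_spacing_at` and `through_float` are hypothetical and do not come from the repository.

```cpp
#include <cmath>
#include <cstdint>

// Gap between x and the next representable 32-bit float above it.
inline float float_spacing_at(float x) {
    return std::nextafter(x, INFINITY) - x;
}

// Round-trip an integer timestamp through a 32-bit float, as would
// happen if a spider date were indexed only as a float.
// Assumes the default IEEE 754 round-to-nearest mode.
inline int64_t through_float(int64_t t) {
    return static_cast<int64_t>(static_cast<float>(t));
}
```

Under these assumptions, timestamps near 1400000000 (mid-2014) that differ by less than about a minute can collapse to the same float value, which is why indexing the value as an integer as well preserves sort order by spider date.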