Commit Graph

25 Commits

Author SHA1 Message Date
Matt
6b293f17e6 now show "totalDocsWithField" for each facet, so we know
how many docs had that field, with any particular value,
so we can do tf/idf type things.
2015-04-01 09:16:42 -06:00
Matt Wells
0a3d26f893 added support for &nf=50 to limit to top 50 facets. 2015-01-29 10:34:22 -07:00
Matt
96b8197ad3 now it compiles with -m32 2014-11-10 14:45:11 -08:00
Matt Wells
e7dd8f7956 replace long long with int64_t 2014-10-30 13:36:39 -06:00
mwells
9aa0847316 added langw and langwieght to control weight
received by a result's score if its language matches
that of query, or is "unknown".
2014-09-21 18:47:30 -07:00
mwells
842d72b5db Merge branch 'testing' into diffbot-matt 2014-07-08 09:58:54 -07:00
mwells
d7cc290a1f added a few new search parms that can be used
to override collection defaults.
hide all clustered results.
max title len.
max summary excerpt/line width.
2014-07-08 07:01:51 -07:00
mwells
d9ae010371 shard gbfacetstr:gbxpathsitehash123456 terms by termid for speed.
got them working again multicasting a msg 0x39 to the appropriate shard.
set special msg39request flag for better performance for those guys.
2014-07-07 12:32:27 -07:00
mwells
b0e82edc93 new facet crap compiling now. 2014-06-20 12:28:50 -07:00
mwells
a09d4cd723 Merge branch 'master' into diffbot-matt
Conflicts:
	Collectiondb.cpp
	Pages.cpp
	XmlDoc.cpp
	gb.conf
2014-06-20 09:35:39 -07:00
Matt Wells
f1ec530eef critical bug fixes 2014-06-18 09:16:28 -07:00
mwells
c314e61968 make sectiondb stats just a special case of facets 2014-06-17 16:39:02 -06:00
mwells
d71922168e facetize the sectiondb stuff 2014-06-16 20:40:35 -07:00
mwells
5c0b371dc9 Merge branch 'testing' into diffbot-matt
Conflicts:
	Collectiondb.cpp
	HttpServer.cpp
	Make.depend
	Parms.cpp
	Parms.h
2014-06-13 11:00:09 -07:00
mwells
3cf3cddc5c beginning of total parm overhaul.
new injection parms, just need to engage them.
2014-06-12 21:27:06 -07:00
mwells
20c4ac4205 got it marking up html now with sectiondb stats.
seems to work ok.
2014-06-12 14:42:08 -07:00
mwells
e4ce9bc9ac squidproxycache/floaters/sectiondbtagging all compiles.
need to do run-time debugging now.
2014-06-11 17:57:28 -07:00
Matt Wells
4cba959529 revised msg39.cpp in order to fix boolean bug 2014-05-13 20:50:11 -07:00
Matt Wells
2f331d55e5 widget updates 2014-05-06 10:47:57 -07:00
Matt Wells
27e8e810d2 use collnum instead of coll string.
more stable since resetting collections
keeps string the same but changes the collnum.
2014-03-06 15:48:11 -08:00
Matt Wells
25cf0efdbf first compiled stab at multi collection searching. 2014-03-06 10:45:13 -08:00
Matt Wells
2d4af1aefe index numbers as integers too, not just floats
so we can sort by spider date without losing
128 seconds of resolution.
2014-02-06 20:57:54 -08:00
mwells
107037c6a2 new &sites=xyz.com+abc.com+... functionality compiles ok. 2013-09-15 18:14:32 -06:00
mwells
b684414e16 almost done adding support for whitelists.
i.e. list of sites to restrict search results to,
for instance.
2013-09-15 15:15:56 -06:00
Matt Wells
f6e560c1f4 Initial file population. 2013-08-02 13:12:24 -07:00