Commit Graph

116 Commits

Author SHA1 Message Date
Kevin Truong
f2a9e68998 Fixes #2920. Allow facet ranges to include asterisk 2015-06-11 13:45:55 -07:00
Matt Wells
2aeb88e19b update search api doc 2015-04-22 18:55:25 -06:00
Matt Wells
b0b26126a5 fix parens bug for gbsortbyint:gbspiderdate)
do not include ( or ) as part of the field value
since they are associated with boolean syntax.
2015-04-22 14:02:28 -06:00
Matt
8433c49aa9 make sure we index a spider status doc for each diffbot
object. that way we can tell if diffbot objects are deduping,
how they are changing over time, etc.
2015-04-03 14:59:09 -06:00
Matt
6b293f17e6 now show "totalDocsWithField" for each facet, so we know
how many docs had that field, with any particular value,
so we can do tf/idf type things.
2015-04-01 09:16:42 -06:00
mwells
7de9f6940b documentation for new gbss spider status doc fields. 2015-03-20 12:43:21 -06:00
Matt
dea534827e langidbits init bug leftover from searchinput reset
memset fix i think.
2015-03-17 15:04:31 -06:00
Matt Wells
3b39b1d37a fix facet mem leak from QueryTerm::m_facetHashTable and safebuf
when doing federated queries over a token.
2015-03-15 07:18:32 -07:00
Matt Wells
3c2b082540 gbfacetstr: is case-sensitive. 2015-03-12 13:54:11 -07:00
Matt
2413a9b9b1 query stop words now based on selected langid. 2015-03-08 15:16:24 -07:00
Matt Wells
0a3d26f893 added support for &nf=50 to limit to top 50 facets. 2015-01-29 10:34:22 -07:00
Matt Wells
a2ce92cd71 fix core 2015-01-23 18:59:50 -08:00
Matt
da86470143 fix query bug. 2015-01-21 16:39:24 -07:00
mwells
4969aa728e Merge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing
Conflicts:
	Address.cpp
	PagePerf.cpp
	Parms.cpp
	Xml.cpp
2015-01-13 12:29:49 -07:00
mwells
87285ba3cd use gbmemcpy not memcpy so we can get profiler working again
since memcpy can't be interrupted and backtrace() called.
2015-01-13 12:25:42 -07:00
Matt Wells
0ae882e1b5 fix query syntax help bugs related to facets 2015-01-12 10:41:02 -08:00
Matt
ca68ae022a fix punct at beginning of term bug. 2014-12-17 10:29:26 -08:00
Matt
b89f071f7c quite a few bug fixes from adding the new query
syntax qa test.
2014-12-11 18:24:28 -08:00
Matt
9a3489773d query syntax updates 2014-12-11 14:37:30 -08:00
Matt Wells
720517c2f5 fix facet range lists 2014-12-09 16:51:14 -08:00
Matt Wells
d0bed16be5 fix type in sytnax.html page 2014-12-09 14:15:00 -08:00
Matt Wells
0331363893 show language query synonym terms came from
in the xml/json feed.
2014-12-04 10:57:01 -07:00
Matt Wells
654084f557 fix 64bit conversion bug. realloc offset should have
been 64bit not 32bit in Linkdb.cpp.
2014-12-03 07:35:14 -08:00
Matt
ea67c688b9 fixed a couple really nasty mem leak bugs from new facet code 2014-11-25 11:00:27 -07:00
Matt
adcef39376 Merge branch 'diffbot-testing' into diffbot-matt
Conflicts:
	Collectiondb.cpp
	Collectiondb.h
	Conf.cpp
	Conf.h
	Msg39.cpp
	PageEvents.cpp
	PageResults.cpp
	PageTurk.cpp
	Pages.cpp
	Parms.cpp
	Posdb.cpp
	Proxy.cpp
	Query.cpp
	Query.h
	RdbBase.cpp
	RdbMap.cpp
	Repair.cpp
	Repair.h
	SafeBuf.cpp
	Spider.cpp
	Tagdb.cpp
	TopTree.cpp
	XmlDoc.cpp
	main.cpp
2014-11-20 16:53:07 -08:00
Matt
4e8a42e024 text replacements for bad int32_t substitutions 2014-11-17 18:24:38 -08:00
Matt
931a1c4bc6 good checkpoint. quite a few fixes. 2014-11-17 18:13:36 -08:00
Matt
994dc02d62 make user weight a float, not int 2014-11-17 14:46:31 -08:00
Matt Wells
8a73cbe946 added a couple more facet examples 2014-11-12 17:22:21 -08:00
Matt
4c19453ea9 working with -m32 for basic testing.
compiles for 64-bit.
2014-11-12 11:38:37 -08:00
Matt
ab58809811 no longer piggyback gbfieldmatch: on facet termlists
because it needs to be super fast.
2014-11-11 13:16:08 -08:00
Matt
96b8197ad3 now it compiles with -m32 2014-11-10 14:45:11 -08:00
Matt Wells
118a5c1a33 fix gbfieldmatch:pageUrl:"http://...." query.
problem with the : in the http:// and problem
with Query.cpp converting pageUrl to lower case
when it was hashed as "pageUrl" in XmlDoc::hashFacet2().
2014-11-04 13:53:36 -08:00
Matt Wells
e7dd8f7956 replace long long with int64_t 2014-10-30 13:36:39 -06:00
Matt Wells
b13f3d24d7 replaced unsigned long long with uint64_t 2014-10-30 13:30:39 -06:00
mwells
854767e074 add example for gbsortby:sitenuminlinks into syntax page 2014-10-01 12:07:01 -07:00
mwells
8e6365f476 minor fixes in docs 2014-09-27 20:26:21 -07:00
mwells
a3a0b52ee4 added the query reindex smoke test.
fixed a lot of related bugs.
2014-09-25 17:44:35 -07:00
Matt Wells
d7b323afc6 fix image detection 2014-09-21 07:41:45 -07:00
Matt Wells
36b82474d9 fix infinite loop bug in Query.cpp 2014-09-18 19:38:32 -07:00
mwells
44f00da70e facet fixes 2014-09-05 10:21:02 -07:00
mwells
79ec8bd268 updates help/syntax page 2014-09-04 22:36:28 -07:00
mwells
efc7f3c064 support facet ranges now like
gbfacetint:gblang,0-1,1-2,2-3,3-4,5-6,6-18
2014-09-04 20:41:37 -07:00
mwells
ccaad9d5da first stab at facet range buckets for histograms. 2014-09-04 14:55:03 -07:00
mwells
9de4e4bf3d Merge branch 'testing' into diffbot-testing 2014-08-29 11:23:13 -07:00
mwells
e7aa933959 awesome updates to help.html page 2014-08-27 22:21:30 -07:00
mwells
ad8168f214 updates for query help table 2014-08-27 23:10:27 -06:00
mwells
5c69d49176 fix html.html bug 2014-08-25 21:49:32 -07:00
Matt Wells
0c43fc82ea Merge branch 'diffbot-matt' into diffbot-testing
Conflicts:
	Query.cpp
2014-08-25 17:16:31 -07:00
mwells
6607cc2cbe added gbfieldmatch: operator for exactly matching
full field names. case sensitive. uses gbfacetstr:
values that were hashed at index time. example:
gbfieldmatch:object.field:"Some Value"
See help.html for more examples.
2014-08-25 13:57:55 -07:00