Commit Graph

2312 Commits

Author SHA1 Message Date
mwells
4e7152b487 fix more bugs in squid proxy implementation.
force squid proxy stack to use floaters.
2014-10-02 11:54:50 -07:00
mwells
42b891219d several fixes for floater proxy through squid proxy.
gb needs to act like squid for the rendering machines so
it can do crawl delay backoff and load balancing over the
floaters.
2014-10-02 02:08:38 -07:00
mwells
00a09104ca Merge branch 'diffbot-testing' into testing 2014-10-01 21:27:18 -07:00
mwells
8f96ba0187 tell diffbot to use gigablast host #0 as
its proxy and diffbot will in turn send
the request to one of the spider proxies in its list.
this way it can do its load balancing etc. algo.
2014-10-01 21:25:39 -07:00
Matt Wells
3a0d60d6b9 remove graphix 2014-10-01 20:00:35 -07:00
Matt Wells
7d2ee6df37 Merge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing 2014-10-01 19:55:45 -07:00
Matt Wells
abd75e5eca remove size limit on coll.conf 2014-10-01 19:55:24 -07:00
mwells
c7a5073139 added sorting by site # inlinks/pop to menu for testing 2014-10-01 12:09:43 -07:00
mwells
854767e074 add example for gbsortby:sitenuminlinks into syntax page 2014-10-01 12:07:01 -07:00
mwells
0075dfee84 fix nytimes.com cookie/redir bug again. 2014-10-01 11:53:47 -07:00
mwells
145341dcb3 import patch from diffbot-testing 2014-10-01 11:36:02 -07:00
Matt Wells
1d22e9525a fixes to pass smoke tests. 2014-10-01 11:33:39 -07:00
mwells
65840d969e update to spider proxy choose set logic 2014-10-01 10:00:24 -07:00
mwells
7d4c4e8db1 update spider proxy logic. 2014-10-01 09:26:41 -07:00
mwells
603b350e09 misnomer 2014-09-30 17:40:02 -07:00
mwells
b4ca812ef8 added parm to reset proxy stats in table. erases
all our knowledge/stats for each proxy.
2014-09-30 17:38:59 -07:00
Matt Wells
3ae773a1f3 Merge branch 'testing' into diffbot-testing 2014-09-30 16:22:37 -07:00
Matt Wells
83111cb5c1 fixes before smoke testing 2014-09-30 16:22:18 -07:00
Matt Wells
23d26e26ba Merge branch 'testing' into diffbot-testing 2014-09-30 16:02:07 -07:00
mwells
ce56fb93ab fix qa test so we can roll out proxy code. 2014-09-30 15:40:02 -07:00
mwells
2af806993b update proxy algo so not all proxies get cutoff
at once.
2014-09-30 13:08:35 -07:00
Matt Wells
98ce40967f more collection swapping fixes 2014-09-29 21:52:58 -07:00
Matt Wells
8c6d216a14 lots of fixes for collection swapping. 2014-09-29 20:16:39 -07:00
mwells
7275765fbb get collection/root login system working 2014-09-29 19:56:31 -07:00
mwells
e3dbeafa5f more updates to cloud code 2014-09-29 18:28:36 -07:00
Matt Wells
cfb2ab7e82 fix core when deleting collection
that is not swapped out.
2014-09-29 14:00:10 -07:00
mwells
bca24fb0e6 fix collection swap logic a bunch. seems to work now. 2014-09-29 13:05:20 -07:00
mwells
257a7e3c10 first stab at swapping out collection recs
to save memory when # of collections is high
2014-09-29 11:37:05 -07:00
mwells
46290fa52f new password systems. individual collection passwords/accessIps. 2014-09-28 18:59:49 -07:00
mwells
66dcf61fd7 fix empty summary related core.
added \n to scoring info on serps to make diffs
in qa.cpp simpler.
2014-09-28 14:31:56 -07:00
mwells
235d69571b hid indexbody parm.
show pure xml in cached page if it's xml.
do not show summaries for xml/json docs in the serps, pointless.
fix hashSections().
2014-09-28 13:47:54 -07:00
mwells
a8c5d6a46e fix gbfacetstr: operator for xml docs 2014-09-28 12:09:04 -07:00
mwells
2366776da3 fix parsing inconsistency bug from fixing the
hashing gblang:de etc.
2014-09-28 11:43:02 -07:00
mwells
7d3bcd7672 1 spider out at a time for qa test consistency 2014-09-28 11:00:31 -07:00
mwells
7a0f9fe370 fix support for indexing xml docs.
no longer use hacks gbxmltitle and gbxmllinks.
no longer convert html entities for xml docs using hacks
since we have XmlDoc::hashXmlFields() function.
added qaxml() qa test to test xml doc indexing and searching.
ignore <?xml> tag when generating xml tag compound name.
2014-09-28 10:43:41 -07:00
mwells
52c88aee94 index xml docs properly like we do json 2014-09-28 09:20:16 -07:00
Matt Wells
a2beb23d87 added Xml::getCompoundName() 2014-09-28 08:39:46 -07:00
Matt Wells
47eecf4165 msie img border fix 2014-09-27 21:49:48 -07:00
Matt Wells
8308915654 render tabs cleaner for MSIE 2014-09-27 21:41:15 -07:00
Matt Wells
903e53a239 fix login cookie for msie 2014-09-27 21:13:14 -07:00
mwells
8e6365f476 minor fixes in docs 2014-09-27 20:26:21 -07:00
mwells
675d21df0e fix so gblang:en gblang:"zh_cn" terms work 2014-09-27 19:35:30 -07:00
mwells
4b611edf8d link to add gigablast to browser search bar 2014-09-27 19:21:00 -07:00
mwells
8ffa4fe24e doc updates 2014-09-27 17:49:49 -07:00
mwells
0267e865b8 minor fixes 2014-09-27 17:01:16 -07:00
mwells
afd41676d2 bring back meta tag display in results again.
added qa tests for advanced search and api parms.
various api parm fixes and hides.
do not do test url on proxies if test url empty.
2014-09-27 15:54:55 -07:00
mwells
9d738cdb8b more advanced search fixes 2014-09-27 11:51:37 -07:00
mwells
6de7a3f6b3 get advanced search working again 2014-09-27 11:12:47 -07:00
mwells
6c94cfceef add <omitCount> stuff. fix getDocIds() recalls
when too many results invisible (dedupped etc.).
show in xml, json, html. provide link in html to
show omitted results.
2014-09-27 09:56:23 -07:00
mwells
8c53ce2a79 undo hashtab change. too much overhead. 2014-09-27 08:39:22 -07:00