Commit Graph

35 Commits

Author SHA1 Message Date
mwells
fa9f81bd7c trying to fix json decoding bug.
make highlight class use safebuf.
2013-10-24 17:55:01 -07:00
Matt Wells
92f37343c3 fix xml search results output 2013-10-21 19:06:13 -07:00
Matt Wells
d9b132fd5a make : into . for indexing json names. 2013-10-16 17:43:46 -07:00
Matt Wells
fc17521697 Merge branch 'master' into diffbot
Conflicts:
	Hostdb.cpp
	Makefile
	PageResults.cpp
	PageRoot.cpp
	Pages.cpp
	Rdb.cpp
	SearchInput.cpp
	SearchInput.h
	Spider.cpp
	Spider.h
	XmlDoc.cpp
2013-10-16 14:28:42 -07:00
mwells
90fca8c171 fix "search in category" link. 2013-10-14 22:39:42 -07:00
mwells
9e9ef9c2cc still getting statsdb link to work. a little
better now.
2013-10-14 21:21:27 -07:00
mwells
3ac5838b8f fix the search tabs for the dmoz directory search
box. allow more error types when spidering dmoz docs.
2013-10-13 18:43:45 -07:00
mwells
3bc85cf528 a few cleanups for the new dmoz code. 2013-10-13 16:48:59 -07:00
mwells
d41d5554da fix dmoz search. 2013-10-13 16:00:44 -07:00
mwells
4cbb31e180 added searchbox for dmoz pages/sites. 2013-10-13 15:45:12 -07:00
mwells
b60bdcc038 documentation updates. fixed sd=0. 2013-10-13 14:24:41 -07:00
mwells
2c7bc9031f documentation updates. 2013-10-13 13:15:31 -07:00
mwells
8547b8f802 print pretty dmoz pages. 2013-10-13 00:39:05 -07:00
mwells
1d133e87c9 just print dmoz pages verbatim for now.
later we can show the dmoz entries as
search results.
2013-10-10 23:18:57 -07:00
mwells
55c5ad2921 fix "Top/" issues in breadcrumb etc. 2013-10-10 22:27:49 -07:00
mwells
ca6af65217 got dmoz navigation system working.
now we just need to index the urls to
populate dmoz.
2013-10-10 22:08:21 -07:00
mwells
7ba9994804 many dmoz fixes. but still more we need to do.
isn't printing subcategories right now.
2013-10-08 23:55:11 -07:00
mwells
63c7764cd1 c=dmoz3 to c=dmoz 2013-10-06 17:12:45 -07:00
mwells
6c2c9f7774 trying to bring back dmoz integration. 2013-10-02 22:34:21 -06:00
Matt Wells
c0f1330d70 Merge branch 'master' into diffbot
Conflicts:

	HttpServer.cpp
	Makefile
	PageGet.cpp
	Pages.h
	SafeBuf.h
2013-09-28 13:13:12 -07:00
mwells
5884951190 only do certain things if running
on a machine in matt wells datacenter.
like fan switching based on temps,
or printing seo links. made seo functions
weak overridable placeholder stubs so if
seo.o is linked in it will override.
include seo.o object if seo.cpp file exists
for automatic seo module building and linking.
2013-09-28 13:43:56 -06:00
Matt Wells
7fa4138d1c fix Next 10 link 2013-09-17 17:19:41 -07:00
Matt Wells
98caa3225a fix query prepend logic for json searches 2013-09-17 17:16:39 -07:00
Matt Wells
c16fe8601b more crawlbot api fixes 2013-09-17 15:32:28 -07:00
Matt Wells
4c11265a98 more updates to crawlbot api 2013-09-16 13:59:11 -07:00
Matt Wells
78a334198b Merge branch 'master' into diffbot 2013-09-16 09:05:37 -07:00
Matt Wells
3ac79de92e fix typo adurl -> addurl. 2013-09-16 08:11:06 -07:00
Matt Wells
991e2f30f7 speed up whitelist hashtable like 20x
using hashtable key magic.
2013-09-15 21:10:53 -07:00
mwells
b684414e16 almost done adding support for whitelists.
i.e. list of sites to restrict search results to,
for instance.
2013-09-15 15:15:56 -06:00
Matt Wells
a412c798bf Merge branch 'master' into diffbot
Conflicts:
	PageResults.cpp
2013-09-13 09:24:28 -07:00
Matt Wells
5dc7bd2ab4 integrate diffbot from svn back into git. 2013-09-13 09:23:18 -07:00
mwells
34b6d3e74a fixed some cores. brought in fixes from
old repo.
2013-09-08 16:16:13 -06:00
mwells
91c4e768b1 more family filter fixes 2013-09-01 18:28:49 -06:00
Matt Wells
94e6492916 removed MAX_COLL_RECS so we can have unlimited
collections, now limited only by sizeof(collnum_t),
which is 16 bits (15 bits excluding the sign bit).
can always expand this so we can have more than 32k collections.
2013-08-30 16:20:38 -07:00
Matt Wells
f6e560c1f4 Initial file population. 2013-08-02 13:12:24 -07:00