mwells
fa9f81bd7c
trying to fix json decoding bug.
...
make highlight class use safebuf.
2013-10-24 17:55:01 -07:00
Matt Wells
92f37343c3
fix xml search results output
2013-10-21 19:06:13 -07:00
Matt Wells
d9b132fd5a
make : into . for indexing json names.
2013-10-16 17:43:46 -07:00
Matt Wells
fc17521697
Merge branch 'master' into diffbot
...
Conflicts:
Hostdb.cpp
Makefile
PageResults.cpp
PageRoot.cpp
Pages.cpp
Rdb.cpp
SearchInput.cpp
SearchInput.h
Spider.cpp
Spider.h
XmlDoc.cpp
2013-10-16 14:28:42 -07:00
mwells
90fca8c171
fix "search in category" link.
2013-10-14 22:39:42 -07:00
mwells
9e9ef9c2cc
still getting statsdb link to work. a little
...
better now.
2013-10-14 21:21:27 -07:00
mwells
3ac5838b8f
fix the search tabs for the dmoz directory search
...
box. allow more error types when spidering dmoz docs.
2013-10-13 18:43:45 -07:00
mwells
3bc85cf528
a few cleanups for the new dmoz code.
2013-10-13 16:48:59 -07:00
mwells
d41d5554da
fix dmoz search.
2013-10-13 16:00:44 -07:00
mwells
4cbb31e180
added searchbox for dmoz pages/sites.
2013-10-13 15:45:12 -07:00
mwells
b60bdcc038
documentation updates. fixed sd=0.
2013-10-13 14:24:41 -07:00
mwells
2c7bc9031f
documentation updates.
2013-10-13 13:15:31 -07:00
mwells
8547b8f802
print pretty dmoz pages.
2013-10-13 00:39:05 -07:00
mwells
1d133e87c9
just print dmoz pages verbatim for now.
...
later we can show the dmoz entries as
search results.
2013-10-10 23:18:57 -07:00
mwells
55c5ad2921
fix "Top/" issues in breadcrumb etc.
2013-10-10 22:27:49 -07:00
mwells
ca6af65217
git dmoz nagivation system working.
...
now we just need to index the urls to
populate dmoz.
2013-10-10 22:08:21 -07:00
mwells
7ba9994804
many dmoz fixes. but still more we need to do.
...
isn't printing subcategories right now.
2013-10-08 23:55:11 -07:00
mwells
63c7764cd1
c=dmoz3 to c=dmoz
2013-10-06 17:12:45 -07:00
mwells
6c2c9f7774
trying to bring back dmoz integration.
2013-10-02 22:34:21 -06:00
Matt Wells
c0f1330d70
Merge branch 'master' into diffbot
...
Conflicts:
HttpServer.cpp
Makefile
PageGet.cpp
Pages.h
SafeBuf.h
2013-09-28 13:13:12 -07:00
mwells
5884951190
only do certain things if running
...
on a machine in matt wells datacenter.
like fan switching based on temps,
or printing seo links. made seo functions
weak overridable placeholder stubs so if
seo.o is linked in it will override.
include seo.o object if seo.cpp file exists
for automatic seo module building and linking.
2013-09-28 13:43:56 -06:00
Matt Wells
7fa4138d1c
fix Next 10 link
2013-09-17 17:19:41 -07:00
Matt Wells
98caa3225a
fix query prepend logic for json searches
2013-09-17 17:16:39 -07:00
Matt Wells
c16fe8601b
more crawlbot api fixes
2013-09-17 15:32:28 -07:00
Matt Wells
4c11265a98
more updates to crawlbot api
2013-09-16 13:59:11 -07:00
Matt Wells
78a334198b
Merge branch 'master' into diffbot
2013-09-16 09:05:37 -07:00
Matt Wells
3ac79de92e
fix type adurl -> addurl.
2013-09-16 08:11:06 -07:00
Matt Wells
991e2f30f7
speed up whitelist hashtable like 20x
...
using hashtable key magic.
2013-09-15 21:10:53 -07:00
mwells
b684414e16
almost done adding support for whitelists.
...
i.e. list of sites to restrict search results to,
for instance.
2013-09-15 15:15:56 -06:00
Matt Wells
a412c798bf
Merge branch 'master' into diffbot
...
Conflicts:
PageResults.cpp
2013-09-13 09:24:28 -07:00
Matt Wells
5dc7bd2ab4
integrate diffbot from svn back into git.
2013-09-13 09:23:18 -07:00
mwells
34b6d3e74a
fixed some cores. brought in fixes from
...
old repo.
2013-09-08 16:16:13 -06:00
mwells
91c4e768b1
more family filter fixes
2013-09-01 18:28:49 -06:00
Matt Wells
94e6492916
removed MAX_COLL_RECS so we can have unlimited
...
collections, really limited by the sizeof(collnum_t) only now,
which is 16bits, 15bits unsigned, which is the limitation.
can always expand this so we can have more than 32k collections.
2013-08-30 16:20:38 -07:00
Matt Wells
f6e560c1f4
Initial file population.
2013-08-02 13:12:24 -07:00