Commit Graph

8 Commits

Author SHA1 Message Date
Matt Wells
941c8f1892 now added CT_STATUS type results into serps.
one for each spider reply we add so we can query
spider replies. using url: or type:status etc.
2014-05-09 13:52:12 -07:00
mwells
e0ed0f62b8 more widget updates. 2014-04-16 21:36:28 -07:00
Matt Wells
44ae7c4de6 mem labelling fixes.
fixed bad alloc when generating gigabits.
2013-12-09 14:05:02 -07:00
Matt Wells
fc17521697 Merge branch 'master' into diffbot
Conflicts:
	Hostdb.cpp
	Makefile
	PageResults.cpp
	PageRoot.cpp
	Pages.cpp
	Rdb.cpp
	SearchInput.cpp
	SearchInput.h
	Spider.cpp
	Spider.h
	XmlDoc.cpp
2013-10-16 14:28:42 -07:00
mwells
37a9e82060 update the dirty word list. but we still
should remove tags, except maybe outlinks,
and detect the dirty words on what remains.
getting too many false positives in tags still.
2013-10-15 01:01:19 -07:00
Matt Wells
5dc7bd2ab4 integrate diffbot from svn back into git. 2013-09-13 09:23:18 -07:00
Matt Wells
834128a076 Fixed heap breaches caused by our bult-in
electric fence code from death queries.
Use HTTP/1.0 not 1.1 since we disabled keep-alive
support a long time ago.
2013-08-10 09:51:14 -07:00
Matt Wells
f6e560c1f4 Initial file population. 2013-08-02 13:12:24 -07:00