+++URL: http://hsxa.ece.wisc.edu/
HTTP/1.1 200 OK
Date: Fri, 10 Feb 2006 19:15:25 GMT
Server: Apache/2.0.50 (Unix) mod_ssl/2.0.50 OpenSSL/0.9.7d
Last-Modified: Thu, 05 May 2005 22:44:34 GMT
ETag: "255ebab-297-b7ae9880"
Accept-Ranges: bytes
Keep-Alive: timeout=15
Connection: Keep-Alive
Content-Type: text/html
sample doc 1
This document has both cats and dogs in it.
+++URL: http://www.afrikaschule.de.vu/
HTTP/1.1 200 OK
Date: Fri, 10 Feb 2006 19:15:29 GMT
Server: Apache/1.3.27 (Linux/SuSE) mod_fastcgi/2.4.2 FrontPage/4.0.4.3 PHP/4.4.1 mod_perl/1.27 mod_ssl/2.8.12 OpenSSL/0.9.6i
Last-Modified: Wed, 18 May 2005 17:16:49 GMT
ETag: "925b7-776-428b7881"
Accept-Ranges: bytes
Keep-Alive: timeout=1, max=100
Connection: Keep-Alive
Content-Type: text/html
sample doc 2
Now here we have just cat singular and dog as well.
+++URL: http://www.mp3.com/page1
HTTP/1.1 200 OK
Content-Type: text/html
sample doc 3
This has mp3 and take and five.
+++URL: http://www.mp3.com/page2
HTTP/1.1 200 OK
Content-Type: text/html
sample doc 3
This has mp3 and take five the phrase.
+++URL: http://www.bmx.com/page1
HTTP/1.1 200 OK
Content-Type: text/html
sample doc 4
This new game I played is about bmx racing.
+++URL: http://www.bmx.com/page2
HTTP/1.1 200 OK
Content-Type: text/html
sample doc 5
I am totally into real-life bmx racing.
+++URL: http://www.john.com/page1
HTTP/1.1 200 OK
Content-Type: text/html
testing 1
john smith and bob dole walk into a bar.
+++URL: http://www.john.com/page2
HTTP/1.1 200 OK
Content-Type: text/html
testing 2
john smith and dole bob are here.
+++URL: http://www.john.com/page3
HTTP/1.1 200 OK
Content-Type: text/html
testing 3
smith john and dole bob are here.
+++URL: http://www.json.com/page1
HTTP/1.1 200 OK
Content-Type: application/json
{"document":{
"foo":"bar",
"title":"papers"
}
}
+++URL: http://www.json.com/page2
HTTP/1.1 200 OK
Content-Type: application/json
{"document":{
"foo":"bar",
"title":"boxes"
}
}
+++URL: http://www.fields.com/page1
HTTP/1.1 200 OK
Content-Type: application/json
{"strings":{
"foo":"bar",
"vendor":"Uncle Leroy"
}
}
+++URL: http://www.fields.com/page2
HTTP/1.1 200 OK
Content-Type: application/json
{"strings":{
"foo":"bar",
"vendor":"My Vendor Inc."
}
}
+++URL: http://www.fields.com/page3
HTTP/1.1 200 OK
Content-Type: application/json
{"strings":{
"foo":"bar",
"vendor":"my vendor inc."
}
}
+++URL: http://www.abc.com/page.html
HTTP/1.1 200 OK
Content-Type: text/html
ABC.COM
A wonderful web page.
+++URL: http://www.somewhere.com/foo.doc
HTTP/1.1 200 OK
Content-Type: text/html
Extension is a word document
This url ends in the word document extension.
+++URL: http://www.linker.com/page1
HTTP/1.1 200 OK
Content-Type: text/html
We link to gigablast.
link is here.
+++URL: http://www.linker.com/page1
HTTP/1.1 200 OK
Content-Type: text/html
We link to gigablast on another page.
another link is here.
+++URL: http://abc.mysite.com/page1
HTTP/1.1 200 OK
Content-Type: text/html
A page on mysite.com
Used to test the site: query operator.
+++URL: http://abc.mysite.com/dir1/dir2/somepage.html
HTTP/1.1 200 OK
Content-Type: text/html
Another page on mysite.com
Used to test the site: query operator with subdirectories.
+++URL: http://www.feline.com/
HTTP/1.1 200 OK
Content-Type: text/html
A page about cats and perhaps some food
Used to test the title: query operator.
+++URL: http://www.feline.com/page2
HTTP/1.1 200 OK
Content-Type: text/html
A page about cat food only
Used to test the title: query operator with quotes.
+++URL: http://www.naughty.com/
HTTP/1.1 200 OK
Content-Type: text/html
A naught adult content document
Fuck, shit does the adult content detector work?
+++URL: http://www.imagesrc.com/
HTTP/1.1 200 OK
Content-Type: text/html
Has an image.
What a nice image that is. This is for
testing the gbimage: query operator.
+++URL: http://www.somezip.com/
HTTP/1.1 200 OK
Content-Type: text/html
Has a zipcode meta tag.
This zipcode is for beverly hills, CA.
+++URL: http://www.somezip.com/
HTTP/1.1 200 OK
Windows-1252 charset
For testing gbcharset:latin1 even though gigablast converts everything to utf-8 we do index the original charset.
+++URL: http://www.deutsch.com/
HTTP/1.1 200 OK
Deutschland
Gerne sind wir Ihnen bei der Planung Ihres Besuches am Geburtsort des Entdeckers der Röntgenstrahlen behilflich.
(gblang:de)
+++URL: http://www.pathlen.com/subdir1/subdir2/leaf.html
HTTP/1.1 200 OK
For testing the gbpathdepth:3 query
This should match it.
+++URL: http://www.oldstuff.com/oldpage.cgi
HTTP/1.1 200 OK
Old school style
Should match the gbiscgi:1 query operator.
+++URL: http://www.allforms.com/
HTTP/1.1 200 OK
Has some forms
+++URL: http://www.jsoncams.com/page1
HTTP/1.1 200 OK
Content-Type: application/json
{
"title":"A nice camera for sale.",
"price":599.99
"color":"red"
}
+++URL: http://www.jsoncams.com/page2
HTTP/1.1 200 OK
Content-Type: application/json
{
"title":"An ok camera for sale.",
"price":350.00,
"color":"red"
}
+++URL: http://www.jsoncams.com/page3
HTTP/1.1 200 OK
Content-Type: application/json
{
"title":"Two bad cameras for sale.",
"price":199.00
"color":"black"
}
+++URL: http://www.jsoncams.com/page4
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"A nice camera for sale.",
"price":599.99,
"color":"red"
}}
+++URL: http://www.jsoncams.com/page5
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"An ok camera for sale.",
"price":350.00,
"color":"red"
}}
+++URL: http://www.jsoncams.com/page6
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"Two bad cameras for sale for cheap.",
"price":99.00,
"description":"put desc here.",
"color":"black"
}}
+++URL: http://www.bigairline.com/foo1
HTTP/1.1 200 OK
Content-Type: application/json
{
"Description":"Hires pilots to fly planes.",
"Employees":630
}
+++URL: http://www.smallairline.com/foo1
HTTP/1.1 200 OK
Content-Type: application/json
{
"Description":"Hires pilots to fly planes.",
"Employees":44
}
+++URL: http://www.bigcompany.com/page1.html
HTTP/1.1 200 OK
Content-Type: application/json
{"Company":{
"Description":"A big company.",
"Employees":1920
}}
+++URL: http://www.smallcompany.com/page1.html
HTTP/1.1 200 OK
Content-Type: application/json
{"Company":{
"Description":"A small company.",
"Employees":13
}}
+++URL: http://www.products.com/page1.html
HTTP/1.1 200 OK
Content-Type: application/json
{"product":{
"Description":"A cheap harmonica.",
"price":1.23
}}
+++URL: http://www.cpus.com/page1
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"CPU #1",
"cores":4
}}
+++URL: http://www.cpus.com/page2
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"CPU #2",
"cores":8
}}
+++URL: http://www.cpus.com/page3
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"CPU #3",
"cores":4
}}
+++URL: http://www.cpus.com/page4
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"CPU #4",
"cores":1
}}
+++URL: http://www.buildings.com/page1
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"BLDG #1",
"size":7
}}
+++URL: http://www.buildings.com/page2
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"BLDG #2",
"size":9
}}
+++URL: http://www.buildings.com/page3
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"BLDG #3",
"size":25
}}
+++URL: http://www.buildings.com/page4
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"BLDG #4",
"size":1500
}}
+++URL: http://www.buildings.com/page5
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"BLDG #5",
"size":1000
}}
+++URL: http://www.buildings.com/page6
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"BLDG #6",
"size":10000
}}
+++URL: http://www.buildings.com/page7
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"BLDG #7",
"size":10001
}}
+++URL: http://www.chickens.com/page1
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"chicken #1",
"weight":"1.5"
}}
+++URL: http://www.chickens.com/page2
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"chicken #2",
"weight":"1.8"
"price":4.99
}}
+++URL: http://www.chickens.com/page3
HTTP/1.1 200 OK
Content-Type: application/json
{ "product":{
{
"title":"chicken #3",
"weight":"2.3333333333333333333333333333333333333333333"
}}
+++URL: http://www.abc.com/page.html
HTTP/1.1 200 OK
Content-Type: text/html
A special web page
Test the url2: operator.
+++URL: http://mysite.com/special/dog/page1.html
HTTP/1.1 200 OK
Content-Type: text/html
A special web page, again
Test the site2: operator. And the inurl2: operator.
+++URL: http://www.boolean.com/page1.html
HTTP/1.1 200 OK
Content-Type: text/html
Test bool ops - pigs only
This is just about pigs.
+++URL: http://www.boolean.com/page2.html
HTTP/1.1 200 OK
Content-Type: text/html
Test bool ops - cat dog only
Only about the famous cat dog.
+++URL: http://www.boolean.com/page3.html
HTTP/1.1 200 OK
Content-Type: text/html
Test bool ops - dog only
Only about a little dog.
+++URL: http://www.boolean.com/page4.html
HTTP/1.1 200 OK
Content-Type: text/html
Test bool ops - cat and pig only
Just cat and pig I'm afraid.
+++URL: http://www.boolean.com/page5.html
HTTP/1.1 200 OK
Content-Type: text/html
Test bool ops - only cat
Did we do this one already?