<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-21734243</id><updated>2011-07-28T13:58:37.840-07:00</updated><title type='text'>Web Mining Project Blog</title><subtitle type='html'>Robots Exclusion standard is a de-facto standard that is used to inform the crawlers about the disallowed sections of a web server. Although the standard has been there for almost a decade, extensive research regarding its usage has not been done. This project is to perform a statistical analysis of the usage of the above standard.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://sjb659.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://sjb659.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>An Analysis of the Usage Statistics of Robots Excl</name><uri>http://www.blogger.com/profile/16027129793969577539</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>7</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-21734243.post-114559578425379573</id><published>2006-04-20T21:52:00.000-07:00</published><updated>2006-04-20T22:30:54.190-07:00</updated><title type='text'>Result of the Project</title><content type='html'>These are the results we have obtained from our crawling. So in summery about 22% percent of the web uses Robots exclusion standard while 14% of the content is hidden.&lt;br /&gt;&lt;br /&gt;The most interesting observation is the amount of errors present in robots.txt. About 20% of the robots.txt's we have crawled has errors in them. Although the de-facto standard was there for about a decade still there seems to be no proper agreement for the correctness.&lt;br /&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;Use of Robots Exclusion Standard in Different Domains&lt;/span&gt;&lt;/p&gt;  &lt;table id="table6" style="width: 163pt; border-collapse: collapse;" str="" border="0" cellpadding="0" cellspacing="0" width="217"&gt;  &lt;colgroup&gt;&lt;col style="width: 81pt;" width="108"&gt;&lt;col style="width: 82pt;" width="109"&gt;&lt;/colgroup&gt; &lt;tbody&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border: 0.5pt solid rgb(0, 128, 0); padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 81pt; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" bordercolorlight="#000000" height="17" width="108"&gt;Domain&lt;/td&gt; &lt;td style="border-style: solid solid solid none; border-color: rgb(0, 128, 0); border-width: 0.5pt 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 82pt; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" bordercolorlight="#000000" width="109"&gt;%&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" bordercolorlight="#000000" height="17"&gt;com&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" bordercolorlight="#000000" fmla="=ROUND(D4/C4*100,2)" align="right"&gt;23.69&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" bordercolorlight="#000000" height="17"&gt;org&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" bordercolorlight="#000000" fmla="=ROUND(D5/C5*100,2)" align="right"&gt;20.18&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" bordercolorlight="#000000" height="17"&gt;net&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" bordercolorlight="#000000" fmla="=ROUND(D6/C6*100,2)" align="right"&gt;21.85&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" bordercolorlight="#000000" height="17"&gt;edu&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" bordercolorlight="#000000" fmla="=ROUND(D7/C7*100,2)" align="right"&gt;25.72&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" bordercolorlight="#000000" height="17"&gt;gov&lt;/td&gt; &lt;td style="border-style: none solid none none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt medium medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right"&gt;42.98&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" bordercolorlight="#000000" height="17"&gt;info&lt;/td&gt; &lt;td style="border-style: solid solid solid none; border-color: rgb(0, 128, 0); border-width: 0.5pt 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" bordercolorlight="#000000" fmla="=ROUND(D9/C9*100,2)" align="right"&gt;26.51&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" bordercolorlight="#000000" height="17"&gt;Total&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" bordercolorlight="#000000" fmla="=ROUND(D10/C10*100,2)" align="right"&gt;22.41&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt; &lt;/table&gt;  &lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;br /&gt;Hidden Fraction of the Web - Domain Wise&lt;/span&gt;&lt;/p&gt;  &lt;table id="table7" style="width: 221px; border-collapse: collapse;" str="" border="0" cellpadding="0" cellspacing="0"&gt;  &lt;colgroup&gt;&lt;col style="width: 48pt;" span="3" width="64"&gt;&lt;/colgroup&gt; &lt;tbody&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border: 0.5pt solid rgb(0, 128, 0); padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 48pt; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17" width="64"&gt;&lt;br /&gt;&lt;/td&gt; &lt;td style="border-style: solid solid solid none; border-color: rgb(0, 128, 0); border-width: 0.5pt 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 84px; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-align: center; text-decoration: none;"&gt;Document  wise&lt;/td&gt; &lt;td style="border-style: solid solid solid none; border-color: rgb(0, 128, 0); border-width: 0.5pt 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 66px; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-align: center; text-decoration: none;"&gt;Size  wise&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;Domain&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" width="84"&gt;%&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" width="66"&gt;%&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;info&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C17/B17*100,2)" align="right" width="84"&gt;18.75&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(F17/E17*100,2)" align="right" width="66"&gt;17.29&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;gov&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C18/B18*100,2)" align="right" width="84"&gt;8.87&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(F18/E18*100,2)" align="right" width="66"&gt;9.21&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;edu&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C19/B19*100,2)" align="right" width="84"&gt;11.9&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(F19/E19*100,2)" align="right" width="66"&gt;16.7&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;net&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C20/B20*100,2)" align="right" width="84"&gt;13.1&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(F20/E20*100,2)" align="right" width="66"&gt;12.26&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;org&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C21/B21*100,2)" align="right" width="84"&gt;11.29&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(F21/E21*100,2)" align="right" width="66"&gt;12.22&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;com&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C22/B22*100,2)" align="right" width="84"&gt;15.33&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(F22/E22*100,2)" align="right" width="66"&gt;16.62&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;Total&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C23/B23*100,2)" align="right" width="84"&gt;13.81&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(F23/E23*100,2)" align="right" width="66"&gt;14.65&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt; &lt;/table&gt;  &lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;br /&gt;Error and Warning Percentages in the robots.txt in different  domains&lt;/span&gt;&lt;/p&gt;  &lt;table id="table8" style="width: 222px; border-collapse: collapse;" str="" border="0" cellpadding="0" cellspacing="0"&gt;  &lt;colgroup&gt;&lt;col style="width: 48pt;" span="3" width="64"&gt;&lt;/colgroup&gt; &lt;tbody&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border: 0.5pt solid rgb(0, 128, 0); padding-right: 1px; padding-left: 1px; font-weight: 700; font-size: 10pt; vertical-align: bottom; width: 48pt; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17" width="64"&gt;Domain&lt;/td&gt; &lt;td style="border-style: solid solid solid none; border-color: rgb(0, 128, 0); border-width: 0.5pt 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 700; font-size: 10pt; vertical-align: bottom; width: 84px; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;"&gt;Err  %&lt;/td&gt; &lt;td style="border-style: solid solid solid none; border-color: rgb(0, 128, 0); border-width: 0.5pt 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 700; font-size: 10pt; vertical-align: bottom; width: 67px; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;"&gt;War  %&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(F17/E17*100,2)" height="17"&gt;com&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C29/B29*100,2)" align="right" width="84"&gt;21.02&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(D29/B29*100,2)" align="right" width="67"&gt;30.83&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(F18/E18*100,2)" height="17"&gt;net&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C30/B30*100,2)" align="right" width="84"&gt;21.12&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(D30/B30*100,2)" align="right" width="67"&gt;36.59&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(F19/E19*100,2)" height="17"&gt;info&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C31/B31*100,2)" align="right" width="84"&gt;21.05&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(D31/B31*100,2)" align="right" width="67"&gt;41.8&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(F20/E20*100,2)" height="17"&gt;edu&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C32/B32*100,2)" align="right" width="84"&gt;18.63&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(D32/B32*100,2)" align="right" width="67"&gt;35.08&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(F21/E21*100,2)" height="17"&gt;gov&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C33/B33*100,2)" align="right" width="84"&gt;13.37&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(D33/B33*100,2)" align="right" width="67"&gt;22.28&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(F22/E22*100,2)" height="17"&gt;org&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C34/B34*100,2)" align="right" width="84"&gt;18.01&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(D34/B34*100,2)" align="right" width="67"&gt;36.2&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(F23/E23*100,2)" height="17"&gt;Total&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(C35/B35*100,2)" align="right" width="84"&gt;20.05&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" fmla="=ROUND(D35/B35*100,2)" align="right" width="67"&gt;33.81&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt; &lt;/table&gt;  &lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;br /&gt;Warning Types and their percentages&lt;/span&gt;&lt;/p&gt;  &lt;table id="table9" style="width: 218px; border-collapse: collapse;" str="" border="0" cellpadding="0" cellspacing="0"&gt;  &lt;colgroup&gt;&lt;col style="width: 48pt;" span="2" width="64"&gt;&lt;/colgroup&gt; &lt;tbody&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border: 0.5pt solid rgb(0, 128, 0); padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 135px; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" height="17"&gt;Error Type:&lt;/td&gt; &lt;td style="border-style: solid solid solid none; border-color: rgb(0, 128, 0); border-width: 0.5pt 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 76px; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;"&gt;%&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(D29/B29*100,2)" height="17" width="135"&gt;Capitalization&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right" width="76"&gt;30.72&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(D30/B30*100,2)" height="17" width="135"&gt;No user agent&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right" width="76"&gt;22.59&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(D31/B31*100,2)" height="17" width="135"&gt;Unrecognized Line&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right" width="76"&gt;43.55&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-decoration: none;" fmla="=ROUND(D32/B32*100,2)" height="17" width="135"&gt;White space&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right" width="76"&gt;3.13&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt; &lt;/table&gt;  &lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;br /&gt;Warning Types and their percentages &lt;/span&gt;&lt;/p&gt;  &lt;table id="table10" style="width: 251pt; border-collapse: collapse;" str="" border="0" cellpadding="0" cellspacing="0" width="335"&gt;  &lt;colgroup&gt;&lt;col style="width: 179pt;" width="239"&gt;&lt;col style="width: 72pt;" width="96"&gt;&lt;/colgroup&gt; &lt;tbody&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border: 0.5pt solid rgb(0, 128, 0); padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 179pt; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-align: left; text-decoration: none;" height="17" width="239"&gt;Warning Type:&lt;/td&gt; &lt;td style="border-style: solid solid solid none; border-color: rgb(0, 128, 0); border-width: 0.5pt 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 72pt; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" width="96"&gt;%&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-align: left; text-decoration: none;" height="17"&gt;Paths should be absolute&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right"&gt;53.79&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 179pt; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: normal; height: 12.75pt; text-align: left; text-decoration: none;" height="17" width="239"&gt;Allow is not widely supported&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right"&gt;2.07&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; width: 179pt; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: normal; height: 12.75pt; text-align: left; text-decoration: none;" height="17" width="239"&gt;No restrictions&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right"&gt;20.47&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-align: left; text-decoration: none;" height="17"&gt;Unrecognized field&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right"&gt;9.79&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-align: left; text-decoration: none;" height="17"&gt;Space In path&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right"&gt;8.44&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-align: left; text-decoration: none;" height="17"&gt;Wildcards aren't supported&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right"&gt;3.48&lt;/td&gt;&lt;/tr&gt; &lt;tr style="height: 12.75pt;" height="17"&gt; &lt;td style="border-style: none solid solid; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; height: 12.75pt; text-align: left; text-decoration: none;" height="17"&gt;Repeated User agent&lt;/td&gt; &lt;td style="border-style: none solid solid none; border-color: rgb(0, 128, 0); border-width: medium 0.5pt 0.5pt medium; padding-right: 1px; padding-left: 1px; font-weight: 400; font-size: 10pt; vertical-align: bottom; color: windowtext; padding-top: 1px; font-style: normal; font-family: Arial,sans-serif; white-space: nowrap; text-decoration: none;" align="right"&gt;1.96&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt; &lt;/table&gt;  &lt;p&gt; &lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21734243-114559578425379573?l=sjb659.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sjb659.blogspot.com/feeds/114559578425379573/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=21734243&amp;postID=114559578425379573' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/114559578425379573'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/114559578425379573'/><link rel='alternate' type='text/html' href='http://sjb659.blogspot.com/2006/04/result-of-project.html' title='Result of the Project'/><author><name>An Analysis of the Usage Statistics of Robots Excl</name><uri>http://www.blogger.com/profile/16027129793969577539</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-21734243.post-114473140252460285</id><published>2006-04-10T21:45:00.000-07:00</published><updated>2006-04-10T21:56:43.126-07:00</updated><title type='text'>Update</title><content type='html'>This is a brief update of what we have done :&lt;br /&gt;&lt;br /&gt;After our intial data set which we felt was biased towards sites that have robots.txts, we decided to increase the data set.To do so, we got the RDF from DMOZ  and classified URLs into different domains.In each domain, we pinged every site for existence of robots.txt upto a maximum of  50000 sites in each domain.&lt;br /&gt;&lt;br /&gt;For those sites that had robots.txts, we crawled 2 levels completely using JoBo to get the size and usage statistics of robots.txt.Upto a maximum of 1000 random websites which had robots.txt for each domain or whichever is maximum.&lt;br /&gt;&lt;br /&gt;For all the sites that had robots.txt, we validated the same using the validation logic in the below website :&lt;br /&gt;&lt;a href="http://www.sxw.org.uk/computing/robots/check.html"&gt;http://www.sxw.org.uk/computing/robots/check.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;A total of 30000 robots.txts have been validated using an automated testing tool called iMacros Browser.We would like to mention that we got an academic trial lisence for 30 days that allowed up to use the tool for such a huge number..Thanks to iOpus,  (&lt;a href="http://www.iopus.com"&gt;www.iopus.com&lt;/a&gt;) for giving us the same which otherwise would have costed $500.&lt;br /&gt;&lt;br /&gt;Other testing tools like WinRunner, TestComplete were considered but abandoned due to either being very heavy weight and lack of a trial version that would automate such a large data set.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21734243-114473140252460285?l=sjb659.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sjb659.blogspot.com/feeds/114473140252460285/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=21734243&amp;postID=114473140252460285' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/114473140252460285'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/114473140252460285'/><link rel='alternate' type='text/html' href='http://sjb659.blogspot.com/2006/04/update.html' title='Update'/><author><name>An Analysis of the Usage Statistics of Robots Excl</name><uri>http://www.blogger.com/profile/16027129793969577539</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-21734243.post-114159402210087776</id><published>2006-03-05T13:23:00.000-08:00</published><updated>2006-03-05T13:27:08.520-08:00</updated><title type='text'>Moved the code to svn.</title><content type='html'>We have set up a svn repository for the modified crawler (jobo).&lt;br /&gt;Here is the link. &lt;a href="http://svn2.cvsdude.com/jaliya/robo"&gt;http://svn2.cvsdude.com/jaliya/robo&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21734243-114159402210087776?l=sjb659.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sjb659.blogspot.com/feeds/114159402210087776/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=21734243&amp;postID=114159402210087776' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/114159402210087776'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/114159402210087776'/><link rel='alternate' type='text/html' href='http://sjb659.blogspot.com/2006/03/moved-code-to-svn.html' title='Moved the code to svn.'/><author><name>An Analysis of the Usage Statistics of Robots Excl</name><uri>http://www.blogger.com/profile/16027129793969577539</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-21734243.post-114135979105760268</id><published>2006-03-02T20:13:00.000-08:00</published><updated>2006-03-02T20:23:11.360-08:00</updated><title type='text'>Project update</title><content type='html'>Identified and finalized the web crawler to use to crawl the web.Jobo in written in Java and can be easily customised for the requirements of our project.&lt;br /&gt;&lt;br /&gt;Customizations include :&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Performing the crawl in many threads to complete the crawl sooner&lt;/li&gt;&lt;li&gt;Crawl the URLs in the robots.txt.This would violate the robot exclusion standards but our goal is to collect statistics and analyse the same.Also we would crawl just once and then analyse the data&lt;/li&gt;&lt;li&gt;Collect the data in a MySql database in various tables to store the URLs crawled(allowed and disallowed) and their sizes, the contents of robots.txt and the host they belong to&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;To do :&lt;/p&gt;&lt;ul&gt;&lt;li&gt;For greater efficiency, have a pool of database connection to perform database operations&lt;/li&gt;&lt;li&gt;Eliminate unwanted code in Jobo that is not being used by our application&lt;/li&gt;&lt;li&gt;Remove large buffers that currently cause 'Out of memory' errors when the crawler crawls beyond 2000 URLs.Examine whether any other issue causes this error &lt;/li&gt;&lt;/ul&gt;&lt;p&gt;2)Process the data : Once we collect the data , think about how we could analyse the same and discuss with Fil before commencement of spring break and implement the same.&lt;/p&gt;&lt;p&gt; &lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21734243-114135979105760268?l=sjb659.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sjb659.blogspot.com/feeds/114135979105760268/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=21734243&amp;postID=114135979105760268' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/114135979105760268'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/114135979105760268'/><link rel='alternate' type='text/html' href='http://sjb659.blogspot.com/2006/03/project-update.html' title='Project update'/><author><name>An Analysis of the Usage Statistics of Robots Excl</name><uri>http://www.blogger.com/profile/16027129793969577539</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-21734243.post-113954411394834363</id><published>2006-02-09T19:39:00.000-08:00</published><updated>2006-02-09T20:08:49.936-08:00</updated><title type='text'>Initial Project Plan , Goals and Deadlines</title><content type='html'>&lt;div align="justify"&gt;1) 13th Feb 2006&lt;/div&gt;&lt;div align="justify"&gt; &lt;/div&gt;&lt;div align="justify"&gt;Shortlist a crawler and compare to see which is easier to modify,observe results.&lt;/div&gt;&lt;div align="justify"&gt;&lt;/div&gt;&lt;div align="justify"&gt;As part of this, Jaliya would evaluate the basic Java web crawler and I shall evaluate another open source crawler called JoBo.&lt;/div&gt;&lt;div align="justify"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div align="justify"&gt;2)17th Feb 2006&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&lt;div align="justify"&gt;Basic design for the application&lt;/div&gt;&lt;/li&gt;&lt;li&gt;&lt;div align="justify"&gt;Set up SVN on CS server for the project - Smitha&lt;/div&gt;&lt;/li&gt;&lt;li&gt;&lt;div align="justify"&gt;Long term deadlines for the project tasks&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div align="justify"&gt;&lt;/div&gt;&lt;div align="justify"&gt;3)What next? &lt;/div&gt;&lt;div align="justify"&gt; &lt;/div&gt;&lt;div align="justify"&gt;&lt;/div&gt;&lt;div align="justify"&gt;These are the main steps involved as part of the project :&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&lt;div align="justify"&gt;Modify the crawler to download 'robots.txt' . Crawl the site to get the entire size of the crawlable portion of the site.To start with, the 'Open Directory' shall be used as the seed URL.&lt;/div&gt;&lt;/li&gt;&lt;li&gt;&lt;div align="justify"&gt;Continue crawling all web sites until a fixed number of robots.txt have been downloaded(200 robots.txt files which can be configurable)&lt;/div&gt;&lt;/li&gt;&lt;li&gt;&lt;div align="justify"&gt;Another process would examine the robots.txt to get the size of the sites in robots.txt which represents the size of the site that is disallowed for crawling. This could be done using html header or through a perl interface.&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p align="justify"&gt;     Steps involved in getting the size of the sites blocked by robots.txt :&lt;/p&gt;&lt;p align="justify"&gt;     a)For each of the URLs in the robots.txt, crawl that URL and get all the sites in that URL&lt;/p&gt;&lt;p align="justify"&gt;     b)For each of these sites, check if initial URL matches that of the blocked URL.&lt;/p&gt;&lt;p align="justify"&gt;     c)If so, consider that site to calculate the size, else ignore that site&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;div align="justify"&gt;Another process would validate the robots.txt to check for correctness&lt;/div&gt;&lt;/li&gt;&lt;li&gt;&lt;div align="justify"&gt;Analysis and collating together statistical information namely, the domains that use robots.txt, what type of content is typically hidden(cgi-bin), what is the page rank/importance of the sites that use valid/invalid robots.txt&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21734243-113954411394834363?l=sjb659.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sjb659.blogspot.com/feeds/113954411394834363/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=21734243&amp;postID=113954411394834363' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/113954411394834363'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/113954411394834363'/><link rel='alternate' type='text/html' href='http://sjb659.blogspot.com/2006/02/initial-project-plan-goals-and.html' title='Initial Project Plan , Goals and Deadlines'/><author><name>An Analysis of the Usage Statistics of Robots Excl</name><uri>http://www.blogger.com/profile/16027129793969577539</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-21734243.post-113868268181812808</id><published>2006-01-30T20:42:00.000-08:00</published><updated>2006-01-30T20:47:13.226-08:00</updated><title type='text'>The ROBOT Meta tag</title><content type='html'>&lt;p face="arial" class="MsoNormal"&gt;This is used as an easy alternative to robots.txt, to specify whether a robot can access and index a particular web page. This is done through a ‘Robots’ &lt;st1:place st="on"&gt;META&lt;/st1:place&gt; tag[4] which can be contained in the head of an html document as shown below&lt;o:p&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;/p&gt;             &lt;p style="font-family: arial;" class="MsoNormal"&gt;&amp;lt;&lt;st1:place st="on"&gt;META&lt;/st1:place&gt;&lt;br /&gt;NAME=&amp;quot;ROBOTS&amp;quot; CONTENT=&amp;quot;NOINDEX,NOFOLLOW&amp;quot;&amp;gt;&lt;br /&gt;&lt;br /&gt;&amp;lt;&lt;st1:place st="on"&gt;META&lt;/st1:place&gt; NAME=&amp;quot;DESCRIPTION&amp;quot; CONTENT=&amp;quot;THIS PAGE&lt;br /&gt;....&amp;quot;&amp;gt;&lt;o:p&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;/p&gt;   &lt;p style="font-family: arial;" class="MsoNormal"&gt;As part of the project, we would analyze the html content of web pages to find out whether it contains a ROBOT tag and evaluate the portion of the web that uses these tags as against the robots.txt.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21734243-113868268181812808?l=sjb659.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sjb659.blogspot.com/feeds/113868268181812808/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=21734243&amp;postID=113868268181812808' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/113868268181812808'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/113868268181812808'/><link rel='alternate' type='text/html' href='http://sjb659.blogspot.com/2006/01/robot-meta-tag.html' title='The ROBOT Meta tag'/><author><name>An Analysis of the Usage Statistics of Robots Excl</name><uri>http://www.blogger.com/profile/16027129793969577539</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-21734243.post-113867542378862672</id><published>2006-01-30T18:40:00.000-08:00</published><updated>2006-01-30T18:43:43.790-08:00</updated><title type='text'>Project Proposal</title><content type='html'>&lt;p style="font-family: arial;" class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="font-size:130%;"&gt;Project Goals:&lt;br /&gt;&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;br /&gt;Robots Exclusion standard [1] is a de-facto standard that is used to inform the crawlers about the disallowed sections of a web server. It has been in general use since mid nineteen ninety and heavily used to limit the access of pages or sections such as very deep virtual trees, duplicated information, temporary information, or cgi-scripts with side-effects (such as voting). From the aspect of a crawler this is a voluntary standard since it does not provide a mechanism to stop crawlers from accessing disallowed section. However, most crawlers adopt this standard and obey its rules.&lt;o:p&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;/p&gt;     &lt;p style="font-family: arial;" class="MsoNormal"&gt;Although the standard has been there for almost a decade, extensive research regarding its usage has not been done. As part of the project, we will perform a statistical analysis of the usage of the above standard and we will explore the following areas ;&lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/p&gt; &lt;ul style="font-family: arial;"&gt;   &lt;li&gt;Usage      of the standard – what percent of the web use (follow) the above standard.&lt;/li&gt;   &lt;li&gt;Hidden      Web – What percentage of the web is covered or hidden for the robots&lt;/li&gt;   &lt;li&gt;Accuracy      of the de-facto standard – Analyze the accuracy of the robots.txt      documents that we may collect during the research to come with a      statistical figure of their accuracy&lt;/li&gt;   &lt;li&gt;Other      means of preventing robots such as ‘Robots’ &lt;st1:place st="on"&gt;META&lt;/st1:place&gt;      tag[4]&lt;o:p&gt;&lt;br /&gt;    &lt;/o:p&gt;&lt;/li&gt; &lt;/ul&gt;         &lt;p style="font-family: arial;" class="MsoNormal"&gt;Based on the results of the analysis, we plan to recommend that the robots.txt be accepted as an official standard.&lt;b style=""&gt;&lt;o:p&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;     &lt;p style="font-family: arial;" class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="font-size:130%;"&gt;Implementation options:&lt;/span&gt;&lt;/b&gt;&lt;o:p&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;/p&gt;       &lt;p style="font-family: arial;" class="MsoNormal"&gt;Currently we are considering two approaches to get the required information using crawlers namely:&lt;br /&gt;&lt;/p&gt; &lt;ul style="font-family: arial;"&gt;   &lt;li&gt;Crawl      a random collection of sites from different domains and get the      statistics.&lt;br /&gt;The results of this approach will depend on the quality and the breadth of the sample that we select for crawling.&lt;/li&gt;   &lt;li&gt;&lt;o:p&gt;&lt;/o:p&gt;Crawl      the web up to a certain maximum and collect the required information in      the process of crawling. This depends on the amount of the sites that we      crawl during the research. This approach provides the flexibility of using      the crawler with different maximum values and hence will be able to      improve the correctness of the results.&lt;o:p&gt;&lt;br /&gt;    &lt;/o:p&gt;&lt;/li&gt; &lt;/ul&gt;             &lt;p style="font-family: arial;" class="MsoNormal"&gt;We will use an open source crawler such as Apache Nutch [2] or Heritix [3] and enhance the same to suite the project’s requirements. Modifications would be done to calculate the size of the web pages that the crawler comes across and to calculate the size of the disallowed portion of the web. This can be achieved by downloading the HTTP header contents without downloading the entire web page. In addition the crawler will save the robots.txt for various sites and later they will be used to check the correctness using a valuator.&lt;o:p&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;/p&gt;   &lt;p style="font-family: arial;" class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="font-size:130%;"&gt;References:&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;     &lt;p style="font-family: arial;" class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;br /&gt;[1] Robots Exclusion Standard, http://www.robotstxt.org/wc/norobots.html&lt;/p&gt;   &lt;p style="font-family: arial;" class="MsoNormal"&gt;[2] Apache Nutch, http://lucene.apache.org/nutch/&lt;span style=""&gt;  &lt;/span&gt;&lt;/p&gt;   &lt;p style="font-family: arial;" class="MsoNormal"&gt;[3] Heritix, &lt;a href="http://crawler.archive.org/"&gt;http://crawler.archive.org/&lt;/a&gt;&lt;/p&gt;   &lt;p style="font-family: arial;" class="MsoNormal"&gt;[4] Robot &lt;st1:place st="on"&gt;Meta&lt;/st1:place&gt; Tag&lt;span style=""&gt;  &lt;/span&gt;http://www.searchengineworld.com/metatag/robots.htm&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21734243-113867542378862672?l=sjb659.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sjb659.blogspot.com/feeds/113867542378862672/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=21734243&amp;postID=113867542378862672' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/113867542378862672'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/21734243/posts/default/113867542378862672'/><link rel='alternate' type='text/html' href='http://sjb659.blogspot.com/2006/01/project-proposal.html' title='Project Proposal'/><author><name>An Analysis of the Usage Statistics of Robots Excl</name><uri>http://www.blogger.com/profile/16027129793969577539</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
