#1
#2
At around 09:00 UTC this morning. First it tried to get 0T717Q3K81F45P9CHK78.htm - obviously a 404 functionality test. Then it proceeded to download the site. Yes, all of it. We'll see what turns up in the SERPs.
#3
__/ [ Phil Payne ] on Friday 05 May 2006 11:17 \__

> At around 09:00 UTC this morning. First it tried to get 0T717Q3K81F45P9CHK78.htm - obviously a 404 functionality test. Then it proceeded to download the site. Yes, all of it. We'll see what turns up in the SERPs.

How many pages in total?

Googlebot never appears to do 404 tests. Neither do MSNBot, Yahoo/Inktomi Slurp and other notable spiders (although Yahoo's crawler used to be buggy enough to request the wrong files from the wrong sites).

What I am trying to suggest is that somebody may have forged the user-agent. It's very simple to do. It gives a cloak of stealth to someone wishing to rip off your site entirely, possibly using a grabber, e.g.

    wget -r --user-agent="Googlebot whatever..." your_site_URL

Best wishes,

Roy
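
Since the user-agent string proves nothing, one way to test a visitor that claims to be Googlebot is a forward-confirmed reverse DNS lookup. The following is only a rough sketch: it assumes genuine Googlebot addresses reverse-resolve to a name under googlebot.com or google.com, and the function name and the injectable lookup parameters are made up for illustration.

```python
import socket

# Assumption: real Googlebot hosts reverse-resolve under one of these domains.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_real_googlebot(ip, reverse_lookup=None, forward_lookup=None):
    """Return True only if ip passes a forward-confirmed reverse DNS check.

    reverse_lookup and forward_lookup are hypothetical hooks so the logic
    can be exercised without touching the network.
    """
    if reverse_lookup is None:
        reverse_lookup = lambda addr: socket.gethostbyaddr(addr)[0]
    if forward_lookup is None:
        forward_lookup = lambda host: socket.gethostbyname_ex(host)[2]

    try:
        host = reverse_lookup(ip)
    except OSError:
        # No PTR record at all: cannot be confirmed as Googlebot.
        return False

    if not host.lower().rstrip(".").endswith(GOOGLE_SUFFIXES):
        return False

    try:
        # The name must resolve back to the same address, otherwise
        # anyone controlling their own PTR record could fake it.
        return ip in forward_lookup(host)
    except OSError:
        return False
```

Calling is_real_googlebot("212.94.37.218") would perform live lookups; passing your own lookup functions lets you try the logic offline. The check is only as reliable as the DNS answers it receives.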
#4
#5
You may well be right. It doesn't look like a Google dotted quad:

2006-05-05 08:56:53 212.94.37.218 - W3SVC19 WWW4 217.161.12.181 80 GET /0T717Q3K81F45P9CHK78.htm - 404 2 4203 169 0 HTTP/1.1 www.hotlines.co.uk Mozilla/4.0+(compatible;+MSIE+6.0;+Windows+NT+5.1) - -

2006-05-05 08:58:30 212.94.37.218 - W3SVC19 WWW4 217.161.12.181 80 GET /index.html - 200 0 4543 168 453 HTTP/1.1 www.hotlines.co.uk Mozilla/5.0+(compatible;+Googlebot/2.42;++http://www.google.com/bot.html) - -

Oh, well. Here comes YET ANOTHER six-month penalty from Google for duplicate content.
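
The two log entries above show the same address using an MSIE user-agent for the 404 probe and a Googlebot user-agent two minutes later. A small script can flag that pattern across a whole log. This is a sketch under assumptions: the log carries no #Fields header here, so treating the third field as the client IP and the nineteenth as the user-agent is a guess based on the usual IIS W3C layout, and the function names are invented for the example.

```python
from collections import defaultdict

# The two entries quoted in the post above, verbatim.
LOG_LINES = [
    "2006-05-05 08:56:53 212.94.37.218 - W3SVC19 WWW4 217.161.12.181 80 "
    "GET /0T717Q3K81F45P9CHK78.htm - 404 2 4203 169 0 HTTP/1.1 "
    "www.hotlines.co.uk Mozilla/4.0+(compatible;+MSIE+6.0;+Windows+NT+5.1) - -",
    "2006-05-05 08:58:30 212.94.37.218 - W3SVC19 WWW4 217.161.12.181 80 "
    "GET /index.html - 200 0 4543 168 453 HTTP/1.1 www.hotlines.co.uk "
    "Mozilla/5.0+(compatible;+Googlebot/2.42;++http://www.google.com/bot.html) - -",
]

# Assumed field positions (0-based) for this particular log layout.
CLIENT_IP_FIELD = 2
USER_AGENT_FIELD = 18


def user_agents_by_client(lines):
    """Map each client IP to the set of user-agent strings it sent."""
    agents = defaultdict(set)
    for line in lines:
        if line.startswith("#"):
            continue  # skip W3C header directives such as #Fields
        fields = line.split()
        if len(fields) <= USER_AGENT_FIELD:
            continue  # malformed or truncated entry
        agents[fields[CLIENT_IP_FIELD]].add(fields[USER_AGENT_FIELD])
    return agents


def suspicious_googlebots(lines):
    """Return IPs that claim to be Googlebot but also sent another user-agent."""
    flagged = []
    for ip, agents in user_agents_by_client(lines).items():
        claims_googlebot = any("Googlebot" in agent for agent in agents)
        if claims_googlebot and len(agents) > 1:
            flagged.append(ip)
    return sorted(flagged)


print(suspicious_googlebots(LOG_LINES))  # prints ['212.94.37.218']
```

This is a heuristic, not proof: it only shows that one address changed its user-agent between requests, which a genuine crawler would have no obvious reason to do.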