![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
|
About 8 months ago I set a up a wikipedia mirror. I also let search engines crawl it. It return, I got about $10 per day adsense earnings and an incredible amount of hassle. Googlebot is completely out of control and was mercilessly hammering my website. It does around 4 queries per second. I think that I pay in bandwidth about as much as I make, plus I have a big headache. So, I decided to keep wikipedia mirror (I use it as content for some of my chapters), but I will no longer let search engines, especially the badly behaving googlebot, index them. Last night, I made changes for robots.txt, so far no effect. I tried using sitemaps to tell googlebot not to crawl page more than 1x per months, but that made it only worse and bolder. i |
#2
| |||
| |||
|
|
On Mon, 06 Mar 2006 19:32:17 GMT, Ignoramus23035 ignoramus23035 (AT) NOSPAM (DOT) 23035.invalid> wrote: ... Last night, I made changes for robots.txt, so far no effect. I tried using sitemaps to tell googlebot not to crawl page more than 1x per months, but that made it only worse and bolder. Take the pages down for a bit, then put them back up again. Let the Googlebot get the idea that they aren't there... |
#3
| |||
| |||
|
|
Big Bill <kruse (AT) cityscape (DOT) co.uk> said: Take the pages down for a bit, then put them back up again. Let the Googlebot get the idea that they aren't there... how long is a bit? |
#4
| |||
| |||
|
|
Last night, I made changes for robots.txt, so far no effect. I tried using sitemaps to tell googlebot not to crawl page more than 1x per months, but that made it only worse and bolder. Take the pages down for a bit, then put them back up again. Let the Googlebot get the idea that they aren't there... how long is a bit? |

#5
| |||
| |||
|
|
Take the pages down for a bit, then put them back up again. Let the Googlebot get the idea that they aren't there. Also validate your robots.txt. That's an interesting idea. If I can get googlebot to crawl a lot less often, I would certainly like to resume. |
#6
| |||
| |||
|
|
Fleeing from the madness of the NTL jungle Big Bill <kruse (AT) cityscape (DOT) co.uk> stumbled into news:alt.internet.search-engines,alt.www.webmaster and said: On Mon, 06 Mar 2006 19:32:17 GMT, Ignoramus23035 ignoramus23035 (AT) NOSPAM (DOT) 23035.invalid> wrote: ... Last night, I made changes for robots.txt, so far no effect. I tried using sitemaps to tell googlebot not to crawl page more than 1x per months, but that made it only worse and bolder. Take the pages down for a bit, then put them back up again. Let the Googlebot get the idea that they aren't there... how long is a bit? |
![]() |
| Thread Tools | |
| Display Modes | |
| |