#1
About 8 months ago I set up a Wikipedia mirror. I also let search engines crawl it. In return, I got about $10 per day in AdSense earnings and an incredible amount of hassle. Googlebot is completely out of control and was mercilessly hammering my website. It does around 4 queries per second. I think that I pay about as much in bandwidth as I make, plus I have a big headache. So, I decided to keep the Wikipedia mirror (I use it as content for some of my chapters), but I will no longer let search engines, especially the badly behaving Googlebot, index it. Last night, I made changes to robots.txt; so far no effect. I tried using sitemaps to tell Googlebot not to crawl a page more than once per month, but that only made it worse and bolder.
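
For anyone checking a similar setup: a quick way to confirm that a robots.txt really blocks the mirror is to run it through Python's standard `urllib.robotparser`. This is only a sketch. The `/wiki/` prefix, the `example.com` host, and the `is_allowed` helper are assumptions made up for illustration, not the actual layout of the site. It only tests what the rules say, not whether a given crawler has re-read the file yet.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: "/wiki/" is an assumed path for the mirror,
# not the real one. It blocks every crawler from that prefix only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wiki/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given rules let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # example.com is a placeholder host.
    for url in ("http://example.com/wiki/Main_Page",
                "http://example.com/index.html"):
        print(url, is_allowed(ROBOTS_TXT, "Googlebot", url))
```

If the mirror pages come back as disallowed here but the crawler keeps fetching them, the rules themselves are probably fine and the crawler may simply be working from an older copy of the file.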
#2
I already tried crawl-delay, to no effect.