![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
#2
| |||
| |||
|
|
Anyone know how to allow Google's robots to index protected content? My company has a site that requires a subscription to access the info, but we'd like to have google index those pages. I see there are many sites who've managed this. |
|
I can't allow by user-agent since my authentication software doesn't allow that. Is there any way to give Google a username and password? Or is there an IP, or range of IPs, that google uses? |
#3
| |||
| |||
|
|
Anyone know how to allow Google's robots to index protected content? My company has a site that requires a subscription to access the info, but we'd like to have google index those pages. I see there are many sites who've managed this. |
#4
| |||
| |||
|
#5
| |||
| |||
|
|
"Sholom" <sdeen (AT) diamonds (DOT) net> wrote: Anyone know how to allow Google's robots to index protected content? My company has a site that requires a subscription to access the info, but we'd like to have google index those pages. I see there are many sites who've managed this. Yup, it's called cloaking. I'll report it when I see it. I can't allow by user-agent since my authentication software doesn't allow that. Is there any way to give Google a username and password? Or is there an IP, or range of IPs, that google uses? Yes, and this might get you banned. |
#6
| |||
| |||
|
|
On Wed, 07 Jun 2006 20:54:28 +0200, Sholom <sdeen (AT) diamonds (DOT) net> wrote: Anyone know how to allow Google's robots to index protected content? My company has a site that requires a subscription to access the info, but we'd like to have google index those pages. I see there are many sites who've managed this. Easy way to get banned. I hate sites that are indexed but not accessible. Usually I do two things at the same time - first, I read cached content. Second, I report such site to Google. Best, Borek |
#7
| |||
| |||
|
|
Thanks to all for the replies. I had no idea this was such a sensitive issue. |
|
(As an aside, re the cache issue, I was under the impression that a "robots=nocache" meta tag prevents the search engine from showing a cached page.) |
#8
| |||
| |||
|
|
On Wed, 07 Jun 2006 20:54:28 +0200, Sholom <sdeen (AT) diamonds (DOT) net> wrote: Anyone know how to allow Google's robots to index protected content? My company has a site that requires a subscription to access the info, but we'd like to have google index those pages. I see there are many sites who've managed this. Easy way to get banned. I hate sites that are indexed but not accessible. Usually I do two things at the same time - first, I read cached content. Second, I report such site to Google. |
#9
| |||
| |||
|
|
__/ [ Borek ] on Wednesday 07 June 2006 20:03 \__ On Wed, 07 Jun 2006 20:54:28 +0200, Sholom <sdeen (AT) diamonds (DOT) net wrote: Anyone know how to allow Google's robots to index protected content? My company has a site that requires a subscription to access the info, but we'd like to have google index those pages. I see there are many sites who've managed this. Easy way to get banned. I hate sites that are indexed but not accessible. Usually I do two things at the same time - first, I read cached content. Second, I report such site to Google. There is a way around this. Change user-agent string to googlebot and you're in. |
|
To be honest, I didn't know this trick until somebody told me last week. |
#10
| |||
| |||
|
|
Roy Schestowitz <newsgroups (AT) schestowitz (DOT) com> wrote: __/ [ Borek ] on Wednesday 07 June 2006 20:03 \__ On Wed, 07 Jun 2006 20:54:28 +0200, Sholom <sdeen (AT) diamonds (DOT) net wrote: Anyone know how to allow Google's robots to index protected content? My company has a site that requires a subscription to access the info, but we'd like to have google index those pages. I see there are many sites who've managed this. Easy way to get banned. I hate sites that are indexed but not accessible. Usually I do two things at the same time - first, I read cached content. Second, I report such site to Google. There is a way around this. Change user-agent string to googlebot and you're in. If they check for that, yup. Some sites check for the crawlers, based on IP or name. |
|
To be honest, I didn't know this trick until somebody told me last week. Wasn't me, but 2+ years ago: http://johnbokma.com/mexit/2004/04/2...useragent.html Funny, I notice that I have a link to report spam with google on my site :-D My site is getting too big. Or maybe I should say: a site is getting good when you limit Google to your site when looking for some info (which I do now and then, I even made a special keymark for it :-D) |
![]() |
| Thread Tools | |
| Display Modes | |
| |