![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
#2
| |||
| |||
|
|
I have recently noticed some of my pages showing up in the Google cache even though the page contained a "noindex" META Tag. These are private pages for inter office use and are not meant for public display. Is there another META tag that will prevent Google from caching these pages? Since the pages are not meant for public view, I have just re-named the files so anyone that may click them from Google will just get my not found page. My problem is that I really have no way to keep up with pages which Google has ignored my noindex META. I have now included the noarchive meta in the hopes the Googlebot might understand that one. Any suggestions? Rik P.S. I posted this question in the public forum but received no suggestions so I'm hoping some-one in here may have run in to this problem. Please pardon my cross post. |
#3
| |||
| |||
|
|
I have recently noticed some of my pages showing up in the Google cache even though the page contained a "noindex" META Tag. These are private pages for inter office use and are not meant for public display. Is there another META tag that will prevent Google from caching these pages? Since the pages are not meant for public view, I have just re-named the files so anyone that may click them from Google will just get my not found page. My problem is that I really have no way to keep up with pages which Google has ignored my noindex META. I have now included the noarchive meta in the hopes the Googlebot might understand that one. Any suggestions? |
#4
| |||
| |||
|
|
On 7 Jun 2006 14:54:58 -0700, "Rik" <rik (AT) rmcaudio (DOT) com> wrote: I have recently noticed some of my pages showing up in the Google cache even though the page contained a "noindex" META Tag. These are private pages for inter office use and are not meant for public display. Is there another META tag that will prevent Google from caching these pages? Since the pages are not meant for public view, I have just re-named the files so anyone that may click them from Google will just get my not found page. My problem is that I really have no way to keep up with pages which Google has ignored my noindex META. I have now included the noarchive meta in the hopes the Googlebot might understand that one. Any suggestions? Use robots.txt to exclude the files. You can't do this with the old ones but you can with any new ones you make. You know robots.txt? Have a read; http://www.robotstxt.org/wc/robots.html BB |
#5
| |||
| |||
|
|
On 7 Jun 2006 14:54:58 -0700, "Rik" <rik (AT) rmcaudio (DOT) com> wrote: I have recently noticed some of my pages showing up in the Google cache even though the page contained a "noindex" META Tag. These are private pages for inter office use and are not meant for public display. Is there another META tag that will prevent Google from caching these pages? Since the pages are not meant for public view, I have just re-named the files so anyone that may click them from Google will just get my not found page. My problem is that I really have no way to keep up with pages which Google has ignored my noindex META. I have now included the noarchive meta in the hopes the Googlebot might understand that one. Any suggestions? Rik P.S. I posted this question in the public forum but received no suggestions so I'm hoping some-one in here may have run in to this problem. Please pardon my cross post. What about password protected files ? plh Paul |
#6
| ||||||
| ||||||
|
|
Paul wrote: On 7 Jun 2006 14:54:58 -0700, "Rik" <rik (AT) rmcaudio (DOT) com> wrote: I have recently noticed some of my pages showing up in the Google cache |
|
even though the page contained a "noindex" META Tag. These are private pages for inter office use and are not meant for public display. Is there another META tag that will prevent Google from caching these pages? |
|
Since the pages are not meant for public view, I have just re-named the files so anyone that may click them from Google will just get my not found page. My problem is that I really have no way to keep up with pages which Google has ignored my noindex META. I have now included the noarchive meta in the hopes the Googlebot might understand that one. Any suggestions? |
|
Rik P.S. I posted this question in the public forum but received no suggestions so I'm hoping some-one in here may have run in to this problem. Please pardon my cross post. |
|
What about password protected files ? |
|
The page that contains the links leading to our private pages is password protected. That navagation page resides in a folder that is disallowed through my robots.txt file. The private pages in question reside in folders that contain public pages so I was afraid to disallow anything in those folders using the robots.txt file for fear of the bot ignoring the folder. That's why I chose to use the noindex meta on the individual pages. Is it common for Google to ignore META tags like the noindex,noarchive I am currently using? I have seen Google ignore my robots.txt file before but this is the first time I have seen them ignore the noindex command. |
#7
| |||
| |||
|
|
I notice that Paul (or you) has reduced the distributions |
#8
| |||
| |||
|
|
Paul wrote: On 7 Jun 2006 14:54:58 -0700, "Rik" <rik (AT) rmcaudio (DOT) com> wrote: I have recently noticed some of my pages showing up in the Google cache even though the page contained a "noindex" META Tag. These are private pages for inter office use and are not meant for public display. Is there another META tag that will prevent Google from caching these pages? Since the pages are not meant for public view, I have just re-named the files so anyone that may click them from Google will just get my not found page. My problem is that I really have no way to keep up with pages which Google has ignored my noindex META. I have now included the noarchive meta in the hopes the Googlebot might understand that one. Any suggestions? Rik P.S. I posted this question in the public forum but received no suggestions so I'm hoping some-one in here may have run in to this problem. Please pardon my cross post. What about password protected files ? plh Paul The page that contains the links leading to our private pages is password protected. That navagation page resides in a folder that is disallowed through my robots.txt file. The private pages in question reside in folders that contain public pages so I was afraid to disallow anything in those folders using the robots.txt file for fear of the bot ignoring the folder. That's why I chose to use the noindex meta on the individual pages. Is it common for Google to ignore META tags like the noindex,noarchive I am currently using? I have seen Google ignore my robots.txt file before but this is the first time I have seen them ignore the noindex command. |
#9
| |||
| |||
|
|
On Wed, 07 Jun 2006 23:25:33 GMT, Big Bill <kruse (AT) cityscape (DOT) co.uk wrote: On 7 Jun 2006 14:54:58 -0700, "Rik" <rik (AT) rmcaudio (DOT) com> wrote: I have recently noticed some of my pages showing up in the Google cache even though the page contained a "noindex" META Tag. These are private pages for inter office use and are not meant for public display. Is there another META tag that will prevent Google from caching these pages? Since the pages are not meant for public view, I have just re-named the files so anyone that may click them from Google will just get my not found page. My problem is that I really have no way to keep up with pages which Google has ignored my noindex META. I have now included the noarchive meta in the hopes the Googlebot might understand that one. Any suggestions? Use robots.txt to exclude the files. You can't do this with the old ones but you can with any new ones you make. You know robots.txt? Have a read; http://www.robotstxt.org/wc/robots.html BB Only works with good bots though BB. password protected is far better. plh Paul -- ----== Posted via Newsfeeds.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.newsfeeds.com The #1 Newsgroup Service in the World! 120,000+ Newsgroups ----= East and West-Coast Server Farms - Total Privacy via Encryption =---- |
#10
| |||
| |||
|
|
"Paul" <lamewolf2004[REMOVE]@yahoo.com> wrote in message news:ejoe821oq38jo6oatjs6162ikda8h70jgh (AT) 4ax (DOT) com... On Wed, 07 Jun 2006 23:25:33 GMT, Big Bill <kruse (AT) cityscape (DOT) co.uk wrote: On 7 Jun 2006 14:54:58 -0700, "Rik" <rik (AT) rmcaudio (DOT) com> wrote: I have recently noticed some of my pages showing up in the Google cache even though the page contained a "noindex" META Tag. These are private pages for inter office use and are not meant for public display. Is there another META tag that will prevent Google from caching these pages? Since the pages are not meant for public view, I have just re-named the files so anyone that may click them from Google will just get my not found page. My problem is that I really have no way to keep up with pages which Google has ignored my noindex META. I have now included the noarchive meta in the hopes the Googlebot might understand that one. Any suggestions? Use robots.txt to exclude the files. You can't do this with the old ones but you can with any new ones you make. You know robots.txt? Have a read; http://www.robotstxt.org/wc/robots.html BB Only works with good bots though BB. password protected is far better. plh Paul -- ----== Posted via Newsfeeds.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.newsfeeds.com The #1 Newsgroup Service in the World! 120,000+ Newsgroups ----= East and West-Coast Server Farms - Total Privacy via Encryption =---- See this http://www.google.co.uk/intl/en/webmasters/remove.html Also, it is a good idea to have a default home page in every subdirectory (e.g. called index.html) to prevent the server revealing listings of all files present. Best regards, Eric. |
![]() |
| Thread Tools | |
| Display Modes | |
| |