![]() | |
![]() |
| | Thread Tools | Display Modes |
#41
| ||||||||
| ||||||||
|
|
John Bokma <john (AT) castleamber (DOT) com> wrote in |
|
You tell me. I'm one of the several hundred thousand bloggers that have had their page rank bitchslapped by google, first to zero pr, then to status "unranked". |
|
Andy Beard's blog had an idea recently about using robots.txt to tell google not to index pages that contain the paid links that they're so upset about. I'm thinking that this gives the best of both worlds.. the advertiser gets their paid link that doesn't have rel="nofollow" on it and google gets told "don't crawl this page". |
|
The problem is with intermediaries that might decide this is no better than putting rel="nofollow" on the link (which I don't see that it is). The idea is to keep them from being able to read the robots.txt that is being given to Google should they think of it. |
|
Advertiser wants links without rel="nofollow" |
|
Google doesn't like paid links unless they have rel="nofollow". |
|
Please forgive my attitude earlier.... Real life is .... being a problem |
|
I'm not trying to defraud, I just want to get back to earning a living. I was doing pretty good untill the "BitchSlap of '07" |
#42
| |||
| |||
|
|
Don <lostinspace (AT) 123-universe (DOT) com> wrote: It's done ALL the time. What matters is that it's done for an appropiate reason and is accomplished server side. To me, what matters, is that a user doesn't click on a search result, and comes on a page that doesn't make the expected data available. |
|
webmasterworld (IIRC) did use cloaking, maybe still does. I've reported this several times to Google, but no use (unless it has been fixed) |
#43
| |||
| |||
|
|
Joe Fox <ny152 (AT) none (DOT) invalid> wrote in news:Xns9A4BD1118343D891563@ 127.0.0.1: Don <lostinspace (AT) 123-universe (DOT) com> wrote in news:Xns9A4BD360F498Clostinspace123univer (AT) 207 (DOT) 115.33.102: Joe Fox <ny152 (AT) none (DOT) invalid> wrote in news:Xns9A49EC6462EC5891563 (AT) 127 (DOT) 0.0.1: I'm using a robots.txt file to control what is and is not crawled by search engine bots but I'd like to block anything that isn't a known search engine bot doesn't get the file I'm feeding to google, yahoo and the others. From what I've read this could be done with .htacess but I've not been able to make heads or tails out of that. I'd really be grateful for some help here. Thanks Some tutorials http://baremetal.com/gadgets/htaccess/ http://evolt.org/node/226 http://www.edginet.org/techie/website/htaccess.html http://www.dimi.uniud.it/labs/docume.../Challenger1.2 /U ser/htaccess/htaccess.html http://www.webhelpinghand.com/htaccess_deny.htm http://www.javascriptkit.com/howto/htaccess.shtml http://www.serverwatch.com/tutorials...0825_1127711_1 http://www.verio.com/support/documen...fm?doc_id=3624 Some of those are familiar but I'll take a look at 'em anyway. My big problem is I'm not a coder. Simple stuf I can handle but figuring out docs and helps takes forever Joe, There are more beneficial forums for htaccess and Apache. The Apache Server forum at Webmaster World is excellent and the moderator makes a superb effort to assist far too many people. The Search Engine Spider ID was the predecessor to the Apache as far as htaccess coding. Rgistration is free to most forums. I may be able to assist you, however my extensive use of htaccess has been limited to the "KISS" thought. When it comes to simulated-wildcards and complicated expressions, I'm daft! |
#44
| |||
| |||
|
|
Paul <noone (AT) houstoncrafts (DOT) com> wrote: You have email John. Thanks Paul, looking into it (got the Gecko one as well, haven't had time to check it out, thanks). |
#45
| |||||||||
| |||||||||
|
|
John Bokma <john (AT) castleamber (DOT) com> wrote in news:Xns9A4BB7830D344castleamber (AT) 130 (DOT) 133.1.4: Joe Fox <ny152 (AT) none (DOT) invalid> wrote: As I said in another post, Can we save discussion of *why* for another time and talk about *how*? Like I wrote in reply to that other post: we're trying to help you for *free*. But even if you were paying me, I would ask you *why*. How about that. A voice of reason on UseNet. Nearly an extinct species these days. ;-) |
|
Too often people think they have an X problem, and try to find all kinds of solutions to that, while the real problem is Y. If you ever have been helping others on Usenet, you certainly know what I mean. NOW I see what you're saying and because you're saying it so reasonably, I'll go into why. It sounds to me like you're afraid to educated others (give away your secrets) by hiding your robots.txt. If that is indeed the case you *do* have a X -> Y problem. You tell me. I'm one of the several hundred thousand bloggers that have had their page rank bitchslapped by google, first to zero pr, then to status "unranked". There's a big problem with this because up until then pr has been a determining factor in the income of these bloggers. |
|
Obviously a lot of people have want to get pagerank back and still be able to do the work they enjoy and get paid for it as before... without google hitting them with pr zero or unranking them entirely. Getting advertisers and intermediaries to stop using pr as one of their value assessment metrics is being attempted, |
|
but simply put, advertisers want "link juice" and visibility in search engines and they're never going to stop wanting the links they pay for on pages with a certain pagerank. |
|
Andy Beard's blog had an idea recently about using robots.txt to tell google not to index pages that contain the paid links that they're so upset about. I'm thinking that this gives the best of both worlds.. the advertiser gets their paid link that doesn't have rel="nofollow" on it and google gets told "don't crawl this page". |
|
Google gets to keep their index "pure" by not crawling (and thus indexing ) the page with the paid links, and the advertisers get some pr because while the page won't be crawled, there will still be links to it so that it can pass pagerank (though maybe not as much as otherwise). |
|
The problem is with intermediaries that might decide this is no better than putting rel="nofollow" on the link (which I don't see that it is). The idea is to keep them from being able to read the robots.txt that is being given to Google should they think of it. Advertiser wants links without rel="nofollow" Google doesn't like paid links unless they have rel="nofollow". There are bloggers who need the money and must find a way to do both at the same time. |
|
Seems to me that this method should work as long as intermediaries don't get the robots.txt being given to google. Thus the need to ensure that ONLY google or other Search engines get the "real" robots.txt. Problem is, I'm not a coder. |
|
I'm trying to figure out how to do this with .htaccess and it's very slow going. There are seemingly more pitfalls than answers simply because I do not understand the language. Thus I seek help from those who Do know the language. Please forgive my attitude earlier.... Real life is .... being a problem I'm not trying to defraud, I just want to get back to earning a living. I was doing pretty good untill the "BitchSlap of '07" |
#46
| |||
| |||
|
|
John Bokma <john (AT) castleamber (DOT) com> wrote in news:Xns9A4BB6E764926castleamber (AT) 130 (DOT) 133.1.4: Joe Fox <ny152 (AT) none (DOT) invalid> wrote: Can we simply agree to disagree and save discussion of *why* for another time and go into some details about *how*? Welcome to Usenet. Remember people try to help you in *their* spare time, for *free*. Yes, that's right, they do. and I have always apprecieated the help and input I find on UseNet and other sources. That's why I didn't see the need for a big deal about why. I didn't see a need to waste people's time with *why*. |
#47
| |||
| |||
|
|
John Bokma <john (AT) castleamber (DOT) com> wrote in |
|
RewriteCond %{HTTP_USER_AGENT} =UA1 [OR] RewriteCond %{HTTP_USER_AGENT} =UA2 [OR] RewriteCond %{HTTP_USER_AGENT} =UA3 [OR] RewriteRule ^robots.txt$ real-robots.txt [L] with UA1..UAn the *exact* UA plain string, e.g. Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) See: http://httpd.apache.org/docs/1.3/mod/mod_rewrite.html John, Just a heads up (not critique). The last "[OR]" is invalid. |
#48
| ||||
| ||||
|
|
On Thu, 21 Feb 2008 20:00:21 -0600, Joe Fox <ny152 (AT) none (DOT) invalid wrote: John Bokma <john (AT) castleamber (DOT) com> wrote in news:Xns9A4BB7830D344castleamber (AT) 130 (DOT) 133.1.4: Joe Fox <ny152 (AT) none (DOT) invalid> wrote: As I said in another post, Can we save discussion of *why* for another time and talk about *how*? Like I wrote in reply to that other post: we're trying to help you for *free*. But even if you were paying me, I would ask you *why*. How about that. A voice of reason on UseNet. Nearly an extinct species these days. ;-) He was extinct for a while. He came back. Too often people think they have an X problem, and try to find all kinds of solutions to that, while the real problem is Y. If you ever have been helping others on Usenet, you certainly know what I mean. NOW I see what you're saying and because you're saying it so reasonably, I'll go into why. It sounds to me like you're afraid to educated others (give away your secrets) by hiding your robots.txt. If that is indeed the case you *do* have a X -> Y problem. You tell me. I'm one of the several hundred thousand bloggers that have had their page rank bitchslapped by google, first to zero pr, then to status "unranked". There's a big problem with this because up until then pr has been a determining factor in the income of these bloggers. Because you can sell links priced according to your PR? Obviously a lot of people have want to get pagerank back and still be able to do the work they enjoy and get paid for it as before... without google hitting them with pr zero or unranking them entirely. Getting advertisers and intermediaries to stop using pr as one of their value assessment metrics is being attempted, How'd I guess? but simply put, advertisers want "link juice" and visibility in search engines and they're never going to stop wanting the links they pay for on pages with a certain pagerank. They will when they get educated about it. Andy Beard's blog had an idea recently about using robots.txt to tell google not to index pages that contain the paid links that they're so upset about. I'm thinking that this gives the best of both worlds.. the advertiser gets their paid link that doesn't have rel="nofollow" on it and google gets told "don't crawl this page". If it isn't indexed or crawled, how will it get any PR? The home page PR is irrelevant, by the way... |
|
Google gets to keep their index "pure" by not crawling (and thus indexing ) the page with the paid links, and the advertisers get some pr because while the page won't be crawled, there will still be links to it so that it can pass pagerank (though maybe not as much as otherwise). Um. Difficult to see how Google could give PR to a page that's not even indexed. |
|
The problem is with intermediaries that might decide this is no better than putting rel="nofollow" on the link (which I don't see that it is). The idea is to keep them from being able to read the robots.txt that is being given to Google should they think of it. Advertiser wants links without rel="nofollow" Google doesn't like paid links unless they have rel="nofollow". There are bloggers who need the money and must find a way to do both at the same time. There are bloggers who shouldn't give up their day job. |
|
Seems to me that this method should work as long as intermediaries don't get the robots.txt being given to google. Thus the need to ensure that ONLY google or other Search engines get the "real" robots.txt. Problem is, I'm not a coder. I agree :-) I'm trying to figure out how to do this with .htaccess and it's very slow going. There are seemingly more pitfalls than answers simply because I do not understand the language. Thus I seek help from those who Do know the language. Please forgive my attitude earlier.... Real life is .... being a problem I'm not trying to defraud, I just want to get back to earning a living. I was doing pretty good untill the "BitchSlap of '07" Doesn't mean you yourself personally were doing anything wrong (although it sounds like you were actually). Plenty of people, myself included, lost PR because people linking to them were involved in buying/selling links and got penalised accordingly. I don't get any fewer visitors as a consequence, though. PR is not the metric people think it is. |
![]() |
| Thread Tools | |
| Display Modes | |
| |