HighDots Forums  

Errors indexing

Search Engine Optimization Discussion about SEO/Search Engine Optimization (alt.internet.search-engines)


#1
fernando@floraqueen.com
Errors indexing - 10-02-2006, 11:47 AM

Hello,

Recently we started getting a lot of errors in Google's indexing of our
site, like these:
////////////////////////////////////////////////////////
http://www.floraqueen.com/://floraqueen.com/ 404 (Not found) [?] 25-sep-2006
http://www.floraqueen.com/://www.floraqueen.com/ 404 (Not found) [?] 18-sep-2006
.....
///////////////////////////////////////////////////////

I don't understand. What is the problem?

I have checked my sitemap.xml and all my site files for a broken link or
something similar, and I can't find anything.

Do you know where the problem is coming from, or have any idea how to
check?

Alternatively, is it possible to tell Google to index only the contents
of sitemap.xml, with no crawling?

If I disallow all files in robots.txt, does Google still look at
sitemap.xml?

Thank you in advance.


#2
Nikita the Spider
Re: Errors indexing - 10-02-2006, 02:49 PM

In article <1159804053.992703.286800 (AT) b28g2000cwb (DOT) googlegroups.com>,
"fernando (AT) floraqueen (DOT) com" <fernando (AT) floraqueen (DOT) com> wrote:

Quote:
Hello,

Recently we started getting a lot of errors in Google's indexing of our
site, like these:
////////////////////////////////////////////////////////
http://www.floraqueen.com/://floraqueen.com/ 404 (Not found) [?] 25-sep-2006
http://www.floraqueen.com/://www.floraqueen.com/ 404 (Not found) [?] 18-sep-2006
....
///////////////////////////////////////////////////////

I don't understand. What is the problem?

I don't know anything about Google Sitemaps, but I don't think that's
the problem. I think you have one or more links on your site coded like
this:

<a href="://floraqueen.com/">blah blah blah</a>

Someone made a typo ("typographical mistake"). That would be the simple
explanation.
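
One way to hunt for such a typo is to scan the site's HTML for link attributes that begin with "://". The following is only a minimal sketch using Python's standard html.parser module; the class name BrokenHrefFinder, the helper find_broken_links and the sample markup are made up for illustration, and you would feed it your own page sources or templates.

```python
from html.parser import HTMLParser


class BrokenHrefFinder(HTMLParser):
    """Collect href/src values that start with '://' (scheme missing)."""

    def __init__(self):
        super().__init__()
        self.broken = []  # list of (tag, attribute, value)

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value and value.strip().startswith("://"):
                self.broken.append((tag, name, value))


def find_broken_links(html_text):
    """Return every (tag, attribute, value) whose URL lost its scheme."""
    finder = BrokenHrefFinder()
    finder.feed(html_text)
    return finder.broken


# Hypothetical sample markup: one broken link, one good link.
sample = ('<p><a href="://floraqueen.com/">blah blah blah</a> '
          '<a href="http://floraqueen.com/">ok</a></p>')
print(find_broken_links(sample))
```

If the pages are generated from templates or a database, the same check may have to run against the rendered output rather than the source files, since the bad prefix could be assembled at run time.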

Good luck

--
Philip
http://NikitaTheSpider.com/
Whole-site HTML validation, link checking and more


#3
Turbo
Re: Errors indexing - 10-02-2006, 09:14 PM

Yeah, someone left out the http, so Googlebot assumed the link was a
relative URL and concatenated it with the base URL to form the new URL.
And the new URL was obviously wrong.
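
The concatenation described here can be sketched as follows. This is a deliberately simplified resolver written for illustration; naive_resolve is a hypothetical function, not Googlebot's actual code, and real crawlers follow the fuller rules for relative references. It only shows how a link with no scheme ends up appended to the base URL.

```python
import re
from urllib.parse import urlsplit

# A URL is treated as absolute only if it starts with "scheme:".
SCHEME_RE = re.compile(r"^[A-Za-z][A-Za-z0-9+.-]*:")


def naive_resolve(base_url, href):
    """Simplified link resolution (hypothetical, for illustration only)."""
    if SCHEME_RE.match(href):
        return href                       # absolute URL, used as-is
    base = urlsplit(base_url)
    if href.startswith("//"):
        return base.scheme + ":" + href   # protocol-relative link
    origin = f"{base.scheme}://{base.netloc}"
    if href.startswith("/"):
        return origin + href              # root-relative link
    # Anything else is taken as relative to the directory of the base page.
    directory = (base.path or "/").rsplit("/", 1)[0] + "/"
    return origin + directory + href


print(naive_resolve("http://www.floraqueen.com/", "://floraqueen.com/"))
print(naive_resolve("http://www.floraqueen.com/", "http://floraqueen.com/"))
```

Under these assumptions the first call produces the same shape of URL as the ones in the error report, because "://floraqueen.com/" has no scheme and does not start with a slash.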
-----------------
http://sandy007smarty.seo.iitm.ac.in/



#4
Big Bill
Re: Errors indexing - 10-03-2006, 01:03 AM

On 2 Oct 2006 18:14:46 -0700, "Turbo" <sandeep.iiit (AT) gmail (DOT) com> wrote:

Quote:
Yeah, someone left out the http, so Googlebot assumed the link was a
relative URL and concatenated it with the base URL to form the new URL.
And the new URL was obviously wrong.
-----------------
http://sandy007smarty.seo.iitm.ac.in/

!!!

BB

--

http://www.kruse.co.uk/seo-sitemap.htm
http://www.here-be-posters.co.uk/art-prints-sitemap.htm
http://www.here-be-posters.co.uk/lithographs.htm


#5
Borek
Re: Errors indexing - 10-03-2006, 03:01 AM

On Mon, 02 Oct 2006 17:47:34 +0200, fernando (AT) floraqueen (DOT) com
<fernando (AT) floraqueen (DOT) com> wrote:

Quote:
Recently we started getting a lot of errors in Google's indexing of our
site, like these:
////////////////////////////////////////////////////////
http://www.floraqueen.com/://floraqueen.com/ 404 (Not found) [?] 25-sep-2006
http://www.floraqueen.com/://www.floraqueen.com/ 404 (Not found) [?] 18-sep-2006
....
///////////////////////////////////////////////////////

I don't understand. What is the problem?

I have checked my sitemap.xml and all my site files for a broken link or
something similar, and I can't find anything.

If your site doesn't contain these, most likely they are external and out
of your control. It happens all the time: morons linking to your site,
morons writing bots that list spidered URLs but cannot read them
correctly, and so on. It could also be a new Google bug; Google engineers
are known to be creative when it comes to such things, as happened some
months ago (or was it 2005?) with
www.example.com///where-are-these-slashes-from?

Quote:
Do you know where the problem is coming from, or have any idea how to
check?

You already did what you can: you have checked your site. You can also
go through your logs to see whether these URLs are requested (GET) with
a Referer header; perhaps such a URL is displayed on some page and
sometimes followed by humans.
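
A rough sketch of that log check, assuming the server writes the Apache/NCSA "combined" log format (an assumption; other servers and formats need a different pattern). Access logs in this format normally record only the request path, so the thing to search for is a path starting with "/:" rather than the full "http://www.floraqueen.com/://..." URL. The function name suspicious_requests and the sample lines are invented for illustration.

```python
import re

# Apache/NCSA "combined" log format (an assumption about the server setup).
LOG_LINE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) \S+ "(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def suspicious_requests(lines, marker="/:"):
    """Return (path, status, referer, agent) for requests whose path starts with marker."""
    hits = []
    for line in lines:
        match = LOG_LINE.match(line)
        if match and match.group("path").startswith(marker):
            hits.append((match.group("path"), int(match.group("status")),
                         match.group("referer"), match.group("agent")))
    return hits


# Made-up sample lines, only to show the shape of the data.
sample_log = [
    '192.0.2.10 - - [25/Sep/2006:10:15:32 +0200] "GET /://floraqueen.com/ HTTP/1.1" '
    '404 512 "-" "Googlebot/2.1"',
    '192.0.2.11 - - [25/Sep/2006:10:16:01 +0200] "GET /index.html HTTP/1.1" '
    '200 2048 "http://example.org/links.html" "Mozilla/5.0"',
]

for path, status, referer, agent in suspicious_requests(sample_log):
    print(path, status, referer, agent)
```

A referer of "-" means the client sent none, which is common for crawlers; a real referring page would point to where the bad link lives.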

Quote:
If I disallow all files in robots.txt, does Google still look at
sitemap.xml?

AFAIR robots.txt supersedes the sitemap.

If your page sends a correct 404 and these links do not originate from
your site, don't bother.

Borek
--
http://www.chembuddy.com
http://www.ph-meter.info
http://www.terapia-kregoslupa.waw.pl


#6
josepe
Re: Errors indexing - 10-03-2006, 05:08 AM

Thank you very much for your help.

Why do you say not to bother if the page sends a correct 404?
We have configured our server to return a 404 error with a custom page.
Is that correct?

Is there any way to trace whether an external link is causing the
problem?

Do you have any more ideas?
We are depressed... :-(

Thanks a lot.






#7
Borek
Re: Errors indexing - 10-03-2006, 05:27 AM

On Tue, 03 Oct 2006 11:08:12 +0200, josepe <josepe (AT) gmail (DOT) com> wrote:

Quote:
Thank you very much for your help.

Please don't top post.

Quote:
Why do you say not to bother if the page sends a correct 404?
We have configured our server to return a 404 error with a custom page.
Is that correct?

Yes. Google will not index a page if a 404 status is sent (as in your
case). So don't lose sleep just because it tries to fetch some
non-existent page. It won't hurt your SERPs.
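
To confirm that a custom error page really goes out with a 404 status, something like the following could be used. It is a small sketch built on Python's standard urllib; fetch_status is a hypothetical helper name. Note that urllib follows redirects, so if the custom page were delivered through a redirect that ends in a normal page, this would report 200 instead, which is the case worth worrying about.

```python
import urllib.error
import urllib.request


def fetch_status(url, timeout=10):
    """Return the final HTTP status code the server sends for url."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises for 4xx/5xx responses; the code is on the exception.
        return error.code


# Example call (needs network access, so it is left commented out):
# print(fetch_status("http://www.floraqueen.com/://floraqueen.com/"))
```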

Quote:
Is there any way to trace whether an external link is causing the
problem?

Only through your logs - if it will be there.

Borek
--
http://www.chembuddy.com
http://www.ph-meter.info
http://www.terapia-kregoslupa.waw.pl


#8
josepe
Re: Errors indexing - 10-03-2006, 08:07 AM

Hi,

Quote:
Only through your logs - if it will be there.

Can you help me?

I can't find any URL like
"http://www.floraqueen.com/://floraqueen.com*"
in my web server logs.

What do you mean by "through your logs"?


Thank you.







#9
Borek
Re: Errors indexing - 10-03-2006, 08:48 AM

On Tue, 03 Oct 2006 14:07:58 +0200, josepe <josepe (AT) gmail (DOT) com> wrote:

Quote:
Only through your logs - if it will be there.?

Can you help me?

I can try, if you will stop top posting.

Quote:
I can't find any url like:
"http://www.floraqueen.com/://floraqueen.com*"
in my web server logs.

So how do you know Google tries to index this URL?

Quote:
What do you mean "through your logs"

Analyzing them.

Borek
--
http://www.chembuddy.com
http://www.ph-meter.info
http://www.terapia-kregoslupa.waw.pl


#10
josepe
Re: Errors indexing - 10-03-2006, 11:00 AM

Sorry for the multiple posts; two different people are working on this.

Quote:
So how do you know Google tries to index this url?

I know because it is in my Google Sitemaps webmaster tools account.
We have 4 different domains in this account.
I can check every domain with the "HTTP errors" function of the Google tool.
There Google shows me all the errors, and it lists more than 1000 URLs
with a 404 HTTP error. Across my whole site they look like:
"http://www.floraqueen.com/://floraqueen.com/"
"http://www.floraqueen.com/://floraqueen.com/pag1.html"
"http://www.floraqueen.com/://floraqueen.com/pag2.html"
......

This happens for each domain in the account:
www.floraqueen.com
flores.floraqueen.com
fleurs.floraqueen.com
blumen.floraqueen.com


On another note, we are thinking of inserting this line in robots.txt:
disalow http://www.floraqueen.com/:://*

Do you think that will work?
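
For what it is worth, robots.txt rules are written as "Disallow:" followed by a URL path, not a full URL, so the line above would probably not match anything as written. A rule can be tried out locally before publishing it. The sketch below uses Python's standard urllib.robotparser, which is Python's own parser and not Google's, so it only approximates how a crawler would read the file. The rule "Disallow: /:" is a suggestion for illustration (a plain prefix match on paths starting with "/:"), and is_allowed is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Candidate robots.txt content (an illustrative suggestion, not a tested fix).
ROBOTS_TXT = """\
User-agent: *
Disallow: /:
"""


def is_allowed(robots_txt, url, user_agent="*"):
    """Return True if user_agent may fetch url under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


for url in ("http://www.floraqueen.com/://floraqueen.com/pag1.html",
            "http://www.floraqueen.com/pag1.html"):
    print(url, is_allowed(ROBOTS_TXT, url))
```

Blocking the URLs this way would only hide the symptom; the malformed links would still exist wherever they come from.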


Thank you, and sorry again for the multiple posts.




