
Sitemaps, robots.txt and Google Webmaster


Ckay


Hi and thanks in advance.

 

I've recently published a new website and added it to Google Webmaster Tools, Google Analytics, added sitemaps etc. All the usual stuff.

 

However, in Google Webmaster Tools I've noticed a large and growing number of 404 errors being listed. Virtually all of these 404s are for product pages in foreign languages, which I don't actually want at this stage. When the products were added, they were entered exclusively for an English (en) audience.

 

I've noticed that, on the server, there are XML sitemap files (see attached image) for Germany (1_de_0_sitemap.xml), Spain (1_es_0_sitemap.xml) and France (1_fr_0_sitemap.xml), plus an English one (1_en_0_sitemap.xml) and a seemingly generic one.

 

The generic XML file contains the code below, which apparently points to the sitemap for each of the languages/countries I've mentioned above.

 

<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>http://www.c-tec.com/1_en_0_sitemap.xml</loc>
    <lastmod>2016-08-08T09:25:07+01:00</lastmod>
  </sitemap>
  <sitemap>
    <loc>http://www.c-tec.com/1_de_0_sitemap.xml</loc>
    <lastmod>2016-08-08T09:25:07+01:00</lastmod>
  </sitemap>
  <sitemap>
    <loc>http://www.c-tec.com/1_es_0_sitemap.xml</loc>
    <lastmod>2016-08-08T09:25:07+01:00</lastmod>
  </sitemap>
  <sitemap>
    <loc>http://www.c-tec.com/1_fr_0_sitemap.xml</loc>
    <lastmod>2016-08-08T09:25:07+01:00</lastmod>
  </sitemap>
</sitemapindex>
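
For anyone who wants to check which child sitemaps an index like this advertises, here is a minimal Python sketch. It is only an illustration: the function names are hypothetical, and the language filter assumes the file names follow the `<shop>_<lang>_<n>_sitemap.xml` pattern seen in the index above, which may not hold on other setups.

```python
import xml.etree.ElementTree as ET

# Namespace used by the sitemap protocol (taken from the index file above).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def list_child_sitemaps(index_xml):
    """Return the <loc> URL of every <sitemap> entry in a sitemap index."""
    root = ET.fromstring(index_xml)
    return [
        loc.text.strip()
        for loc in root.findall("sm:sitemap/sm:loc", SITEMAP_NS)
        if loc.text
    ]


def filter_by_language(urls, keep=("en",)):
    """Keep only sitemap URLs whose file name carries a wanted language code.

    Assumption: the language code is the second underscore-separated
    part of the file name, e.g. "1_en_0_sitemap.xml" -> "en".
    """
    kept = []
    for url in urls:
        file_name = url.rsplit("/", 1)[-1]
        parts = file_name.split("_")
        if len(parts) >= 2 and parts[1] in keep:
            kept.append(url)
    return kept
```

Passing the index contents to `list_child_sitemaps` and then the result to `filter_by_language` would show which of the listed sitemaps are English-only and which ones are advertising the unwanted language pages.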
 

 

If I'm honest, I don't even know how these xml files were generated and I certainly don't want the non-UK ones for now. 

 

Furthermore, in the robots.txt file I've noticed the following code:

 

# Sitemap
 
I've also used an online resource to generate a sitemap.xml file (which is now on the server) which does not contain any URLs pointing to the unwanted foreign-language pages. I want this XML file to be the default.
 
My question: If I change the above code in the robots.txt file to http://www.c-tec.com/sitemap.xml and remove all the other unwanted xml files from the server, will this solve the problem?
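
To make the proposed change concrete, here is a rough Python sketch of the robots.txt edit described above. It assumes the file uses the standard `Sitemap:` directive; the function name and the sample file contents in the usage note are hypothetical, not the site's actual robots.txt.

```python
def set_sitemap_directive(robots_txt, sitemap_url):
    """Drop every existing "Sitemap:" line and append a single new one.

    All other lines (user-agent rules, comments) are left untouched.
    """
    lines = [
        line
        for line in robots_txt.splitlines()
        if not line.strip().lower().startswith("sitemap:")
    ]
    lines.append("Sitemap: " + sitemap_url)
    return "\n".join(lines) + "\n"
```

For example, calling `set_sitemap_directive(text, "http://www.c-tec.com/sitemap.xml")` on the current file contents would leave exactly one sitemap line pointing at the new file. This only covers the robots.txt side; it does not say anything about how the other sitemap files were generated in the first place.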

post-989338-0-44372000-1471516751_thumb.jpg
