By Sean McElherron on Thursday, 07 June 2018
Posted in Technical Issues
Replies 16
Likes 0
Views 751
Votes 0
Hi - how do I prevent Easyblog from generating multilingual URLs on my site? I only wish to have Easyblog appear in English and the URLs are also mixing the languages.
Based on what i see your screenshot, it seems like Google indexed your page that time already 2 month ago, since you only show blog URL without any language code, Google shouldn't able to index it.

Unless that time these URL did appear on the site, this is why Google can indexed these invalid page.

Perhaps this time you try re-submit your sitemap to Google webmaster tool and see how it goes. (Ensure that inside the sitemap do not have any invalid blog post URL e.g. de/blog/xxxx , fr/blog/xxx)

Keep us update if Google re-indexed again on your site, it still can able to indexed those invalid page.

If that is the case, mean some of the page did show these invalid URL on your site.
·
Thursday, 07 June 2018 19:19
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi Arlex

This problem still exists and I have a lot of mixed language urls such as:

https://www.cannyco.com/nl/blog/categories/quirky-and-fun
https://www.cannyco.com/fr/blog/yes-dogs-can-be-allergic-to-fleas

Etc etc, which were found by Google on Sept 2 and 3 this year.

I thought sh404sef might be the problem and uninstalled it but the problem persists. Any advice very welcome.
·
Friday, 14 September 2018 05:00
·
0 Likes
·
0 Votes
·
0 Comments
·
Hey Sean,

Currently I have no ideas yet, because i did tried to check each of your French menu link, it seems like no any page generate this URL e.g. https://www.cannyco.com/fr/blog/yes-dogs-can-be-allergic-to-fleas

Can you try add this following code into your robot.txt file and see how it goes.


Disallow: /de/blog/*
Disallow: /nl/blog/*
Disallow: /fr/blog/*
Disallow: /it/blog/*
Disallow: /es/blog/*
Disallow: /pt/blog/*
·
Friday, 14 September 2018 12:15
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi Arlex

OK, thank you, have done that. Will keep you posted.
·
Friday, 14 September 2018 14:52
·
0 Likes
·
0 Votes
·
0 Comments
·
You're most welcome, keep us update then.
·
Saturday, 15 September 2018 12:28
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi Arlex

This is now getting out of control, please help. Attached are the crawl errors from our site http://www.cannyco.com/it. Until the 25th September, there were 8 errors, now there are 64, the vast majority having the 'blog' in the URL. I am guessing that was the date that Google crawled our newly submitted sitemaps.

Our blog should only appear on the English page as previously stated. The sitemaps for each language (other than English) do not contain any URLs with the word 'blog' in them.

Adding the disallows in the robots.txt file hasn't done anything.
·
Wednesday, 03 October 2018 18:34
·
0 Likes
·
0 Votes
·
0 Comments
·
Hey Sean,

Is it possible provide us with your Google access so I can check further for those 404 link where those coming from?
·
Wednesday, 03 October 2018 19:13
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi Arlex - not a problem. What email address should I use?
·
Wednesday, 03 October 2018 19:20
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi Arlex - I need an email address from you to be able to set you up as a user on my Google account.
·
Thursday, 04 October 2018 03:05
·
0 Likes
·
0 Votes
·
0 Comments
·
Hey Sean,

I am really sorry that i delayed of this reply, this is my email address arlex.wong@stackideas.com
·
Thursday, 04 October 2018 12:21
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi Arlex - I have added you here: https://www.google.com/webmasters/tools/user-admin?hl=en-GB&siteUrl=https://www.cannyco.com/
·
Thursday, 04 October 2018 13:11
·
0 Likes
·
0 Votes
·
0 Comments
·
Thanks, I've checked your Google webmaster tool report, it seems like Google haven't re-index your site for all the pages yet, you can take a look of my screenshot here : http://take.ms/SfaD1 || http://take.ms/hqoTN .

From the last time you added those rules into your site robot.txt file as what i suggested at 2018/9/14 , so compare with the result from your Google webmaster tool, it seems like no more crawl those xx/blog 404 URL page since 2018/9/15 until today.

By the way, those 404 error actually doesn't harm your site performance in search, you can take a look my screenshot here : http://take.ms/i5OL3
·
Thursday, 04 October 2018 17:30
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi Arlex - that's not correct, please see attached screenshot. The majority of errors (this is for smartphone) are for Sept 25 and 26, after we made the change to robots.txt file.
·
Thursday, 04 October 2018 19:24
·
0 Likes
·
0 Votes
·
0 Comments
·
Hey Sean,

Currently I still no ideas how Google can indexed those pages into your site.

I've updated some customisation code under this file JoomlaFolder/components/com_easyblog/easyblog.php on your site, whenever someone trying to access your other language blog page e.g. /de/blog , /de/?option=com_easyblog and else , it will always redirect to your English blog page which is https://www.cannyco.com/blog

Since last time we added some robot rules, then it doesn't help much for this, I already help you removed it.


Disallow: /de/blog/*
Disallow: /nl/blog/*
Disallow: /fr/blog/*
Disallow: /it/blog/*
Disallow: /es/blog/*
Disallow: /pt/blog/*


Can you monitor again and see how it goes.
·
Friday, 05 October 2018 12:13
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi Arlex

Thanks for your help - I will keep an eye on it, so how it goes.
·
Friday, 05 October 2018 16:14
·
0 Likes
·
0 Votes
·
0 Comments
·
You're most welcome, keep us update then.
·
Friday, 05 October 2018 18:10
·
0 Likes
·
0 Votes
·
0 Comments
·
View Full Post