By Oliver Kirstein on Thursday, 13 September 2018
Posted in Technical Issues
Replies 6
Likes 0
Views 599
Votes 0
Hi,
To avoid duplicate content we set in robots.txt Disallow: /*? and Disallow: /*=
Now we have an error in google search console for the amp version of the articles, because the url look like: https://www.storzmedical.com/es/especialidad/urologia/base-de-datos-de-literatura-sobre-swl/base-de-datos-de-literatura/entry/shinde-s.html?format=amp
Can give us a tip to handle this problem?
Thank you
Oliver
Hey there,

Can you check what error message did google search console give for the AMP version?

We just tested https://www.storzmedical.com/es/especialidad/urologia/base-de-datos-de-literatura-sobre-swl/base-de-datos-de-literatura/entry/shinde-s.html?format=amp in the Google AMP test tool and the result shows it is a valid AMP page.

The following link is the result of it:

[gist type="php"]
https://search.google.com/test/amp?id=afND3N6cDqskp0tW_dklhA
[/gist]
·
Thursday, 13 September 2018 19:19
·
0 Likes
·
0 Votes
·
0 Comments
·
Hey Oliver,

It is not necessary to index those ?format=amp urls because in each of your blog posts, EasyBlog has already added the necessary metadata to inform crawlers that there is an AMP version of the post
·
Thursday, 13 September 2018 19:19
·
0 Likes
·
0 Votes
·
0 Comments
·
Hi,
Thank you for your answers.
The error message is, that Google was blocked from them robots.txt (because we set Disallow: /*? and Disallow: /*=).
And the blog url name has a "?" and "="
After deleting Disallow: /*? and Disallow: /*= in robots.txt it's all ok. But to avoid duplicate conten from joomla we would like allow URLs with "?" and "="
·
Thursday, 13 September 2018 19:51
·
0 Likes
·
0 Votes
·
0 Comments
·
I am actually a little lost here. Having the ?format=amp will not cause any duplicate urls. These are amp specific pages and Google will not treat them as duplicate contents.
·
Thursday, 13 September 2018 21:13
·
0 Likes
·
0 Votes
·
0 Comments
·
This is a joomla Problem. Different kind of links shows the same page. For example:
/index.php?option=com_content&view=article&id=2
show the same page than the seo optimized link. Booth versions would be crawled from Google. So you have the problem with the duplicate content.
To avoid this Problem, you set
Disallow: /*?
Disallow: /*=
in robots.txt
But now, Google can't crawl easy blog AMP articles, because "?" and "=" is a part of the url
·
Thursday, 13 September 2018 21:25
·
0 Likes
·
0 Votes
·
0 Comments
·
Unfortunately there is no way to remove the ?format=amp in the URL because without this, Joomla will treat it as a normal html view. By the way, as long as there is a canonical tag on the page, it is actually perfectly fine.
·
Thursday, 13 September 2018 21:34
·
0 Likes
·
0 Votes
·
0 Comments
·
View Full Post