Avoid duplicate indexing in google when hosting in subdomain and using reverse proxy to /subfolder/

I have reverse proxied my webflow blog hosted in sub-domain to sub-folder
from
https://subdomain.mydomain.com
to
https://mydomain.com/blog/

Problem is now I have duplicate content and there is a risk of google indexing both my subdomain url and /blog url.

  1. Is there any way to avoid google from indexing the pages inside my subdomain, and index only subfolder pages inside /blog/.
  2. When I am generating the sitemap, it creates sitemap with subdomain.yourdomain.com, but not yourdomain.com/blog. Is there any easy way to create sitemaps for subfolder instead of urls from subdomains. Copy pasting manually from sitemap generator everytime a new blog is posted sounds tedious.

Here is my site Read-Only: LINK

1 Like

I suggest you look at the following resource which explains the problem and provides a solution.
Consolidate Duplicate URLs with Canonicals | Google Search Central

@webdev Jeff, thanks for responding. Any workarounds for the sitemap question that I asked above ?

Hello @webdev, I have similar issues and maybe you could help me… I used Cloudflare for reverse proxy, and have my 2 projects on Webflow, URLs work fine but:

  1. Google is not indexing the pages I create with my secondary project. Any solution to this please?

  2. Should the canonical tag be placed in the of the new page created on the secondary project? by placing the link of the main site in the tag?

Thank you very much for your help.