SEO while using a reverse proxy

  1. We are hosting our Webflow site on a subdomain we control (webflow.mycoolsite.com).
  2. To serve some Webflow pages on mycoolsite.com, we reverse proxy webflow.mycoolsite.com.

My question is: how do we avoid duplicate content, and the SEO penalties that come with it, when the exact same website is reachable in two places?

I know that we can create a robots.txt on the subdomain (webflow.mycoolsite.com), but that’s not a complete solution, since robots.txt blocks crawling rather than indexing. The best way would be to mark all the subdomain pages as noindex and give them a canonical tag referencing the appropriate page on mycoolsite.com. (We are currently doing this with a JS script that dynamically injects the tags, but Google has said they ignore these.) I know a lot of people use a reverse proxy. Has anyone found an elegant solution?
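
To make the noindex plus canonical idea concrete, here is a rough sketch of the same two signals expressed as HTTP response headers (`X-Robots-Tag` and `Link` with `rel="canonical"`) instead of JS-injected tags. It assumes there is some layer where response headers for the subdomain can be set, which may not be true of every setup. The function name `seo_headers` and the two constants are hypothetical and only for illustration.

```python
from urllib.parse import urlsplit, urlunsplit

# Hostnames taken from the post; adjust for a real setup.
SUBDOMAIN_HOST = "webflow.mycoolsite.com"
CANONICAL_HOST = "mycoolsite.com"


def seo_headers(url: str) -> dict[str, str]:
    """Return extra response headers for a request URL (hypothetical helper).

    Responses for the subdomain get a noindex directive plus a canonical
    link to the same path on the main domain. Any other host gets nothing.
    """
    parts = urlsplit(url)
    if parts.hostname != SUBDOMAIN_HOST:
        return {}
    # Same path on the main domain; query string and fragment are dropped.
    canonical = urlunsplit(("https", CANONICAL_HOST, parts.path or "/", "", ""))
    return {
        "X-Robots-Tag": "noindex",
        "Link": f'<{canonical}>; rel="canonical"',
    }
```

Whatever sits in front of the subdomain would merge the returned dictionary into each outgoing response. This is an assumption about the architecture, not a confirmed Webflow feature.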
