Merge pull request #13265 from patrickhousley/browser-google-index

feat: add google indexing troubleshooting
newrelic · Jun 6, 2023 · 9789941
2 parents 596bfca + a05bee0
commit 9789941

Showing 2 changed files with 50 additions and 16 deletions.
diff --git a/...ocs/browser/new-relic-browser/troubleshooting/google-indexing-unknown-paths.mdx b/...ocs/browser/new-relic-browser/troubleshooting/google-indexing-unknown-paths.mdx
@@ -0,0 +1,32 @@
+---
+title: Google Indexing 404 Paths
+type: troubleshooting
+tags:
+  - Browser
+  - Browser monitoring
+  - Troubleshooting
+metaDescription: 'Google may attempt to index invalid site paths from strings found in the agent loader.'
+---
+
+## Problem
+
+You're seeing 404 errors in your Google Search dashboard for URLs that are not valid for your site. You may also see 404 errors in your web server logs for these paths. After searching your site, you find that the site path is present as a string in the browser agent loader script being injected into your HTML.
+
+## Cause
+
+The browser agent uses string literals in its code but converts them to string concatenation during the build process. This prevents possible issues with code that may wrap the loader string in a string literal. In some cases, the conversion produces strings that begin with a forward slash (`/`). When Google indexes a page containing the agent loader script, it may treat these strings as potential paths on your site and attempt to index them.
+
+## Solutions
+
+This scenario is common for site administrators and is not specific to the browser agent. Strings can appear in your site's HTML for many reasons and cause Google Search to treat them as potential paths. Here are some resources that show how others have addressed this concern:
+
+- [How can I block Google from crawling things it thinks are URLs from __NEXT_DATA__?](https://stackoverflow.com/questions/75672103/how-can-i-block-google-from-crawling-things-it-thinks-are-urls-from-next-data)
+- [Google follows JavaScript string as relative path - produces 404 error](https://webmasters.stackexchange.com/questions/50848/google-follows-javascript-string-as-relative-path-produces-404-error)
+
+Publicly available resources, including the links listed below, indicate that these 404 errors do not affect your site's ranking or indexing, so you can safely ignore them. If you are still concerned, you can reach out to the [Google Search Support Community](https://support.google.com/webmasters/community?hl=en) for additional feedback and help.
+
+- [Do 404 errors hurt my site?](https://developers.google.com/search/blog/2011/05/do-404s-hurt-my-site)
+- [Google’s John Mueller Explains Why Google Crawls Non-Existent Pages](https://www.searchenginejournal.com/googlebot-404/239325/)
+- [Google treats 404 as 'Excluded' and doesn't index](https://support.google.com/webmasters/thread/59325769?hl=en&msgid=59357731)
+- [Block access to content on your site](https://support.google.com/news/publisher-center/answer/9605477?hl=en)
+- [Large increase in 404 pages with URLs ending in /aggregate in Page Indexing](https://support.google.com/webmasters/thread/212583650/large-increase-in-404-pages-with-urls-ending-in-aggregate-in-page-indexing?hl=en)
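Reviewer note on the Cause section of the added document: the sketch below illustrates how a quoted string that begins with `/` inside a script can look like a site path. It is only an approximation under stated assumptions. The helper name `find_path_like_strings`, the regular expression, and the sample script text are all hypothetical, and nothing here reflects how Google actually extracts candidate URLs or what the real agent loader contains.

```python
import re

# Hypothetical heuristic: a quoted string literal that starts with "/" and is
# followed by at least one path-like character. This is an assumption made for
# illustration, not a documented crawler rule.
PATH_LIKE = re.compile(r"""(["'])(/[A-Za-z0-9_\-./]+)\1""")


def find_path_like_strings(script_text: str) -> list[str]:
    """Return distinct path-like string literals, in order of first appearance."""
    seen: dict[str, None] = {}
    for match in PATH_LIKE.finditer(script_text):
        # group(2) is the literal's content without the surrounding quotes
        seen.setdefault(match.group(2), None)
    return list(seen)


# Made-up snippet imitating string concatenation output; not actual agent code.
sample = 'var url = base + "/aggregate" + "?id=" + id; var s = "/" ;'
print(find_path_like_strings(sample))
```

Running a check like this against your own page source may help confirm whether the paths reported as 404s appear as string literals in an injected script.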
diff --git a/src/nav/browser.yml b/src/nav/browser.yml
@@ -153,38 +153,40 @@ pages:
             path: /docs/browser/new-relic-browser/browser-apis/setname
   - title: Troubleshooting
     pages:
-      - title: Installation
-        path: /docs/browser/browser-monitoring/troubleshooting/troubleshoot-your-browser-monitoring-installation
-      - title: Data doesn't match other tools
-        path: /docs/browser/new-relic-browser/troubleshooting/browser-data-doesnt-match-other-analytics-tools
-      - title: View detailed error logs
-        path: /docs/browser/new-relic-browser/troubleshooting/view-detailed-error-logs-browser
       - title: AJAX data collection
         path: /docs/browser/new-relic-browser/troubleshooting/troubleshoot-ajax-data-collection
-      - title: Session trace collection
-        path: /docs/browser/new-relic-browser/troubleshooting/troubleshooting-session-trace-collection
       - title: AngularJS errors do not appear
         path: /docs/browser/new-relic-browser/troubleshooting/angularjs-errors-do-not-appear
       - title: Angular truncated agent snippet
         path: /docs/browser/single-page-app-monitoring/troubleshooting/angular-truncated-copy-paste-snippet
+      - title: Data doesn't match other tools
+        path: /docs/browser/new-relic-browser/troubleshooting/browser-data-doesnt-match-other-analytics-tools
+      - title: Google Indexing 404 Paths
+        path: /docs/browser/new-relic-browser/troubleshooting/google-indexing-unknown-paths
       - title: HAR data collection
         path: /docs/browser/new-relic-browser/troubleshooting/get-browser-side-troubleshooting-details-har-file
-      - title: Not seeing specific page names
-        path: /docs/browser/new-relic-browser/troubleshooting/not-seeing-specific-page-or-endpoint-names-browser-data
+      - title: Installation
+        path: /docs/browser/browser-monitoring/troubleshooting/troubleshoot-your-browser-monitoring-installation
       - title: JS injection causes problems
         path: /docs/browser/new-relic-browser/troubleshooting/browser-javascript-injection-causes-problems-page
       - title: JS errors missing traces
         path: /docs/browser/new-relic-browser/troubleshooting/third-party-js-errors-missing-stack-traces
-      - title: RPM higher than PPM
-        path: /docs/browser/new-relic-browser/troubleshooting/app-server-requests-greatly-outnumber-browser-pageview-transactions
       - title: Missing data on Web vitals page
         path: /docs/browser/new-relic-browser/troubleshooting/missing-data-on-web-vitals-page
-      - title: "SPA: Missing route changes"
-        path: /docs/browser/single-page-app-monitoring/troubleshooting/missing-route-changes-spa-agent
-      - title: Navigation start time unknown
-        path: /docs/browser/new-relic-browser/page-load-timing-resources/navigation-start-time-unknown
       - title: MooTools related errors encountered
         path: /docs/browser/new-relic-browser/troubleshooting/mootools-related-errors
+      - title: Navigation start time unknown
+        path: /docs/browser/new-relic-browser/page-load-timing-resources/navigation-start-time-unknown
+      - title: Not seeing specific page names
+        path: /docs/browser/new-relic-browser/troubleshooting/not-seeing-specific-page-or-endpoint-names-browser-data
+      - title: RPM higher than PPM
+        path: /docs/browser/new-relic-browser/troubleshooting/app-server-requests-greatly-outnumber-browser-pageview-transactions
+      - title: Session trace collection
+        path: /docs/browser/new-relic-browser/troubleshooting/troubleshooting-session-trace-collection
+      - title: "SPA: Missing route changes"
+        path: /docs/browser/single-page-app-monitoring/troubleshooting/missing-route-changes-spa-agent
+      - title: View detailed error logs
+        path: /docs/browser/new-relic-browser/troubleshooting/view-detailed-error-logs-browser
   - title: Release notes
     pages:
       - title: Browser agent release notes