Merge #2

chris-lightbug · 2025-10-01T10:31:54Z

No description provided.

github-advanced-security · 2025-10-01T10:31:56Z

This pull request sets up GitHub code scanning for this repository. Once the scans have completed and the checks have passed, the analysis results for this pull request branch will appear on this overview. Once you merge this pull request, the 'Security' tab will show more code scanning analysis results (for example, for the default branch). Depending on your configuration and choice of analysis tool, future pull requests will be annotated with code scanning analysis results. For more information about GitHub code scanning, check out the documentation.

components/DownloadPdfButton.vue

+    // Add text before the link
+    if (match.index > lastIndex) {
+      const beforeText = html.substring(lastIndex, match.index);
+      const cleanBefore = beforeText.replace(/<[^>]+>/g, '');


To fix this problem, we need to ensure that all HTML tags (including those that may be split or repeated) are fully removed from the string. A straightforward fix is to apply the replacement repeatedly until the string stops changing. That way, if removing one match leaves behind text that itself forms a tag, a later pass removes it as well.
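As a rough illustration of the failure mode this kind of alert is about, the sketch below uses a deliberately narrow, hypothetical pattern (`/<script>/g`, which is not the pattern used in this pull request) together with two illustrative helpers, `stripOnce` and `stripUntilStable`. It is only meant to show why a single pass can reassemble a tag and why looping to a fixed point avoids that.

```javascript
// Hypothetical, deliberately narrow pattern used only for illustration.
// It is NOT the pattern from components/DownloadPdfButton.vue.
const SCRIPT_TAG = /<script>/g;

// Single pass: removing the inner match splices the outer pieces together.
function stripOnce(input) {
  return input.replace(SCRIPT_TAG, '');
}

// Multi-pass: repeat the replacement until the string stops changing.
function stripUntilStable(input) {
  let output = input;
  let prev;
  do {
    prev = output;
    output = output.replace(SCRIPT_TAG, '');
  } while (output !== prev);
  return output;
}

const tricky = '<scr<script>ipt>alert(1)';
console.log(stripOnce(tricky));        // → "<script>alert(1)"
console.log(stripUntilStable(tricky)); // → "alert(1)"
```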

Concretely, replace every use of .replace(/<[^>]+>/g, '') (on lines 85, 102, and 110) with a helper function that strips HTML tags by running this pattern until the string no longer changes. The helper can be defined inside the <script setup> block and should be used to sanitize beforeText (line 85), remainingText (line 102), and the whole html string on line 110, where no links were found.

No extra dependencies are needed for robust tag removal if looping is sufficient for the application (PDF text).

File/region to change:

components/DownloadPdfButton.vue, specifically the body of parseHtmlForPdfLinks function (lines 85, 102, and 110).

Implementation details:

Add a stripHtmlTags helper function before parseHtmlForPdfLinks.

Replace direct .replace(/<[^>]+>/g, '') calls with stripHtmlTags(text).
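The two implementation steps above could look roughly like the following. This is a minimal sketch that mirrors the helper suggested in this pull request; the sample strings at the bottom are made up for illustration.

```javascript
// Repeatedly remove HTML tags until a pass makes no further change.
function stripHtmlTags(input) {
  // Pass non-strings and empty values through untouched.
  if (!input || typeof input !== 'string') return input;
  let output = input;
  let prev;
  do {
    prev = output;
    output = output.replace(/<[^>]+>/g, '');
  } while (output !== prev);
  return output;
}

// Illustrative inputs (not taken from the repository).
console.log(stripHtmlTags('<p>Hello <b>world</b></p>')); // → "Hello world"
console.log(stripHtmlTags('a < b'));                     // → "a < b"
```

One caveat worth keeping in mind: the pattern treats any `<` followed later by a `>` as a tag, so plain text such as `a < b and c > d` also loses the span between the two symbols. For PDF text that originates from rendered HTML this may be acceptable, but it is a property of the pattern rather than of the loop.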

components/DownloadPdfButton.vue

+  // Add remaining text after last link
+  if (lastIndex < html.length) {
+    const remainingText = html.substring(lastIndex);
+    const cleanRemaining = remainingText.replace(/<[^>]+>/g, '');


The best way to fix the problem is to avoid simple one-pass regexes for removing HTML tags, as explained. You have two main choices:

Use a well-tested sanitizer library. In Vue.js/JavaScript, a highly recommended choice is the sanitize-html npm package, which reliably and recursively removes all dangerous tags and attributes, handling even fragmented or nested input.

If use of a library is not an option here, repeatedly apply the replace() operation until no more changes occur, as shown in the background examples.

Given only the shown code is editable, and library installation may not always be possible, a robust and generic solution is to repeatedly apply the regex replacement in parseHtmlForPdfLinks, for each location where we strip tags. Concretely, this means:

On lines 85, 102, and 110, rather than doing a single .replace(/<[^>]+>/g, ''), use a helper function that loops until the string stops changing.

Implement a helper function (locally or above parseHtmlForPdfLinks) called stripHtmlTags(input), which repeatedly applies the regular expression replacement as demonstrated in the Background.

If using a library is allowed (project uses npm), you could install sanitize-html and use that instead everywhere tags are stripped for much better results.

components/DownloadPdfButton.vue

+
+  // If no links found, return the cleaned HTML
+  if (segments.length === 0) {
+    const cleanText = html.replace(/<[^>]+>/g, '');


The best way to fix this issue is to use a well-tested HTML sanitization library (like sanitize-html) to safely remove all HTML tags and ensure no unsafe content remains. However, if dependencies are to be minimized, another robust approach is to repeatedly apply the regex replacement until no matches remain (as shown in the background), since this will ensure that multi-character overlapping matches are removed. This applies specifically to the region where html.replace(/<[^>]+>/g, '') is used (line 110 and elsewhere for tag removal).

To implement the change:

Replace direct single-pass usages of .replace(/<[^>]+>/g, '') with a loop that repeatedly strips all HTML tags until none remain.

Optionally, encapsulate this logic in a helper function (for maintainability).

Insert the helper function above its usage in the file.

Update usages to call the new helper method.

No imports are needed for this approach. Changes are all in file components/DownloadPdfButton.vue, in the region with the PDF HTML parsing/cleaning logic.
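To show how the helper fits into the parsing logic, here is a hedged sketch of a segment parser shaped like the fragments of parseHtmlForPdfLinks visible in this pull request. The anchor pattern `ANCHOR`, the use of `matchAll`, and the `url` field on link segments are assumptions made for illustration; the real function body is not shown in full here.

```javascript
// Multi-pass tag stripper, as suggested in this pull request.
function stripHtmlTags(input) {
  if (!input || typeof input !== 'string') return input;
  let output = input;
  let prev;
  do {
    prev = output;
    output = output.replace(/<[^>]+>/g, '');
  } while (output !== prev);
  return output;
}

// ASSUMPTION: a simple anchor pattern; the real one is not shown in the diff.
const ANCHOR = /<a\s[^>]*?href="([^"]*)"[^>]*>([\s\S]*?)<\/a>/gi;

function parseHtmlForPdfLinks(html) {
  if (!html || typeof html !== 'string') return [{ text: html || '', isLink: false }];
  const segments = [];
  let lastIndex = 0;
  for (const match of html.matchAll(ANCHOR)) {
    // Add text before the link
    if (match.index > lastIndex) {
      const cleanBefore = stripHtmlTags(html.substring(lastIndex, match.index));
      if (cleanBefore) segments.push({ text: cleanBefore, isLink: false });
    }
    // ASSUMPTION: link segments carry the href in a `url` field.
    segments.push({ text: stripHtmlTags(match[2]), isLink: true, url: match[1] });
    lastIndex = match.index + match[0].length;
  }
  // Add remaining text after last link
  if (lastIndex < html.length) {
    const cleanRemaining = stripHtmlTags(html.substring(lastIndex));
    if (cleanRemaining) segments.push({ text: cleanRemaining, isLink: false });
  }
  // If nothing survived cleaning, return the cleaned HTML as one segment
  if (segments.length === 0) return [{ text: stripHtmlTags(html), isLink: false }];
  return segments;
}

console.log(parseHtmlForPdfLinks(
  '<p>See <a href="https://example.com/spec">the <b>spec</b></a> for details.</p>'
));
```

With the sample input, the sketch yields three segments: the text before the link, the link itself with its inner tags removed, and the trailing text.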

components/DownloadPdfButton.vue

+  });
+
+  // Strip any remaining HTML tags
+  out = out.replace(/<[^>]+>/g, '');


To correctly remove all HTML tags, including potentially dangerous or nested occurrences, we should either (a) use a well-known and robust library such as sanitize-html to sanitize the HTML string, or (b) if that is not available, repeatedly apply the regular expression replacement until no further replacements occur. The second approach can be implemented by running the replacement in a loop until the output no longer changes, which ensures that nested or consecutive tags are also removed. As we are only permitted to modify existing code in components/DownloadPdfButton.vue, we will update the implementation of htmlAnchorsToPlainText to use this approach, with no external dependencies. No code outside this function will change.

Edit the function htmlAnchorsToPlainText.

Replace the line out = out.replace(/<[^>]+>/g, ''); with a loop that continues replacing tags until none remain (out remains unchanged by the replacement).

No imports or other project files need to be changed.
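For context, the inline variant described above might sit in a function along these lines. Only the trailing tag-stripping loop comes from this pull request; the anchor pattern, the `label (href)` output format, and the empty-string fallback for invalid input are hypothetical stand-ins for the parts of htmlAnchorsToPlainText that are not shown.

```javascript
function htmlAnchorsToPlainText(html) {
  // ASSUMPTION: invalid input yields an empty string.
  if (!html || typeof html !== 'string') return '';

  // ASSUMPTION: anchors are rewritten as "label (href)"; the real callback
  // body is not visible in the diff.
  let out = html.replace(
    /<a\s[^>]*?href="([^"]*)"[^>]*>([\s\S]*?)<\/a>/gi,
    (_match, href, label) => `${label} (${href})`
  );

  // Strip any remaining HTML tags, repeating until nothing changes.
  let previous;
  do {
    previous = out;
    out = out.replace(/<[^>]+>/g, '');
  } while (out !== previous);
  return out;
}

console.log(htmlAnchorsToPlainText(
  '<p>Read <a href="https://example.com/doc">the <i>docs</i></a>.</p>'
)); // → "Read the docs (https://example.com/doc)."
```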

addshore added 30 commits September 5, 2025 15:55

sitemap config

51933cb

Rework the device API, protocol and message page layouts

0fc9247

Intro to device messaging changes

94157a4

Add favicons

af64f1a

A first pass on RTK improvements

bc780bf

More spec downloads + booklet for rh2

acf264e

link more spec terms

e76e62b

RH2 has ESP32-C6

9dc861f

fix RH2 STM & Link datasheet urls

f5e0ea9

rtk: Section on correction data

29fd5e6

Use a single image for VT3 install tools

b90463c

admin: Split all devices doc pages

5b3b5ee

Add RTK admin portal basic docs

cccac34

Fix links in spec PDF render

a9d6645

Merge branch 'production'

c0d5d3d

Don't use archive.org on history page

e87340e

Rework make devices page

e72d7be

Update device-specs/rtk/v2 from booklet

1cdc8ca

Add RH2 sub pages

cf33e8e

menu and link tweaks around sdk and rtk

d4a1663

shiki add toit language highlighting

3e0cc41

markdown pluginx, mainly for images, css, spelling etc

b5a79ef

no longer import defineProps

c50c7cf

Include toit code examples from the library

1da966e

Link from toit startg to example

55b4cc7

Dont collapse protocol and messages menus

f3890c0

Include toit tmLanguage.json from submodule

98a37a7

Github actions need submodules

e0670d8

Update protocol changes

2dbabae

Plugin for markdownit YAML inclusion...

fa0144f

addshore and others added 9 commits September 18, 2025 15:49

PayloadTable from pre loaded data

a8092a3

GenerateConsts with pre loaded data

54a650f

Use yaml-data throughout for protocol YAML

821721a

ProtocolBytes dont load yaml, get provided it

73b2a7f

Build fixes

36d1082

minor fixes to rh2 spec

58226db

Fix weight of rh2

15eeddf

reorder spec sections

75795b5

Remove YAML download button for specs

f836f8e

github-advanced-security bot found potential problems Oct 1, 2025

chris-lightbug merged commit e3a43d6 into production Oct 1, 2025
5 of 6 checks passed

@@ -70,6 +70,18 @@
              * @param {string} html - HTML string that may contain anchor tags
              * @returns {Array} Array of text segments with link information
              */
+            // Repeatedly remove all HTML tags until none remain
+            function stripHtmlTags(input) {
+              if (!input || typeof input !== 'string') return input;
+              let output = input;
+              let prev;
+              do {
+                prev = output;
+                output = output.replace(/<[^>]+>/g, '');
+              } while (output !== prev);
+              return output;
+            }
             function parseHtmlForPdfLinks(html) {
               if (!html || typeof html !== 'string') return [{ text: html || '', isLink: false }];
@@ -82,7 +94,7 @@
                 // Add text before the link
                 if (match.index > lastIndex) {
                   const beforeText = html.substring(lastIndex, match.index);
-                  const cleanBefore = beforeText.replace(/<[^>]+>/g, '');
+                  const cleanBefore = stripHtmlTags(beforeText);
                   if (cleanBefore) {
                     segments.push({ text: cleanBefore, isLink: false });
                   }
@@ -99,7 +111,7 @@
               // Add remaining text after last link
               if (lastIndex < html.length) {
                 const remainingText = html.substring(lastIndex);
-                const cleanRemaining = remainingText.replace(/<[^>]+>/g, '');
+                const cleanRemaining = stripHtmlTags(remainingText);
                 if (cleanRemaining) {
                   segments.push({ text: cleanRemaining, isLink: false });
                 }
@@ -107,7 +119,7 @@
               // If no links found, return the cleaned HTML
               if (segments.length === 0) {
-                const cleanText = html.replace(/<[^>]+>/g, '');
+                const cleanText = stripHtmlTags(html);
                 return [{ text: cleanText, isLink: false }];
               }

@@ -70,6 +70,17 @@
              * @param {string} html - HTML string that may contain anchor tags
              * @returns {Array} Array of text segments with link information
              */
+            // Helper function for robust tag stripping
+            function stripHtmlTags(input) {
+              if (!input || typeof input !== "string") return input;
+              let previous;
+              do {
+                previous = input;
+                input = input.replace(/<[^>]+>/g, '');
+              } while (input !== previous);
+              return input;
+            }
             function parseHtmlForPdfLinks(html) {
               if (!html || typeof html !== 'string') return [{ text: html || '', isLink: false }];
@@ -82,7 +93,7 @@
                 // Add text before the link
                 if (match.index > lastIndex) {
                   const beforeText = html.substring(lastIndex, match.index);
-                  const cleanBefore = beforeText.replace(/<[^>]+>/g, '');
+                  const cleanBefore = stripHtmlTags(beforeText);
                   if (cleanBefore) {
                     segments.push({ text: cleanBefore, isLink: false });
                   }
@@ -99,7 +110,7 @@
               // Add remaining text after last link
               if (lastIndex < html.length) {
                 const remainingText = html.substring(lastIndex);
-                const cleanRemaining = remainingText.replace(/<[^>]+>/g, '');
+                const cleanRemaining = stripHtmlTags(remainingText);
                 if (cleanRemaining) {
                   segments.push({ text: cleanRemaining, isLink: false });
                 }
@@ -107,7 +118,7 @@
               // If no links found, return the cleaned HTML
               if (segments.length === 0) {
-                const cleanText = html.replace(/<[^>]+>/g, '');
+                const cleanText = stripHtmlTags(html);
                 return [{ text: cleanText, isLink: false }];
               }

@@ -70,6 +70,16 @@
              * @param {string} html - HTML string that may contain anchor tags
              * @returns {Array} Array of text segments with link information
              */
+            // Helper function to robustly remove all HTML tags (multi-pass)
+            function stripHtmlTags(input) {
+              let previous;
+              do {
+                previous = input;
+                input = input.replace(/<[^>]+>/g, '');
+              } while (input !== previous);
+              return input;
+            }
             function parseHtmlForPdfLinks(html) {
               if (!html || typeof html !== 'string') return [{ text: html || '', isLink: false }];
@@ -82,7 +92,7 @@
                 // Add text before the link
                 if (match.index > lastIndex) {
                   const beforeText = html.substring(lastIndex, match.index);
-                  const cleanBefore = beforeText.replace(/<[^>]+>/g, '');
+                  const cleanBefore = stripHtmlTags(beforeText);
                   if (cleanBefore) {
                     segments.push({ text: cleanBefore, isLink: false });
                   }
@@ -99,7 +109,7 @@
               // Add remaining text after last link
               if (lastIndex < html.length) {
                 const remainingText = html.substring(lastIndex);
-                const cleanRemaining = remainingText.replace(/<[^>]+>/g, '');
+                const cleanRemaining = stripHtmlTags(remainingText);
                 if (cleanRemaining) {
                   segments.push({ text: cleanRemaining, isLink: false });
                 }
@@ -107,7 +117,7 @@
               // If no links found, return the cleaned HTML
               if (segments.length === 0) {
-                const cleanText = html.replace(/<[^>]+>/g, '');
+                const cleanText = stripHtmlTags(html);
                 return [{ text: cleanText, isLink: false }];
               }

@@ -188,7 +188,12 @@
               });
               // Strip any remaining HTML tags
-              out = out.replace(/<[^>]+>/g, '');
+              // Repeatedly remove tags to handle nested/multiple cases
+              let previous;
+              do {
+                previous = out;
+                out = out.replace(/<[^>]+>/g, '');
+              } while (out !== previous);
               return out;
             }
