Does not censor properly when using a markdown parser #1

callmeclover · 2024-03-21T13:08:04Z

If we try to run censor on the text "fuck", it returns "f***" as expected. The issue arises, however, when we attempt to implement a markdown parser like Earmark. This runs us into a predicament, because if we run censor before we run as_html, it would parse the censor as markdown. If we run censor after we run as_html, it would parse self censoring as markdown in some cases, sabotaging the filter. Running it after also makes "fuck" return "f****/p>". We could try to make it run an iterator over some html to censor it, but then we would run into the same issue with self censoring.

Possible solutions:

~~Use a different character than "*" to solve "" -> "/p>"~~
Dispute this issue with Earmark's developers
Implement our own markdown parser
Use a different markdown parser (probably a rust crate like pulldown)
Don't support markdown officially

I'm probably going to try pulldown before implementing our own parser.

UPDATE: Closer testing reveals that running censor after as_html does not affect self censoring. I will only be forwarding the "" -> "/p>" section of this to finnbear's repo.

EDIT: Solution 1 won't work, since it detects all characters. Either add whitespace at the end of a string (probably not, that's just wasting characters) or remove the tags that earmark/pulldown adds.

callmeclover added the bug Something isn't working label Mar 21, 2024

callmeclover self-assigned this Mar 21, 2024

callmeclover mentioned this issue Mar 21, 2024

Filter falsely detects characters at the end of swears as part of the swear finnbear/rustrict#24

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does not censor properly when using a markdown parser #1

Does not censor properly when using a markdown parser #1

callmeclover commented Mar 21, 2024 •

edited

Does not censor properly when using a markdown parser #1

Does not censor properly when using a markdown parser #1

Comments

callmeclover commented Mar 21, 2024 • edited

callmeclover commented Mar 21, 2024 •

edited