Add profanity/offensive words filter attribute #72

Ninajoy · 2019-11-16T15:52:12Z

No idea if i am on the right track as to why curse words appear differently in the transcript of SpeechRecognitionResult in different browsers. Therefore thought it best to open an issue here.

Question
If browsers implement the transcript SpeechRecognitionResult in such a way where the output differs maybe a profanity filter attribute could be useful so that the developer using the API has a choice in that matter? For example offensiveWordFilter attribute, of type boolean?

Background Story
While experimenting with the SpeechRecognition Interface
in the phrase-matcher from https://github.com/mdn/web-speech-api/ the following occurred:

When using Chrome and saying a curse word like shit, the transcript in SpeechRecognitionResult is censored as s****
When using Firefox Nightly and saying a curse word like shit the transcript in SpeechRecognitionResult is not censored

In neither Chrome nor Nightly this type of censoring is applied for the speechSynthesis interface as used in the speak-easy-synthesis.

In my search into why this happens i found the following:
On https://github.com/chromium/chromium/blob/master/content/browser/speech/speech_recognition_engine.cc on line 277 filter_profanities is set to false on line 579 it should result in pFilter=0. According to https://stackoverflow.com/questions/15030339/remove-profanity-censor-from-google-speech-recognition/15071054 the setting pfilter=0 results in removing the profanity filter. Which could lead to the conclusion in chrome this is changed. I do not feel confident in this conclusion however.

In Nightly I have found no reference in the code to a profanity filter https://dxr.mozilla.org/mozilla-central/source/dom/media/webspeech/recognition

marcoscaceres · 2019-11-18T00:36:52Z

That seems like a bug in Chrome (or Google's speech service). The recognition engine should be profanity agnostic. The consuming application should then do its own filtering.

Ninajoy · 2019-12-13T11:06:47Z

Thank you for your answer.

I can see a bug was registered for this in chromium: https://bugs.chromium.org/p/chromium/issues/detail?id=804812&q=speech%20censored&colspec=ID%20Pri%20M%20Stars%20ReleaseBlock%20Component%20Status%20Owner%20Summary%20OS%20Modified

In the HTML Speech Incubator document in chapter 7.1.2.3 Builtin Default Grammars on https://www.w3.org/2005/Incubator/htmlspeech/XGR-htmlspeech-20111206/ the following was included: It is recommended that speech services support a filter parameter that can be set to the value noOffensiveWords to represent a desire to not recognize offensive words.

Would it therefore be handy, to prevent further misunderstandings about this subject, to change my request to include in the speech-api documentation that the engine should be profanity agnostic?

evanbliu · 2024-06-26T21:08:00Z

FYI, Chrome just updated its implementation of the Web Speech API to remove profanity masking. This change will take effect in release M127.

udonko777 mentioned this issue Apr 16, 2024

Chromeで実行時、独自のフィルタリングが特定の単語に対して正しく機能しない sayonari/jimakuChan#4

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add profanity/offensive words filter attribute #72

Add profanity/offensive words filter attribute #72

Ninajoy commented Nov 16, 2019

marcoscaceres commented Nov 18, 2019

Ninajoy commented Dec 13, 2019

evanbliu commented Jun 26, 2024

Add profanity/offensive words filter attribute #72

Add profanity/offensive words filter attribute #72

Comments

Ninajoy commented Nov 16, 2019

marcoscaceres commented Nov 18, 2019

Ninajoy commented Dec 13, 2019

evanbliu commented Jun 26, 2024