-
-
Notifications
You must be signed in to change notification settings - Fork 264
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make it possible to collect voice answers to free text questions #3426
Comments
It would be very interesting to look at the browser capabilities and see if there is any way to manipulate the audio recording to anonymize the voice of the whistleblower. Examples of audio manipulations changing voice pitch can be found for example at: https://github.com/urtzurd/html-audio In scientific literature altering voice pitch like typically done in tv shows is a disguise that do not make it sense at all as analyzing the sound it is possible to try to restore the original. the scientific community agrees seems to agree instead that best solution i typically to do voice-to-text and text-to-voice to ensure there is no timber track in the record. the alternative in the middle is implementing a vocoder as described here: https://security.stackexchange.com/questions/227146/is-changing-pitch-enough-for-anonymizing-a-persons-voice vocoders are typically researched for performances in order to be used for real time communications. In the context of whistleblowing and globaeleas instead i think performances are not such a big deal as long that the voice is properly anonymized. regarding the approach to be adopted probably it would be best if we could achieve a full implementation of a vocoder directly on the browser to avoid to transfer the original voice record to the server and continuing our research targeting at implementing a fully end to end encryption system The community of reference on this topic seems to be the Voice Privacy Challenge community. Among the novel literature seems interesting the work "Research in methods for achieving secure voice anonymization - Evaluation and improvement of voice anonymization techniques for whistleblowing" by ERIK HELLMAN and MATTIAS NORDSTRAND performed at KTH in synergy with the whistleblowing operator Nebulr AB: http://kth.diva-portal.org/smash/get/diva2:1678546/FULLTEXT01.pdf |
@Voice-Privacy-Challenge @Natalia-T @brijmohan @TonyWangX: are you aware of any research effort on lightweight implementations that could be implemented in the context of a browser in JavaScript? thank you |
@vebmaylrie: we are testing your work (https://github.com/sarulab-speech/lightweight_spkr_anon) and it seems that current algorithms requires training and any results seems to not hide the gender of the speaker. Are you aware of any vocoder that consider this issue and:
Thank you for your feedback on this matter. |
@evilaliv3 |
Very interesting Abdul. In my opinion it would be very interesting first to try to see if for lightweight but effective approaches that we could possibly run on the client. |
functionality so far |
Very nice update @msmannan00 ! I see that you considered to add a flag to every type of question to enable the feature. In relation to the UI i like it in its simpicity. Probably if possible i would hide the player interface when the message has still not been recorded. What do you think? \cc @gianlucagilardi |
Hi all, very nice update!
What would you think of the option to enable to collect a voice answer for
all questions in a questionnaire, while still having the options to write
an answer? I.e. the normal questionnaire but with a mic-symbol next to each
question?
…On Sat, 10 Jun 2023, 17:35 Giovanni Pellerano, ***@***.***> wrote:
Very nice update @msmannan00 <https://github.com/msmannan00> !
I see that you considered to add a flag to every type of question to
enable the feature.
I would probably suggest to just add specific new type of question "Voice
message"
In relation to the UI i like it in its simpicity. Probably if possible i
would hide the player interface when the message has still not been
recorded.
What do you think?
\cc @gianlucagilardi <https://github.com/gianlucagilardi>
—
Reply to this email directly, view it on GitHub
<#3426 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AZXMC4LUKVRDJZODKIRXXTDXKSH35ANCNFSM6AAAAAAXBSDU4M>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Sure, that sounds easier and much more cleaner would change it accordingly |
Thank @msmannan00; actually i'mve very in doubt on the final choice on this. Based on the suggestion of @danielvaknine , that i know is similar to your original idea i suggest the following We probably want all the questionnaire to be able to switch from written to oral, so probably the setting should be a setting of the channel. what do you all think? This ticket deals specifically with free text questions but i think in our next meeting it would be great to brainstorm on what to do for the other type of questions like checkboxes, terms of services, dropdowns; i suggest to keep them as they are because they could be related to mandatory consents that is impossible to handle orally or conditional workflows |
An other topic that comes to my mind is how to validate a mandatory question:
|
yes, that was something i was already thinking, will look into it |
Hello!
I just thought of the following that I think could represent a very
flexible simple implementation that would solve most of the questions
arised.
The idea is to:
- Let's suppose to have the possibility to configure questions of type
"Voice message"
- Let's suppose to have the possibility to set them mandatory
- Let suppose the possibility to have the possibility to set minimum and
maximum lengths on terms of seconds instead of characters
- let's suppose the possibility to add an oral questionnaire alternative to
every questionnaire or question by setting an UID in a configuration called
"Oral alternative"
- let's suppose the possibility to configure in the advanced configuration
a disclaimer to be shown when the user decide to fill the oral alternative
for the questionnaire.
With such a possibility we would have the possibility to always let the
user decide when they are asked to fill a questionnaire.
What do you all think?
What I think would be more critical is to design a nice and easy interface
to ask: Would you like to answer with written answers or by voice l. It
could be an explicit question or a slider option maybe followed eventually
by the addition of the configured warning, if present. Do anyone have ideas
about this?
|
no i believe having a seperate answers is better as compared to text interface with having a voice would be confusing to non it guys. the properties however for questionare that you mentioned like mandatory is a good idea but since deadline is almost end of the month would we have already added option to make audio question mandatory and to set the minimum required seconds. its shown in images below |
actualsound.webm This feature belongs to vasilis build but its just so that if you thing this is better we can integrate it and work on it more in the future. Transformation applied
|
Thank you @msmannan00 for this update! seems all very interesting. What i described above about the selection of the type of channel (oral or written) will be actually needed because users will have to decide if they want to make an oral reporting or written and this is up to their choice and needs but i agree we could postpone this to the future and discuss this specific topic a new ticket #3482 For what relates the anonymization of the voice i've created a dedicated ticket #3483
|
Hi all! To answer @evilaliv3 , we think the whistleblower should be able to see/listen to their own recording. Unlike @gianlucagilardi , we also believe the whistleblower should be able to download their recording, just as they can export/download their report today. But it's not a strict requirement from our side so we will leave it up to you. Regarding the actual implementation in the questionnaire – what did we end up with? Will it be:
In our opinion, number 2 is the best option, with an interface like this on every question: Looking very much forward to seeing this great functionality in action! |
@danielvaknine actually these questions are already answered withing discussions and proposals by @gianlucagilardi and @msmannan00 At the moment the implementation that we are performing will enable only the possibility to create a new question type "voice message".
For the topic that you are discussing we opened ticket #3482 As discussed in fact we currently agree to not mix oral reporting and written reporting because as already discussed above only free text questions enable to be replaced by an oral recording; it is impossible to implement dropdown and conditional logics with oral reporting and for this reason we consider. |
Can (s)he? |
sorry for the late reply. what i meant was not from the system itself. for instance think that a person plays the recorded video and when the voice comes out of speaker he puts his mobile in front of it to record that audio coming out of the system. other than that for instance the audio is playing on his system and their are a lot of software for instance imdb that can download that sound even when the software itself doesnt support export because audio is playting in the browser. is something like screen shot as you said. most apps dont allow screenshot but no one can stop him to take picture from another mobile directly from camera :) |
Hello again I am sharing the live demo for you guys to comment better. username: recipient |
Thank you @msmannan00 I like interface. I would suggest, if possible. to not show the player and the lenght if the user has not recorded till the user will start recording. By the way at the moment i'm encountering some issue testing it; does it work for you? I was able to record only once on Chrome, but not on Firefox and Brave. |
Sure, will look into it. I tested in most of browsers but let me check. For the player not showing I had disable it till no recording but would completely hide it in next build till not recorded |
@msmannan00 could you clarify this? If it's intentional – what is it? |
Sure, the shield represents secure audio when it is manipulated. For normal normal audio will be recorded and save with shield altered audio will be uploaded and saved |
I'm sorry but I don't quite understand how it should be operated and I think this might be a confusing extra function for the whistleblower to use without instructions etc |
so moving forward with audio masking at the moment this is the status
For masking at the moment did two major things for the time being
|
Super!
As for the next steps is suggest we could:
- add a standard low pass filter and an high pass filter based on the web audio api (to be always applied); this would help recording only voice and trimming background.
This would enable us to have a rock solid implementation for the clear voice and start testing what to do to anonymize the voice.
Regarding anonymity I consider changing the speed unfortunately won't be of any help. it definitely works with respect to a person passing nearby to the recipient but it can be easily reversed. If you listen to a voice that is altered in speed you recognize this easily and you can automate reversing it to the original speed or near to it. Especially if the change is linear
|
Implemented and merged since 4.12 |
Is there anywhere we could test this function, e.g. https://demo.whistleaks.com/ or https://try.globaleaks.org/ ? |
Proposal
In order to raise the accessibility of the tool, it would be very interesting to implement a feature to make it possible to collect voice answers for free to free text questions.
Motivation and context
The idea of this feature comes in general from the will to continuously improve the accessibility of the tool.
The feature would as well improve compliance with new upcoming laws like the ones inspired from the whistleblowing directive that still include the possibility to include traditional tools based on telephone lines that while not offering the same security of modern digital tools still have large diffusion due to their usability.
The text was updated successfully, but these errors were encountered: