You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If you are playing a song on your laptop, As you increase the volume from 0 to 100, the audio becomes louder and louder.
Say I have an .mp3 or .wav , how do I capture this ^ perceived loudness/intensity at regular intervals (may be 0.1 second) in the audio using python speech features?
Any advice is appreciated.
Thanks
Vivek
The text was updated successfully, but these errors were encountered:
the amplitude of the waveform is the loudness. So to measure the loudness every 0.1 seconds you get the amplitude at that point in time. Maybe you want something else?
Loudness is a perceptual quality of audio related nonlinearly to the audio
waveform. You can approximate it quite well by squaring the time domain
waveform, summing the result and taking the logarithm (cube root can also
work here). If you are interested in rough perceptual loudness per second,
that will do if you sum over 1 second segments of your audio.
If you are playing a song on your laptop, As you increase the volume from 0 to 100, the audio becomes louder and louder.
Say I have an .mp3 or .wav , how do I capture this ^ perceived loudness/intensity at regular intervals (may be 0.1 second) in the audio using python speech features?
Any advice is appreciated.
Thanks
Vivek
The text was updated successfully, but these errors were encountered: