Hackathon project - Voice sentiment analysis and suggestion filtering for chats


About the data set: The dataset used is RAVDESS, which contains two types of recordings, speech and song, from 24 professional actors (12 female, 12 male) vocalizing two lexically matched statements in a neutral North American accent. Speech includes calm, happy, sad, angry, fearful, surprise, and disgust expressions; song contains calm, happy, sad, angry, and fearful emotions. Each expression is produced at two levels of emotional intensity (normal, strong), with an additional neutral expression. For this project, the emotions are grouped into 2 classes: Positive (happy, calm) and Negative (angry, fearful, sad).
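As an illustration, assuming the standard RAVDESS filename convention (the third hyphen-separated field encodes the emotion, e.g. 03-01-06-01-02-01-12.wav), the 2-class labels can be derived roughly like this:

```python
# Emotion codes from the RAVDESS filename convention; only the five
# emotions used in the 2-class setup are mapped.
EMOTION_CODES = {
    "02": "calm", "03": "happy",                  # Positive
    "04": "sad", "05": "angry", "06": "fearful",  # Negative
}
POSITIVE = {"calm", "happy"}

def label_from_filename(filename):
    emotion = EMOTION_CODES.get(filename.split("-")[2])
    if emotion is None:
        return None  # neutral, surprise, and disgust are left out
    return "Positive" if emotion in POSITIVE else "Negative"
```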

Audio augmentation techniques used to increase the dataset size (a sketch with librosa follows the list):

  1. White Noise Addition
  2. Pitch Tuning
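
A minimal sketch of both augmentations with librosa and NumPy; the file name, noise scaling, and pitch-step values are placeholder assumptions:

```python
import numpy as np
import librosa

def add_white_noise(samples, noise_factor=0.005):
    # Mix scaled Gaussian noise into the waveform; noise_factor is an assumed scale.
    return samples + noise_factor * np.random.randn(len(samples))

def tune_pitch(samples, sr, n_steps=2.0):
    # Shift the pitch by n_steps semitones without changing the duration.
    return librosa.effects.pitch_shift(samples, sr=sr, n_steps=n_steps)

samples, sr = librosa.load("example.wav", sr=None)  # placeholder file
augmented = [add_white_noise(samples), tune_pitch(samples, sr)]
```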

Feature extraction from audio (approx. 260 features extracted per clip), using Mel Frequency Cepstral Coefficients (MFCCs). Steps involved:

  1. Take the Fourier transform of (a windowed excerpt of) the signal.
  2. Map the powers of the spectrum obtained above onto the mel scale, using triangular overlapping windows.
  3. Take the logs of the powers at each of the mel frequencies.
  4. Take the discrete cosine transform of the list of mel log powers, as if it were a signal.
  5. The MFCCs are the amplitudes of the resulting spectrum.

Loading audio data and converting it to MFCC format can be done easily with the Python package librosa (see the sketch below).
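A minimal sketch of the extraction with librosa; the sample rate, clip length, and the averaging-across-coefficients recipe are assumptions chosen so the output length lands near the ~260 features mentioned above:

```python
import numpy as np
import librosa

def extract_mfcc_features(path):
    # Load a fixed-length excerpt; sr, duration, and offset are assumptions.
    samples, sr = librosa.load(path, sr=44100, duration=3.0, offset=0.5)
    # librosa.feature.mfcc performs the windowed FFT, mel filterbank,
    # log, and DCT steps listed above internally.
    mfcc = librosa.feature.mfcc(y=samples, sr=sr, n_mfcc=13)  # shape: (13, ~259 frames)
    # Collapsing the 13 coefficients per frame yields roughly 260
    # values per clip, matching the feature count mentioned above.
    return np.mean(mfcc, axis=0)
```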

Our CNN model:

  1. The CNN model is built with Keras and consists of 7 layers: 6 Conv1D layers followed by a Dense layer.
  2. After training, the model's weights are stored in model.h5; the model reaches 71.67% test accuracy.
  3. The model architecture is also exported to a JSON file, which is loaded together with the weights for prediction (a sketch follows the list).
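
A minimal sketch of such a model with the Keras 2.x-style API; the filter counts and kernel sizes are assumptions, while the model.h5 / JSON round-trip mirrors the steps above:

```python
import numpy as np
from keras.models import Sequential, model_from_json
from keras.layers import Conv1D, Flatten, Dense

N_FEATURES = 260  # MFCC feature vector length from the previous section

# Six Conv1D layers followed by a Dense softmax over the two classes.
model = Sequential([
    Conv1D(256, 5, padding="same", activation="relu", input_shape=(N_FEATURES, 1)),
    Conv1D(128, 5, padding="same", activation="relu"),
    Conv1D(128, 5, padding="same", activation="relu"),
    Conv1D(64, 5, padding="same", activation="relu"),
    Conv1D(64, 5, padding="same", activation="relu"),
    Conv1D(32, 5, padding="same", activation="relu"),
    Flatten(),
    Dense(2, activation="softmax"),
])
model.compile(loss="categorical_crossentropy", optimizer="adam", metrics=["accuracy"])

# Persist the architecture as JSON and the weights as model.h5,
# then reload both for prediction.
with open("model.json", "w") as f:
    f.write(model.to_json())
model.save_weights("model.h5")

with open("model.json") as f:
    loaded = model_from_json(f.read())
loaded.load_weights("model.h5")
probs = loaded.predict(np.zeros((1, N_FEATURES, 1)))  # dummy input for illustration
```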

Consolidating speech with chat: Based on the sender's emotion as analysed from their voice, the app shows the sender suitable suggestions for continuing the conversation. The sender's emotion is also sent to the receiver, so the receiver can understand the sender's mood; this helps both sides respond appropriately.
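A hypothetical sketch of the filtering step; the class names and suggestion texts below are illustrative, not taken from the project:

```python
# Canned replies per sentiment class; these strings are placeholders.
SUGGESTIONS = {
    "Positive": ["That's great to hear!", "Sounds good, tell me more."],
    "Negative": ["I'm sorry to hear that.", "Do you want to talk about it?"],
}

def suggestions_for(sentiment):
    # Return only the canned replies matching the detected sentiment.
    return SUGGESTIONS.get(sentiment, [])
```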

COMMANDS TO RUN:

  1. python app.py
  2. Open the URL shown in the console and tap Start Recording.
  3. Filtered suggestions are displayed according to the detected sentiment.
