In [1]:
#import the libraries
import os
import librosa
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt


In [2]:
from google.colab import drive
drive.mount('/content/drive')

Mounted at /content/drive


# ANALYSIS TOOLS

- Clustering and dimension reduction techniques, which help create graphical displays of high-dimensional data containing many variables.
- Univariate visualization of each field in the raw dataset, with summary statistics.
- Bivariate visualizations and summary statistics that allow you to assess the relationship between each variable in the dataset and the target variable you’re looking at.
- Multivariate visualizations, for mapping and understanding interactions between different fields in the data.
- K-means Clustering is a clustering method in unsupervised learning where data points are assigned into K groups, i.e. the number of clusters, based on the distance from each group’s centroid. The data points closest to a particular centroid will be clustered under the same category. K-means Clustering is commonly used in market segmentation, pattern recognition, and image compression.
- Predictive models, such as linear regression, use statistics and data to predict outcomes.
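To make the K-means description concrete, here is a minimal NumPy sketch of the usual assign-then-update loop (Lloyd's algorithm). The function name `kmeans`, its parameters, and the synthetic two-blob data are illustrative assumptions, not part of this notebook's dataset.

```python
import numpy as np

def kmeans(points, k, n_iter=100, seed=0):
    """Lloyd's algorithm: returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct data points chosen at random.
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        # Distance from every point to every centroid, shape (n_points, k).
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Move each centroid to the mean of its points; keep it if its cluster is empty.
        new_centroids = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return centroids, labels

# Synthetic stand-in for per-clip feature vectors: two well-separated blobs.
rng = np.random.default_rng(42)
blob_a = rng.normal(loc=[0.0, 0.0], scale=0.3, size=(50, 2))
blob_b = rng.normal(loc=[5.0, 5.0], scale=0.3, size=(50, 2))
features = np.vstack([blob_a, blob_b])

centroids, labels = kmeans(features, k=2)
```

With real audio, `features` would more plausibly hold one row per clip or segment (for example averaged MFCCs), and a library implementation such as scikit-learn's `KMeans` is usually a better choice than a hand-written loop.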

# TASKS IN EDA
Step 1: Dataset Overview and Descriptive Statistics

Step 2: Feature Assessment and Visualization

Step 3: Data Quality Evaluation
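As a rough illustration of Steps 1 and 3, the sketch below computes a few descriptive and data-quality statistics for a single clip. The helper `describe_clip`, the `clip_threshold` value, and the synthetic sine signals are assumptions made for the example; real clips would come from the loaded .wav files.

```python
import numpy as np

def describe_clip(signal, sr, clip_threshold=0.999):
    """Return basic descriptive and data-quality statistics for one clip."""
    signal = np.asarray(signal, dtype=float)
    finite = signal[np.isfinite(signal)]
    return {
        "duration_s": len(signal) / sr,
        "mean": float(finite.mean()),
        "std": float(finite.std()),
        "peak": float(np.abs(finite).max()),
        # Samples that are NaN or infinite.
        "n_non_finite": int(len(signal) - len(finite)),
        # Share of samples at or near full scale, a hint of clipping.
        "clipped_fraction": float(np.mean(np.abs(finite) >= clip_threshold)),
    }

sr = 16000
t = np.arange(sr) / sr
clean = 0.5 * np.sin(2 * np.pi * 220 * t)
clipped = np.clip(2.0 * np.sin(2 * np.pi * 220 * t), -1.0, 1.0)

clean_stats = describe_clip(clean, sr)
clipped_stats = describe_clip(clipped, sr)
```

Collecting one such dictionary per file into a `pandas.DataFrame` would give a table that `DataFrame.describe()` can summarize across the whole dataset.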

In [5]:
# Load a .wav file.
# librosa.load expects a path to a single audio file, not a directory, so this
# picks the first .wav file in the data folder (assumes the folder contains one).
import librosa.display  # specshow lives here; import it explicitly

audio_dir = "/content/drive/MyDrive/data"
wav_files = sorted(f for f in os.listdir(audio_dir) if f.lower().endswith(".wav"))
audio_path = os.path.join(audio_dir, wav_files[0])
audio_data, sr = librosa.load(audio_path)  # mono, resampled to 22050 Hz by default

# Data preprocessing methods to apply to the .wav files:
# 1. Voice activity detection
# 2. Divide the speech signal into smaller segments
# 3. Group similar segments based on their acoustic features
# 4. Re-segmentation and refinement
# 5. Speaker labelling

# Summary statistics and features for the loaded clip
mean = np.mean(audio_data)
std = np.std(audio_data)
spectrogram = librosa.feature.melspectrogram(y=audio_data, sr=sr)
mfccs = librosa.feature.mfcc(y=audio_data, sr=sr, n_mfcc=40)

# Visualization
# Plot the mel spectrogram on a dB scale
plt.figure(figsize=(10, 4))
librosa.display.specshow(librosa.power_to_db(spectrogram, ref=np.max),
                         sr=sr, y_axis='mel', fmax=8000, x_axis='time')
plt.colorbar(format='%+2.0f dB')
plt.title('Mel spectrogram')
plt.tight_layout()
plt.show()

# Plot the MFCCs
plt.figure(figsize=(10, 4))
librosa.display.specshow(mfccs, sr=sr, x_axis='time')
plt.colorbar()
plt.title('MFCCs')
plt.tight_layout()
plt.show()
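The preprocessing steps listed in the cell comments start with voice activity detection and segmentation. The following is a minimal energy-threshold sketch of those first two steps, written with NumPy only. The function names, the 25 ms / 10 ms framing at 16 kHz, and the `threshold` value are assumptions for illustration; a real pipeline would likely tune these or use a dedicated VAD.

```python
import numpy as np

def frame_rms(signal, frame_length=400, hop_length=160):
    """Root-mean-square energy of each frame."""
    n_frames = 1 + (len(signal) - frame_length) // hop_length
    starts = np.arange(n_frames) * hop_length
    frames = np.stack([signal[s:s + frame_length] for s in starts])
    return np.sqrt(np.mean(frames ** 2, axis=1))

def speech_segments(signal, sr, threshold=0.02, frame_length=400, hop_length=160):
    """Return (start_s, end_s) pairs for runs of frames whose RMS exceeds threshold."""
    active = frame_rms(signal, frame_length, hop_length) > threshold
    # Pad with False so every active run has a detectable start and end.
    padded = np.concatenate([[False], active, [False]]).astype(int)
    edges = np.diff(padded)
    starts = np.flatnonzero(edges == 1)   # first active frame of each run
    ends = np.flatnonzero(edges == -1)    # one past the last active frame
    return [(s * hop_length / sr, ((e - 1) * hop_length + frame_length) / sr)
            for s, e in zip(starts, ends)]

# Synthetic test signal: 0.5 s silence, 1 s tone, 0.5 s silence.
sr = 16000
tone = 0.3 * np.sin(2 * np.pi * 220 * np.arange(sr) / sr)
signal = np.concatenate([np.zeros(sr // 2), tone, np.zeros(sr // 2)])

segments = speech_segments(signal, sr)
```

Each returned segment could then be turned into a feature vector (for example mean MFCCs) and clustered, which corresponds to the "group similar segments" step.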

# EDA trends and the tools for exploring them

Temporal Changes:

Trend: How do the audio features change over time? Are there patterns in short-term or long-term trends?
EDA Tools: Line plots, time-series plots, and spectrograms can help visualize temporal changes.
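One way to look at temporal change is to track a single feature frame by frame and plot it as a line. The sketch below computes a spectral centroid track with a plain NumPy short-time Fourier transform; the function name, the `n_fft` and `hop_length` values, and the synthetic chirp are assumptions for illustration. librosa also provides `librosa.feature.spectral_centroid` for the same purpose.

```python
import numpy as np

def spectral_centroid_track(signal, sr, n_fft=1024, hop_length=256):
    """Spectral centroid (Hz) of each Hann-windowed frame, plus frame-centre times (s)."""
    window = np.hanning(n_fft)
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sr)
    n_frames = 1 + (len(signal) - n_fft) // hop_length
    centroids = np.empty(n_frames)
    for i in range(n_frames):
        frame = signal[i * hop_length:i * hop_length + n_fft] * window
        mag = np.abs(np.fft.rfft(frame))
        # Magnitude-weighted mean frequency; the epsilon guards against silent frames.
        centroids[i] = np.sum(freqs * mag) / (np.sum(mag) + 1e-12)
    times = (np.arange(n_frames) * hop_length + n_fft / 2) / sr
    return times, centroids

# Synthetic chirp whose frequency rises from 200 Hz to 2000 Hz over one second.
sr = 16000
t = np.arange(sr) / sr
chirp = np.sin(2 * np.pi * (200.0 * t + 0.5 * 1800.0 * t ** 2))

times, centroids = spectral_centroid_track(chirp, sr)
```

Plotting `centroids` against `times` with `plt.plot` gives the line plot suggested above; for the chirp the curve rises steadily.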

Correlations Between Features:

Trend: Are there correlations or relationships between different audio features?
EDA Tools: Heatmaps and pair plots can be used to visualize correlations between features.
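A correlation matrix is the data behind a correlation heatmap. The sketch below builds one with `np.corrcoef` on synthetic per-clip features; the feature names and the way the values are generated are assumptions chosen so that one pair is strongly correlated and the other is not.

```python
import numpy as np

rng = np.random.default_rng(0)
n_clips = 200
energy = rng.normal(size=n_clips)
loudness = 2.0 * energy + rng.normal(scale=0.1, size=n_clips)  # built to track energy
pitch = rng.normal(size=n_clips)                                # independent of both

feature_names = ["energy", "loudness", "pitch"]
X = np.column_stack([energy, loudness, pitch])

# rowvar=False treats each column as one variable.
corr = np.corrcoef(X, rowvar=False)

# List feature pairs whose absolute correlation exceeds 0.8.
strong_pairs = [
    (feature_names[i], feature_names[j], float(corr[i, j]))
    for i in range(len(feature_names))
    for j in range(i + 1, len(feature_names))
    if abs(corr[i, j]) > 0.8
]
```

Passing `corr` to `plt.imshow` (with the feature names as tick labels) is one simple way to render it as a heatmap.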

Distribution of Features:

Trend: What is the distribution of each feature? Are there outliers?
EDA Tools: Histograms, box plots, and kernel density plots can help analyze feature distributions.
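Box plots flag outliers with the interquartile range (IQR) rule, and the same rule is easy to apply directly. The sketch below marks values beyond 1.5 IQR from the quartiles and computes histogram counts; the helper name `iqr_outliers` and the synthetic clip durations are assumptions for the example.

```python
import numpy as np

def iqr_outliers(values, k=1.5):
    """Boolean mask of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return (values < lower) | (values > upper)

# Synthetic clip durations in seconds: most near 3 s, plus two unusual clips.
rng = np.random.default_rng(1)
durations = np.concatenate([rng.normal(3.0, 0.2, size=100), [9.0, 0.1]])

outlier_mask = iqr_outliers(durations)
counts, bin_edges = np.histogram(durations, bins=20)
```

`plt.hist(durations, bins=20)` or `plt.boxplot(durations)` would show the same information graphically.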

Class Separation:

Trend: Can the features effectively separate different speech intents or classes?
EDA Tools: Scatter plots, violin plots, and box plots comparing different classes can reveal separation trends.

Mean and Variability Across Classes:

Trend: How do the mean and variability of features differ across different speech intents?
EDA Tools: Grouped bar plots and error bars can help visualize mean and variability across classes.
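The numbers behind a grouped bar plot with error bars are per-class means and standard deviations. A minimal sketch follows; the helper `class_stats`, the intent labels `"question"` and `"command"`, and the feature values are hypothetical.

```python
import numpy as np

def class_stats(values, labels):
    """Map each class label to (mean, sample standard deviation) of its values."""
    values = np.asarray(values, dtype=float)
    labels = np.asarray(labels)
    return {
        str(c): (float(values[labels == c].mean()),
                 float(values[labels == c].std(ddof=1)))
        for c in np.unique(labels)
    }

# Hypothetical per-clip feature values and intent labels.
values = [1.0, 2.0, 3.0, 10.0, 10.0, 10.0]
labels = ["question", "question", "question", "command", "command", "command"]

stats = class_stats(values, labels)
# stats["question"] == (2.0, 1.0) and stats["command"] == (10.0, 0.0)
```

The means can be passed to `plt.bar` and the standard deviations to its `yerr` argument to draw the error bars.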

Seasonality in Pitch or Tone:

Trend: Are there recurring patterns or seasonality in pitch or tone?
EDA Tools: Autocorrelation plots and pitch contour visualizations can reveal patterns.
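Autocorrelation reveals periodicity because a periodic signal correlates strongly with itself when shifted by one period. The sketch below estimates the pitch of a single frame from the strongest autocorrelation peak within a plausible lag range. The function name and the `fmin`/`fmax` search bounds are assumptions; librosa's `librosa.yin` and `librosa.pyin` are more robust estimators for real speech.

```python
import numpy as np

def autocorr_pitch(frame, sr, fmin=50.0, fmax=500.0):
    """Estimate the fundamental frequency (Hz) of one frame from its autocorrelation peak."""
    frame = frame - np.mean(frame)
    # Full autocorrelation; keep non-negative lags only.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    # Restrict the search to lags that correspond to fmax..fmin.
    lag_min = int(sr / fmax)
    lag_max = min(int(sr / fmin), len(ac) - 1)
    best_lag = lag_min + int(np.argmax(ac[lag_min:lag_max + 1]))
    return sr / best_lag

# Synthetic voiced frame: a 200 Hz sine, 2048 samples at 16 kHz.
sr = 16000
frame = np.sin(2 * np.pi * 200.0 * np.arange(2048) / sr)

f0 = autocorr_pitch(frame, sr)
```

Applying the function to successive frames yields a pitch contour, which can be plotted over time to look for recurring patterns.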

Changes in Energy Levels:

Trend: How do the energy levels vary over time or across different speech intents?
EDA Tools: Line plots or bar plots representing energy levels can provide insights.

Formant Patterns:

Trend: How do formants change across different phonemes or speech sounds?
EDA Tools: Spectrograms and formant tracking visualizations can help analyze formant patterns.

Feature Importance:

Trend: Which features contribute the most to distinguishing between speech intents?
EDA Tools: Feature importance plots from machine learning models or statistical tests can provide insights.
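As one example of a statistical test for feature importance, the sketch below computes a one-way ANOVA F statistic per feature: the ratio of between-class variance to within-class variance. The function name `anova_f` and the synthetic two-class data are assumptions; scikit-learn's `sklearn.feature_selection.f_classif` computes the same kind of score.

```python
import numpy as np

def anova_f(X, y):
    """One-way ANOVA F statistic for each column of X, grouping rows by label y."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    classes = np.unique(y)
    n, k = len(y), len(classes)
    grand_mean = X.mean(axis=0)
    ss_between = np.zeros(X.shape[1])
    ss_within = np.zeros(X.shape[1])
    for c in classes:
        group = X[y == c]
        group_mean = group.mean(axis=0)
        ss_between += len(group) * (group_mean - grand_mean) ** 2
        ss_within += ((group - group_mean) ** 2).sum(axis=0)
    # Mean square between classes divided by mean square within classes.
    return (ss_between / (k - 1)) / (ss_within / (n - k))

# Synthetic data: feature 0 differs between the two classes, feature 1 does not.
rng = np.random.default_rng(3)
y = np.repeat([0, 1], 100)
informative = rng.normal(size=200) + 3.0 * y
uninformative = rng.normal(size=200)
X = np.column_stack([informative, uninformative])

f_scores = anova_f(X, y)
```

A bar plot of `f_scores` per feature gives a simple importance ranking; a high F value suggests the feature's class means differ more than its within-class spread would explain.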

Temporal Dynamics:

Trend: How do the features evolve over time, and are there specific patterns during certain time intervals?
EDA Tools: Rolling statistics, dynamic time warping, or time-domain visualizations can help analyze temporal dynamics.
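Dynamic time warping (DTW) compares two sequences while allowing one to be stretched or shifted in time relative to the other. Below is a minimal sketch of the classic dynamic-programming recurrence for 1-D sequences with an absolute-difference cost. The function name and the toy contours are assumptions; `librosa.sequence.dtw` offers a fuller implementation for feature matrices.

```python
import numpy as np

def dtw_distance(a, b):
    """Dynamic time warping distance between two 1-D sequences (absolute-difference cost)."""
    n, m = len(a), len(b)
    # cost[i, j] is the best cumulative cost of aligning a[:i] with b[:j].
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return float(cost[n, m])

contour = [0, 1, 2, 3, 2, 1, 0]
delayed = [0, 0, 1, 2, 3, 2, 1, 0]   # same shape, starts one step later
flat = [3, 3, 3, 3, 3, 3, 3]

same_shape = dtw_distance(contour, delayed)   # 0.0: warping absorbs the delay
different = dtw_distance(contour, flat)       # 12.0: sum of |value - 3|
```

The delayed copy has distance zero even though the sequences differ in length, which is what makes DTW useful for comparing utterances spoken at different speeds.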