
Files

audiodata/audioinfo.py: run from inside the audiodata directory with $ python3 audioinfo.py
audiofiles/splitaudio.py: splits audio into 1-minute segments and writes the new files to the splitfiles directory
textdata/transcribe.py: transcribes a 1-minute WAV file and splits the transcript by sentence
textdata/concatfiles.py: combines all transcriptions into a single file
textdata/transcribe.sh: run from inside the textdata directory with $ ./transcribe.sh [case-number]
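
The splitting step can be sketched as follows. This is a minimal illustration using only the standard-library wave module, not necessarily how splitaudio.py works; the function name split_wav, the chunk_seconds parameter, and the output naming scheme are all assumptions made for the example.

```python
import wave
from pathlib import Path


def split_wav(src_path, out_dir, chunk_seconds=60):
    """Split a WAV file into consecutive chunks of at most chunk_seconds each."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(str(src_path), "rb") as src:
        params = src.getparams()
        frames_per_chunk = src.getframerate() * chunk_seconds
        index = 0
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break  # end of the source audio
            # Hypothetical naming scheme: <original stem>_<chunk index>.wav
            out_path = out_dir / f"{Path(src_path).stem}_{index:03d}.wav"
            with wave.open(str(out_path), "wb") as dst:
                # Copy channel count, sample width and rate from the source;
                # the frame count in the header is corrected on close.
                dst.setparams(params)
                dst.writeframes(frames)
            written.append(out_path)
            index += 1
    return written
```

For example, split_wav("17-204.wav", "splitfiles") would write 17-204_000.wav, 17-204_001.wav, and so on; the last chunk may be shorter than one minute.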

Text

sentences_google.csv: sentences from the Google Speech-to-Text transcript
sentences_ibm.csv: sentences from the IBM Speech to Text transcript

In each row:

full sentence text
sentence start time in audio file
sentence end time in audio file
corresponding audio file id
label: 0 = statement, 1 = question
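
A row in this layout could be loaded along these lines. This is a sketch under several assumptions: the columns appear in the order listed above, there is no header row, and the times are numeric. The Sentence type, the read_sentences function, and the sample rows are made up for illustration and are not taken from the repository.

```python
import csv
import io
from typing import NamedTuple


class Sentence(NamedTuple):
    text: str
    start: float   # sentence start time in the audio file
    end: float     # sentence end time in the audio file
    audio_id: str  # corresponding audio file id
    label: int     # 0 = statement, 1 = question


def read_sentences(handle):
    """Parse rows of (text, start, end, audio id, label) from an open CSV file."""
    rows = []
    for record in csv.reader(handle):
        if len(record) != 5:
            continue  # skip blank or malformed lines
        text, start, end, audio_id, label = record
        rows.append(Sentence(text, float(start), float(end), audio_id, int(label)))
    return rows


# Invented sample data, only to show the expected shape of a row.
sample = io.StringIO(
    'What is the standard here?,12.5,14.0,17-204_003,1\n'
    '"Thank you, counsel.",14.2,15.1,17-204_003,0\n'
)
sentences = read_sentences(sample)
questions = [s for s in sentences if s.label == 1]
```

With a real file, the same function would be called as read_sentences(open("sentences_google.csv", newline="")).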

Audio

Input: vocal speech, one sentence
Output: 13 parameters based on slices of 0.08 sec

min pitch value
max pitch value
pitch range (max-min)
mean pitch
median pitch
check if pitch increases in 2nd half of statement
total pitch increasing
count of increasing slices
total pitch decreasing
count of decreasing slices
check if increasing total > decreasing total
count of nonzero pitches
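
One possible reading of the twelve parameters listed above is sketched below. It assumes the pitch track has already been extracted as one value per 0.08 sec slice, with 0 marking slices where no pitch was detected. The exact definitions are guesses: statistics are taken over nonzero slices only, rises and falls are measured between consecutive nonzero slices, and "increases in 2nd half" is interpreted as the second half having a higher mean pitch than the first. The function name pitch_features and the dictionary keys are hypothetical.

```python
from statistics import mean, median


def pitch_features(pitches):
    """Summarize a per-slice pitch track (0 = no pitch detected) for one sentence."""
    voiced = [p for p in pitches if p > 0]
    if not voiced:
        return None  # nothing to measure

    # Change in pitch between consecutive voiced slices.
    deltas = [b - a for a, b in zip(voiced, voiced[1:])]
    rises = [d for d in deltas if d > 0]
    falls = [-d for d in deltas if d < 0]

    # Assumed interpretation: compare mean pitch of the two halves.
    half = len(voiced) // 2
    first, second = voiced[:half], voiced[half:]
    rises_in_second_half = bool(first) and mean(second) > mean(first)

    return {
        "min_pitch": min(voiced),
        "max_pitch": max(voiced),
        "pitch_range": max(voiced) - min(voiced),
        "mean_pitch": mean(voiced),
        "median_pitch": median(voiced),
        "rises_in_second_half": rises_in_second_half,
        "total_increase": sum(rises),
        "increasing_slices": len(rises),
        "total_decrease": sum(falls),
        "decreasing_slices": len(falls),
        "increase_exceeds_decrease": sum(rises) > sum(falls),
        "nonzero_count": len(voiced),
    }


# Invented pitch track in Hz, one value per slice.
features = pitch_features([0, 100, 110, 105, 0, 120, 130])
```

A rising contour toward the end of a sentence is a common cue for questions, which is why several of these values compare increases against decreases.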

Audio Data Sources

All via https://www.supremecourt.gov/oral_arguments/argument_audio/2018
16-1498
17-204
17-290
17-532
17-571
17-647
17-1091
17-1094
17-1107
17-1174
17-1184
17-1201
17-1299
17-1307
17-1471
17-1484
17-1594
17-1606
17-1625
18-95
18-302
18-389
18-431
18-457
18-459
18-481
18-485
18-525

Important Links

Parameters 1-12 based on: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.563.1655&rep=rep1&type=pdf

Some code based on:

Packages:

Convert MP3 to wav:

https://www.online-convert.com/result/60d91ab3-c873-407f-888f-719c1f4f5006

rvhirsch/QuestionDetection

Folders and files

Latest commit

History

Repository files navigation

Files

Text

Audio

Audio Data Sources

Important Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages