@hipstas

High-Performance Sound Technologies for Access and Scholarship

Helping humanists access and analyze speech audio collections.

Pinned repositories

  1. audio-labeler

    An in-browser app for labeling audio clips at random, using Docker and Flask.

    JavaScript 11 3

  2. audio-ml-lab

    A Dockerized Jupyter notebook environment with pre-installed audio machine learning tools.

    Python 3

  3. audio-tagging-toolkit

    A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archive of Public Broadcasting.

    Python 13

  4. sida

    Speaker Identification for Archives. This repository includes several notebooks that walks through the steps of training and running a classifier that takes speaker labels and the audio, extracts f…

    Jupyter Notebook 2

  5. aapb-speaker-labels

    This repository contains speaker labels in CSV files for training speaker identification classifiers. These speakers appear in a subset of AAPB files.

    Jupyter Notebook 2

  6. kaldi-pop-up-archive

    A Docker image for the Kaldi speech recognition tool + training data from Pop Up Archive

    Perl 8 3

  • A project for creating workflows for generating audio IIIF manifests by HiPSTAS and the Brumfield Labs

    Updated Jan 15, 2019
  • Speaker Identification for Archives. This repository includes several notebooks that walks through the steps of training and running a classifier that takes speaker labels and the audio, extracts features (including vowels), and trains a model and runs it.

    Jupyter Notebook 2 Updated Jun 23, 2018
  • MIT Updated Jun 8, 2018
  • A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archive of Public Broadcasting.

    Python 13 MIT Updated Jun 2, 2018
  • This repository contains speaker labels in CSV files for training speaker identification classifiers. These speakers appear in a subset of AAPB files.

    Jupyter Notebook 2 Updated Jun 2, 2018
  • A Dockerized Jupyter notebook environment with pre-installed audio machine learning tools.

    Python 3 MIT Updated May 14, 2018
  • This repository includes training data and SVM classifier for locating applause in audio recordings.

    1 Updated May 12, 2018
  • This repository contains notebooks with basic and simple workflows for audio processing and analysis in the humanities.

    Jupyter Notebook Updated May 7, 2018
  • Audio classifier output for identifying Marco Werman as a speaker across all the recordings of The World in AAPBin the American Archive of Public Broadcasting. CSV includes start time, duration, confidence level, speaker name

    Jupyter Notebook Updated Dec 17, 2017
  • A Docker image for the Kaldi speech recognition tool + training data from Pop Up Archive

    Perl 8 3 Updated Dec 13, 2017
  • This repository contains preprocessing instructions for building a universal background model for speaker identification in the AAPB corpus.

    Jupyter Notebook 1 Updated Nov 16, 2017
  • A miscellaneous collection of human-approved audio labels.

    Jupyter Notebook Updated Sep 1, 2017
  • An in-browser app for labeling audio clips at random, using Docker and Flask.

    JavaScript 11 3 MIT Updated Aug 28, 2017
  • Data and code for ongoing collaboration between the High-Performance Sound Technologies for Access and Scholarship research group at UT Austin, the WGBH Foundation, and the American Archive of Public Broadcasting.

    Jupyter Notebook 2 Updated Aug 23, 2017
  • These demo notebooks demonstrate how to train and run audio classifiers.

    Jupyter Notebook Updated Aug 18, 2017
  • This repository includes a workshop from the "Shaping Humanities Data" event as part of the "Collections as Data" project at the 2017 Digital Humanities conference in Montreal.

    Jupyter Notebook MIT Updated Aug 15, 2017
  • A machine learning classifier, including training data, for identifying broadcast test tones in audio and video files.

    Jupyter Notebook Updated Aug 15, 2017
  • This repository contains all the pbcore metadata from the AAPB. This includes a script that turns the XML structure into JSON and loads pbcore metadata in mongodb database in order to construct or customize speaker-specific UBMs

    Python MIT Updated Jul 14, 2017
  • This repository contains a demonstration workshop run at Indiana University at Bloomington, March 2017.

    Jupyter Notebook 1 Updated Mar 26, 2017

Most used topics

Loading…

0

People

This organization has no public members. You must be a member to see who’s a part of this organization.