Architecture

#Integrated Radio/Audio Editor

Installation

See INSTALL.md

Tools

Text-based speech editor

Text/speech alignment
Standard cut/copy/paste/delete metaphors
Pause and breath identification and insertion
Duplicate sentence detection

Music selection

Music remixing

Architecture

The speech editor app has the client/server model. The client--the javascript web app--is responsible for all of the interaction. As you edit the speech, the web app changes the underlying state of the audio composition. Then, to actually generate the audio for the composition, the web app sends a request to the server (/reauthor) to build the audio.

Components

`speecheditor.js` - main front-end javascript code

Defines TAAPP, a global variable that controls the state of the app. Key functions of TAAPP include loadSite, newProject, generateAudio, createUnderlay, and drawScript. Most of the functions in this file have fairly descriptive names.

Here are the key things that happen when the site loads (this can be found at the end of the file):

// launch the project creation modal dialog
$('#setupModal')
.modal({
    show: (speech === "")
})
.find('.createProjectBtn')
.click(function () {
    TAAPP.newProject();
});

// start a new project if it was specified in the url
if (speech !== "") {
    TAAPP.newProject(speech);
}

// initialize everything that doesn't depend on the speech track
TAAPP.loadSite();

`edible` - timeline and waveform plugin

edible is a jquery-ui plugin that I wrote to represent the waveforms and timeline in the interface. There are a few different kinds of waveforms in the app: edible.musicWaveform.js, edible.textAlignedWaveform.js, edible.waveform.js, all of which inherit from edible.wfBase.js. There's also edible.timeline.js, which is the timeline itself.

The waveforms are rendered as html5 canvas objects.

`textAreaManager.coffee` - manage the text areas that contain speech

textAreaManager (often referred to as TAM throughout the code) manages the text areas in the UI. These contain the text that can be edited. It's responsible for editing and highlighting the text as the audio plays.

You can, for example, see the keyboard shortcuts defined in the ScriptArea constructor. This gives you a sense of what you can do within a textarea.

The TAM is created in the TAAPP.reset function in speecheditor.js.

`musicbrowser` - sub-app for the music browser

This folder contains the entire music browser app.

`app.py` - python back-end

This is the main server for the web app. Its primary functions are to serve the static web app pages, and to generate audio (and do any intense background processing, like music retargeting).

/: serves the main web app (index.html)
/reauthor: generates the complete audio for the edited story (activated by rendering/pressing play/pressing enter in the web app)
/download/<name>: used to download generated audio (activated by download button in web app)
/dupes: detect duplicate lines in script (activated when a script is loaded in the web app)
/changepoints/<song_name>: finds music change points in a song
/underlayRetarget...: generates a retargeted musical underlay for the story
/uploadSong: uploads and analyzes a song
/alignment/<name>: return the pre-computed transcript-to-speech alignment for the speech track

Name		Name	Last commit message	Last commit date
Latest commit History 163 Commits
music_changepoints		music_changepoints
music_remix		music_remix
static		static
templates		templates
utilities		utilities
.bowerrc		.bowerrc
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
Dockerfile		Dockerfile
Gruntfile.js		Gruntfile.js
INSTALL-OLD.md		INSTALL-OLD.md
INSTALL.md		INSTALL.md
README.md		README.md
Vagrantfile		Vagrantfile
alignment-setup.sh		alignment-setup.sh
analyze_speech.py		analyze_speech.py
app.py		app.py
app.wsgi		app.wsgi
cubic_spline.py		cubic_spline.py
duplicate_lines.py		duplicate_lines.py
package.json		package.json
provision.sh		provision.sh
reauthor_speech.py		reauthor_speech.py
requirements.txt		requirements.txt
wav2json.patch		wav2json.patch

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Tools

Text-based speech editor

Music selection

Music remixing

Architecture

Components

`speecheditor.js` - main front-end javascript code

`edible` - timeline and waveform plugin

`textAreaManager.coffee` - manage the text areas that contain speech

`musicbrowser` - sub-app for the music browser

`app.py` - python back-end

About

Releases

Packages

Languages

ucbvislab/speecheditor

Folders and files

Latest commit

History

Repository files navigation

Installation

Tools

Text-based speech editor

Music selection

Music remixing

Architecture

Components

speecheditor.js - main front-end javascript code

edible - timeline and waveform plugin

textAreaManager.coffee - manage the text areas that contain speech

musicbrowser - sub-app for the music browser

app.py - python back-end

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`speecheditor.js` - main front-end javascript code

`edible` - timeline and waveform plugin

`textAreaManager.coffee` - manage the text areas that contain speech

`musicbrowser` - sub-app for the music browser

`app.py` - python back-end

Packages