Audio Waveform Image Generator

audiowaveform is a C++ command-line application that generates waveform data from either MP3, WAV, or FLAC format audio files. Waveform data can be used to produce a visual rendering of the audio, similar in appearance to audio editing applications.

Waveform data files are saved in either binary format (.dat) or JSON (.json). Given an input waveform data file, audiowaveform can also render the audio waveform as a PNG image at a given time offset and zoom level.

The waveform data is produced from an input stereo audio signal by first combining the left and right channels to produce a mono signal. The next stage is to compute the minimum and maximum sample values over groups of N input samples (where N is controlled by the --zoom command-line option), such that each N input samples produces one pair of minimum and maxmimum points in the output.

Getting started

audiowaveform requires cmake 2.8.7 or later, g++ 4.6.3 or later, and Boost 1.46.0 or later.

The software has been developed on Ubuntu 12.04 and Fedora 18. Due to compiler and library version requirements, the software may not build on earlier operating system releases.

Install package dependencies

Fedora

libmad is available from the RPM Fusion free repository. Before running the following yum command you should follow the instructions here to add this repository, if you have not already done so.

$ sudo yum install git cmake libmad-devel libsndfile-devel \
  gd-devel boost-devel

Ubuntu

$ sudo apt-get install git-core make cmake gcc g++ libmad0-dev libsndfile1-dev \
  libgd2-xpm-dev libboost-filesystem-dev libboost-program-options-dev libboost-regex-dev

Mac OSX

Install XCode and Homebrew, then:

$ brew install cmake
$ brew install libmad
$ brew install libsndfile
$ brew install gd
$ brew install boost --with-c++11

Obtain the source code

$ git clone https://github.com/bbcrd/audiowaveform.git
$ cd audiowaveform

Install googletest and googlemock test frameworks

audiowaveform uses googletest and googlemock for unit testing. Following this advice in the googletest FAQ, download the source and unzip:

$ wget https://googlemock.googlecode.com/files/gmock-1.7.0.zip
$ unzip gmock-1.7.0.zip
$ ln -s gmock-1.7.0 gmock

Build

$ mkdir build
$ cd build
$ cmake ..
$ make

The default build type is Release. To build in Debug mode add -D CMAKE_BUILD_TYPE=Debug to the cmake command above. If you don't want to compile the unit tests add -D ENABLE_TESTS=0.

To compile with clang instead of g++:

$ cmake -D CMAKE_C_COMPILER=/usr/local/bin/clang -D CMAKE_CXX_COMPILER=/usr/local/bin/clang++ ..

Test

$ make test

To see detailed test output:

$ ./audiowaveform_tests

Install

$ sudo make install

By default this installs the audiowaveform program in /usr/local/bin, and man pages in /usr/local/share/man. To change these locations, add a -D CMAKE_INSTALL_PREFIX=... option when invoking cmake above.

Run

$ audiowaveform --help

Command line options

audiowaveform accepts the following command-line options:

Short	Long	Description
	`--help`	Show help message
`-v`	`--version`	Show version information
`-i <filename>`	`--input-filename <filename>`	Input mono or stereo audio (.wav or .mp3) or waveform data (.dat) file name
`-o <filename>`	`--output-filename <filename>`	Output waveform data (.dat or .json), audio (.wav), or PNG image (.png) file name
`-z <level>`	`--zoom <zoom>`	Zoom level (samples per pixel), default: 256. Not valid if `--end` or `--pixels-per-second` is also specified
	`--pixels-per-second <zoom>`	Zoom level (pixels per second), default: 100. Not valid if `--end` or `--zoom` is also specified
`-b <bits>`	`--bits <bits>`	Number of bits resolution when creating a waveform data file (either 8 or 16), default: 16
`-s <seconds>`	`--start <seconds>`	Start time (seconds), default: 0
`-e <seconds>`	`--end <seconds>`	End time (seconds). Not valid if `--zoom` is also specified
`-w <width>`	`--width <width>`	Width of output image (pixels), default: 800
`-h <height>`	`--height <height>`	Height of output image (pixels), default: 250
`-c <scheme>`	`--colors <scheme>`	Color scheme of output image (either 'audition' or 'audacity'), default: audacity
	`--border-color <color>`	Border color (in rrggbb[aa] hex format), default: set by `--colors` option
	`--background-color <color>`	Background color (in rrggbb[aa] hex format), default: set by `--colors` option
	`--waveform-color <color>`	Waveform color (in rrggbb[aa] hex format), default: set by `--colors` option
	`--axis-label-color <color>`	Axis label color (in rrggbb[aa] hex format), default: set by `--colors` option
	`--no-axis-labels`	Render PNG images without axis labels
	`--with-axis-labels`	Render PNG images with axis labels (default)

Usage

In general, you should use audiowaveform to create waveform data files (.dat) from input MP3 or WAV audio files, then create waveform images from the waveform data files.

For example, to create a waveform data file from an MP3 file, at 256 samples per point with 8-bit resolution:

$ audiowaveform -i test.mp3 -o test.dat -z 256 -b 8

Then, to create a PNG image of a waveform, either specify the zoom level, in samples per pixel, or the time region to render.

The following command creates a 1000x200 pixel PNG image from a waveform data file, at 50 pixels per second, starting at 5.0 seconds from the start of the audio:

$ audiowaveform -i test.dat -o test.png --pixels-per-second 50 -s 5.0 -w 1000 -h 200

This command creates a 1000x200 pixel PNG image from a waveform data file, showing the region from 45.0 seconds to 60.0 seconds from the start of the audio:

$ audiowaveform -i test.dat -o test.png -s 45.0 -e 60.0 -w 1000 -h 200

Note that it is not possible to set a zoom level less than that used to create the original waveform data file.

It is also possible to create PNG images directly from either MP3 or WAV files, although if you want to render multiple images from the same audio file, it's generally preferable to first create a waveform data (.dat) file, and create the images from that, as decoding long MP3 files can take significant time.

The following command creates a 1000x200 PNG image directly from a WAV file, at 300 samples per pixel, starting at 60.0 seconds from the start of the audio:

$ audiowaveform -i test.wav -o test.png -z 300 -s 60.0 -w 1000 -h 200

The following command converts a waveform data file (.dat) to JSON format:

$ audiowaveform -i test.dat -o test.json

In addition, audiowaveform can also be used to convert MP3 to WAV format audio:

$ audiowaveform -i test.mp3 -o test.wav

Credits

This program contains code from the following open-source projects, used under the terms of these projects' respective licenses:

License

See COPYING for details.

Contributing

If you have a feature request or want to report a bug, we'd be happy to hear from you. Please either raise an issue, or fork the project and send us a pull request.

Authors

This software was written by Chris Needham, chris.needham at bbc.co.uk.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
cmake/modules		cmake/modules
debian		debian
doc		doc
src		src
test		test
.gitignore		.gitignore
.travis.yml		.travis.yml
CMakeLists.txt		CMakeLists.txt
COPYING		COPYING
ChangeLog		ChangeLog
Doxyfile		Doxyfile
README.md		README.md
VERSION		VERSION

License

isotoma/audiowaveform

Folders and files

Latest commit

History

Repository files navigation

Audio Waveform Image Generator

Getting started

Install package dependencies

Fedora

Ubuntu

Mac OSX

Obtain the source code

Install googletest and googlemock test frameworks

Build

Test

Install

Run

Command line options

Usage

Credits

License

Contributing

Authors

Copyright

About

Resources

License

Stars

Watchers

Forks

Languages