sourabh2k15/deepspeech_timing_diffs

Setup

VM Setup

We've run all the code on a GCP Ubuntu 20.04 VM with 8 V100 GPUs, CUDA 11.7, and cuDNN 8.5. The setup script can be found here: https://gist.githubusercontent.com/sourabh2k15/9dbadd0f5ca35568ca210ee4cb3b19c1/raw/be988210438f91eda85fc608a9c71744a4c8af89/vm_setup.sh

Simply wget the above script and run it to install the required dependencies.
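For example (assuming a standard bash shell; wget saves the script under its gist filename, vm_setup.sh):

wget https://gist.githubusercontent.com/sourabh2k15/9dbadd0f5ca35568ca210ee4cb3b19c1/raw/be988210438f91eda85fc608a9c71744a4c8af89/vm_setup.sh
bash vm_setup.sh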

Code Setup

git clone https://github.com/sourabh2k15/deepspeech_timing_diffs.git

cd deepspeech_timing_diffs

Data Setup

Download the data using:

wget https://transfer.sh/oanN8q/sharded_padded_batch.npz

This gives you one sharded batch of real LibriSpeech data with a global batch size of 256; the batch includes input_paddings and target_paddings (a loading sketch is shown after the shape list below).

Dimensions:

inputs - (8, 32, 320000)

input_paddings - (8, 32, 320000)

targets - (8, 32, 256)

target_paddings - (8, 32, 256)
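For reference, here is a minimal sketch for inspecting the downloaded batch with NumPy; the archive key names are assumed to match the field names listed above (inputs, input_paddings, targets, target_paddings) and may differ in the actual file:

# Inspect the sharded batch; key names are assumed, adjust if they differ.
import numpy as np

batch = np.load("sharded_padded_batch.npz")
for name in batch.files:
    print(name, batch[name].shape, batch[name].dtype)

# Expected shapes per the list above: a leading shard axis of 8 (one shard
# per GPU) and a per-device batch of 32, i.e. a global batch size of 256.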

Reproduction

JAX

Command to run the JAX example:

python3 jax/jax_e2e.py

PyTorch

Command to run the PyTorch example:

torchrun --standalone --nnodes 1 --nproc_per_node 8 pytorch/torch_e2e.py

Results

  1. On an 8×V100 GCP VM, the PyTorch script takes ~80 seconds for 100 steps, which translates to 1.25 steps / second. This is in line with the end-to-end run here: https://tensorboard.dev/experiment/aXqps4I4QlG90gfMRoAaJg/#scalars&_smoothingWeight=0

  2. The JAX script takes 170 seconds for 100 steps, which translates to 0.58 steps / second; the full end-to-end run here shows 0.74 steps / second: https://tensorboard.dev/experiment/BOSgqsQGTGugYJ1pDnPHbw/#scalars&_smoothingWeight=0
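For context, the steps / second figures above are simply the step count divided by wall-clock time. When timing JAX code, a warm-up step (to exclude compilation) and a final block_until_ready() call (because JAX dispatches work asynchronously) are needed for a fair measurement. Below is a minimal, hypothetical sketch of such a measurement with a dummy jitted step; it is not the repository's actual jax_e2e.py.

import time
import jax
import jax.numpy as jnp

@jax.jit
def dummy_step(x):
    # Stand-in for a real train step (forward + backward + optimizer update).
    return jnp.tanh(x @ x.T).sum()

x = jnp.ones((1024, 1024))
dummy_step(x).block_until_ready()  # warm-up: keep compilation out of the timing

num_steps = 100
start = time.time()
for _ in range(num_steps):
    out = dummy_step(x)
out.block_until_ready()  # sync with the device before stopping the clock
elapsed = time.time() - start
print(f"{num_steps / elapsed:.2f} steps / second")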
