BatchFeature should cast to `np.float32` by default #12862

patrickvonplaten · 2021-07-23T15:00:38Z

Currently the default dtype for Speech Feature Extractors is numpy.float64 which leads to two problems:

It makes data processing extremely expensive in terms of RAM. Many audio formats (such as .wav) store samples as int16; converting them to float64 unnecessarily increases RAM usage by a factor of 4. We should at least stick to float32.
Currently we have added some hacks to the Wav2Vec2 and Speech2TextTransformer feature extractors to prevent Double vs. Float dtype mismatches:

transformers/src/transformers/models/wav2vec2/feature_extraction_wav2vec2.py

Line 87 in f6e2544

input_values = [x.astype(np.float32) for x in input_values]
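To make the factor of four concrete, here is a minimal sketch using NumPy only. The waveform is synthetic and the variable names are illustrative; nothing here is taken from the feature extractors themselves.

```python
import numpy as np

# One second of synthetic 16 kHz audio stored as int16, as in a typical .wav file.
pcm = np.zeros(16_000, dtype=np.int16)

# The same samples after conversion to the two float widths under discussion.
as_f64 = pcm.astype(np.float64)
as_f32 = pcm.astype(np.float32)

# Bytes held in memory: int16 uses 2 bytes per sample, float32 uses 4, float64 uses 8.
print(pcm.nbytes, as_f32.nbytes, as_f64.nbytes)
```

Casting to float32 still doubles the footprint relative to int16, but it halves it relative to float64.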

The main problem is that np.asarray([....]) creates an np.float64 array by default, and we just pass that dtype along.
=> We should either always cast to float32 in BatchFeature (see here:

transformers/src/transformers/feature_extraction_utils.py

Line 151 in f6e2544

as_tensor = np.asarray

) or add a flag dtype to BatchFeature.
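The two options could look roughly like the sketch below. `as_float_array` is a hypothetical helper written for illustration, not part of transformers, and leaving integer arrays untouched is an assumption about what a sensible default would do.

```python
import numpy as np

def as_float_array(values, dtype=np.float32):
    """Hypothetical helper: convert to an array, casting floats to `dtype`."""
    array = np.asarray(values)
    # Only cast floating-point data; integer arrays (e.g. masks) keep their dtype.
    if np.issubdtype(array.dtype, np.floating):
        array = array.astype(dtype, copy=False)
    return array

default = np.asarray([0.1, 0.2, 0.3])                    # NumPy's default for Python floats
cast = as_float_array([0.1, 0.2, 0.3])                   # option 1: always cast to float32
explicit = as_float_array([0.1, 0.2, 0.3], np.float64)   # option 2: caller passes a dtype flag
mask = as_float_array([1, 1, 0])                         # integers are left as they are

print(default.dtype, cast.dtype, explicit.dtype, mask.dtype)
```

With a `dtype` parameter that defaults to `np.float32`, both proposals collapse into one code path: callers who need double precision opt in explicitly.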

@patrickvonplaten


patrickvonplaten self-assigned this Jul 23, 2021

huggingface deleted a comment from github-actions bot Aug 24, 2021

patrickvonplaten added the WIP label Aug 24, 2021

patrickvonplaten mentioned this issue Aug 25, 2021

Add Wav2Vec2 & Hubert ForSequenceClassification #13153

Merged

