Add wav2vec2.0 model #1529
Conversation
Quantization tests are failing on the macOS CI, but they pass locally with the latest PyTorch nightly.
@vkuzo Have you ever seen an error like this? I suspect it's a CI / PyTorch nightly package issue, but any insight would be helpful.
shape = (batch_size, length, self.num_heads, self.head_dim)
q = self.q_proj(x).view(*shape).transpose(2, 1)      # B, nH, L, Hd
k = self.k_proj(x).view(*shape).permute(0, 2, 3, 1)  # B, nH, Hd, L
nit: Why permute and not transpose?
I think I need to do transpose twice to achieve this, and I thought permute is more readable.
Ah I see, you're merging the transpose needed for the weights below
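For reference, a quick check (not from the PR) that the single permute used for k is equivalent to the two transposes it replaces:

```python
import torch

x = torch.randn(2, 5, 4, 8)  # B, L, nH, Hd

a = x.permute(0, 2, 3, 1)              # B, nH, Hd, L in one call
b = x.transpose(1, 2).transpose(2, 3)  # same layout via two transposes

assert torch.equal(a, b)
assert a.shape == (2, 4, 8, 5)
```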
shape = (batch_size, length, self.num_heads, self.head_dim)
q = self.q_proj(x).view(*shape).transpose(2, 1)  # B, nH, L, Hd
not blocking: All these projections consume the same input, so you could do it using one linear with 3x the embedding dim. You could also call into nn.MHA altogether.
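A minimal sketch of the fused-projection idea suggested above: one Linear with 3x the embedding dim replaces the separate q/k/v projections, then the output is split. The names and sizes here are illustrative, not the PR's actual code:

```python
import torch
import torch.nn as nn

batch_size, length, embed_dim, num_heads = 2, 5, 16, 4
head_dim = embed_dim // num_heads

x = torch.randn(batch_size, length, embed_dim)
qkv_proj = nn.Linear(embed_dim, 3 * embed_dim)  # fused q/k/v projection

# project once, then split into q, k, v along the feature dim
q, k, v = qkv_proj(x).chunk(3, dim=-1)
shape = (batch_size, length, num_heads, head_dim)
q = q.view(*shape).transpose(2, 1)      # B, nH, L, Hd
k = k.view(*shape).permute(0, 2, 3, 1)  # B, nH, Hd, L
v = v.view(*shape).transpose(2, 1)      # B, nH, L, Hd

attn = torch.softmax(q @ k / head_dim ** 0.5, dim=-1) @ v
assert attn.shape == (batch_size, num_heads, length, head_dim)
```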
mask = torch.arange(max_len).expand(batch_size, max_len) >= lengths[:, None]
x[mask] = 0.0
# extend the mask to attention shape and set weight
mask = -10000.0 * mask[:, None, None, :].to(dtype=features.dtype)
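A toy check (not from the PR) of the padding-mask construction above, assuming lengths holds the valid length of each sequence in the batch:

```python
import torch

batch_size, max_len = 2, 4
lengths = torch.tensor([4, 2])  # valid lengths per batch element

# positions at or past each sequence's length are True (i.e. padding)
mask = torch.arange(max_len).expand(batch_size, max_len) >= lengths[:, None]

assert mask.tolist() == [[False, False, False, False],
                         [False, False, True, True]]
```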
not blocking: There's contention around what the right value here is. ParlAI uses a neginf that depends on the input dtype, nn.MHA and fairseq use float("-inf"), which likely has issues with lower-precision dtypes, and FasterTransformer uses -10000 as well. I think the -10000.0 is fine, but I'm curious about your reasoning behind this.
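A small demonstration of the trade-off being discussed (my own sketch, not from the PR): float("-inf") zeroes the masked weights exactly, but produces NaNs via softmax when an entire row is masked, whereas a large finite value like -10000.0 stays well-defined:

```python
import torch

scores = torch.tensor([1.0, 2.0, 3.0])
mask = torch.tensor([False, False, True])  # mask the last position

w_inf = torch.softmax(scores.masked_fill(mask, float("-inf")), dim=0)
w_big = torch.softmax(scores + (-10000.0) * mask.to(scores.dtype), dim=0)

# both drive the masked weight to (effectively) zero
assert w_inf[-1].item() == 0.0
assert w_big[-1].item() < 1e-6

# with a fully masked row, -inf yields NaNs while -10000 stays finite
full = torch.full((3,), True)
w_all_inf = torch.softmax(scores.masked_fill(full, float("-inf")), dim=0)
w_all_big = torch.softmax(scores + (-10000.0) * full.to(scores.dtype), dim=0)
assert torch.isnan(w_all_inf).all()
assert not torch.isnan(w_all_big).any()
```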
This value is from Hugging Face's implementation of Wav2Vec2.0. Let me try float("-inf"), and if the test passes, then I will switch to float("-inf").
I cannot make it work with float("-inf"), so I will stick with -10000.
This means that neither
This PR adds
- Wav2Vec2Model class
- wav2vec2_base
- wav2vec2_large
- wav2vec2_large_lv60k
ref: #1506
supersedes #1525