ValueError: We expect a numpy ndarray as input, got `<class 'NoneType'>` #89

PiotrEsse · 2024-05-07T06:01:10Z

Hi,
I am trying the YouTube-to-text example and getting the following error.

2024-05-07 07:58:17,383 - WARNING - /home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/pyannote/audio/core/io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher enabled, this function is no-op. You can remove the function call.
  torchaudio.set_audio_backend("soundfile")

2024-05-07 07:58:19,889 - ERROR - An error occurred: __init__: could not find match for ^\w+\W
2024-05-07 07:58:20,578 - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
2024-05-07 07:58:21,842 - INFO - Model loaded successfully.
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
2024-05-07 07:58:22,965 - INFO - Transcribing audio...
Traceback (most recent call last):
  File "/home/piotr/WhisperPlus/Tutorial/YoutubeToText.py", line 31, in <module>
    transcript = pipeline(
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/whisperplus/pipelines/whisper.py", line 91, in __call__
    result = pipe(audio_path)
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/transformers/pipelines/automatic_speech_recognition.py", line 285, in __call__
    return super().__call__(inputs, **kwargs)
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/transformers/pipelines/base.py", line 1234, in __call__
    return next(
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/transformers/pipelines/pt_utils.py", line 124, in __next__
    item = next(self.iterator)
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/transformers/pipelines/pt_utils.py", line 269, in __next__
    processed = self.infer(next(self.iterator), **self.params)
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 631, in __next__
    data = self._next_data()
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 675, in _next_data
    data = self._dataset_fetcher.fetch(index)  # may raise StopIteration
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/torch/utils/data/_utils/fetch.py", line 32, in fetch
    data.append(next(self.dataset_iter))
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/transformers/pipelines/pt_utils.py", line 186, in __next__
    processed = next(self.subiterator)
  File "/home/piotr/anaconda3/envs/WhisperPlus38/lib/python3.8/site-packages/transformers/pipelines/automatic_speech_recognition.py", line 410, in preprocess
    raise ValueError(f"We expect a numpy ndarray as input, got `{type(inputs)}`")
ValueError: We expect a numpy ndarray as input, got `<class 'NoneType'>`


kadirnar · 2024-05-07T09:24:07Z

Can you share your code? This error occurs when the input is not a .mp3 file.

kadirnar · 2024-05-07T09:25:12Z

The YouTube video can't be downloaded. It may be related to library versions. I will test it.

2024-05-07 07:58:19,889 - ERROR - An error occurred: __init__: could not find match for ^\w+\W

kadirnar · 2024-05-07T10:03:41Z

There is a bug in the pytube library. I will fix this error.
pytube/pytube#1201

PiotrEsse · 2024-05-07T10:57:00Z

The code is taken straight from your example, with no changes.

If I hardcode a file with audio_path = "/home/piotr/WhisperPlus/Tutorial/zycie.mp3", then it works.

kadirnar · 2024-05-07T11:04:51Z

This function was working 1-2 days ago, but now it gives an error. I can't download videos either, and I don't know why. pytube is not an up-to-date library, so I have started rewriting the function with a different library.

For now, you can manually download the .mp3 file and test with it. The bug is only in the download function.
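
A minimal sketch of a guard for the manual workaround above. It assumes, based on the log and traceback in this thread, that the download step returned `None` after logging its error and that this `None` then reached the transcription pipeline. The helper name `require_audio_file` is hypothetical and not part of WhisperPlus.

```python
import os
from typing import Optional


def require_audio_file(audio_path: Optional[str]) -> str:
    """Return audio_path unchanged if it points at an existing file.

    Hypothetical helper (not part of WhisperPlus): it fails early with a
    readable message instead of letting None reach the ASR pipeline.
    """
    if audio_path is None:
        # Assumption: the downloader returns None when the download fails.
        raise RuntimeError(
            "No audio file was produced; the download step probably failed. "
            "Download the .mp3 manually and pass its path instead."
        )
    if not os.path.isfile(audio_path):
        raise FileNotFoundError(f"Audio file not found: {audio_path}")
    return audio_path
```

Calling `require_audio_file(audio_path)` right before the pipeline call would turn the `NoneType` ValueError into a message that points at the download step.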

kadirnar · 2024-05-07T12:19:25Z

I rewrote this function (download_youtube_to_mp3). I tested it and it works.

kadirnar self-assigned this May 7, 2024

kadirnar added the bug label May 7, 2024

kadirnar mentioned this issue May 7, 2024

Issue: diarization = self.diarization_pipeline( TypeError: 'NoneType' object is not callable #87

Closed

kadirnar linked a pull request May 7, 2024 that will close this issue

Add yt-dlp instead of pytube to download youtube video #92

Merged

kadirnar closed this as completed in #92 May 7, 2024
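
For readers who want to reproduce the fix locally, here is a rough sketch of downloading audio as .mp3 with yt-dlp, the library named in the linked pull request. This is not the code merged in #92. The function names `build_ydl_options` and `download_audio_as_mp3`, the `downloads` directory, and the 192 kbps quality are assumptions made for illustration. It also assumes `yt-dlp` is installed and `ffmpeg` is available on the PATH for the audio conversion.

```python
from pathlib import Path
from typing import Any, Dict


def build_ydl_options(output_dir: str = "downloads") -> Dict[str, Any]:
    """Build yt-dlp options that extract the best audio stream as .mp3."""
    return {
        "format": "bestaudio/best",
        # Name the file after the video id so the final path is predictable.
        "outtmpl": str(Path(output_dir) / "%(id)s.%(ext)s"),
        "noplaylist": True,
        "quiet": True,
        "postprocessors": [
            {
                "key": "FFmpegExtractAudio",  # requires ffmpeg
                "preferredcodec": "mp3",
                "preferredquality": "192",  # assumed bitrate, adjust as needed
            }
        ],
    }


def download_audio_as_mp3(url: str, output_dir: str = "downloads") -> str:
    """Download one video's audio and return the path of the .mp3 file."""
    import yt_dlp  # third-party; imported here so the options can be built without it

    Path(output_dir).mkdir(parents=True, exist_ok=True)
    with yt_dlp.YoutubeDL(build_ydl_options(output_dir)) as ydl:
        info = ydl.extract_info(url, download=True)

    mp3_path = Path(output_dir) / f"{info['id']}.mp3"
    if not mp3_path.is_file():
        # Raise instead of returning None, so a failed download cannot
        # reach the transcription pipeline silently.
        raise FileNotFoundError(f"Expected audio file was not created: {mp3_path}")
    return str(mp3_path)
```

The returned path can then be passed to the pipeline in place of the hardcoded `audio_path` used as a workaround earlier in this thread.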
