Skip to content

Commit

Permalink
[NeuralChat] Fix pydub library import issues (#1285)
Browse files Browse the repository at this point in the history
  • Loading branch information
lvliang-intel committed Feb 20, 2024
1 parent 39064aa commit c37dab2
Show file tree
Hide file tree
Showing 3 changed files with 9 additions and 2 deletions.
6 changes: 6 additions & 0 deletions intel_extension_for_transformers/neural_chat/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -228,6 +228,12 @@ NeuralChat includes support for various plugins to enhance its capabilities:
- [**Caching**](./pipeline/plugins/caching/README.md)
- [**Named Entity Recognition (NER)**](./pipeline/plugins/ner/README.md)

Please be aware that additional libraries are required for various plugins. You can locate a 'requirements.txt' file in each plugin directory. Navigate to the plugin directory and execute 'pip install -r requirements.txt'. For instance, to enable the RAG plugin, run the following commands:
```shell
cd ./pipeline/plugins/retrieval/
pip install -r requirements.txt
```

### Multimodal APIs

In addition to the text-based chat RESTful API, NeuralChat offers several helpful plugins in its RESTful API lineup to aid users in building multimodal applications. NeuralChat supports the following RESTful APIs:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,6 @@
from typing import Optional
from ...cli.log import logger
from fastapi import File, UploadFile, Form
from pydub import AudioSegment
from ...config import GenerationConfig
import base64
import torch
Expand Down Expand Up @@ -86,6 +85,7 @@ async def handle_talkingbot_face_animation(image: UploadFile = File(...),
with open("tmp_audio_bytes", 'wb') as fout:
content = await audio.read()
fout.write(content)
from pydub import AudioSegment
audio = AudioSegment.from_file("tmp_audio_bytes")
audio = audio.set_frame_rate(16000)
# bytes to wav
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,6 @@
from typing import Optional, List
from ...cli.log import logger
from fastapi import File, UploadFile
from pydub import AudioSegment
from ...plugins import plugins, get_plugin_instance
import base64
import torch
Expand Down Expand Up @@ -90,6 +89,7 @@ async def handle_talkingbot_asr(file: UploadFile = File(...), language: str = "a
with open("tmp_audio_bytes", 'wb') as fout:
content = await file.read()
fout.write(content)
from pydub import AudioSegment
audio = AudioSegment.from_file("tmp_audio_bytes")
audio = audio.set_frame_rate(16000)
# bytes to wav
Expand Down Expand Up @@ -122,6 +122,7 @@ async def create_speaker_embedding(file: UploadFile = File(...)):
with open(f"tmp_spk_{file_name}", 'wb') as fout:
content = await file.read()
fout.write(content)
from pydub import AudioSegment
audio = AudioSegment.from_file(f"tmp_spk_{file_name}")
audio.export(f"{spk_id}", format="mp3")

Expand Down

0 comments on commit c37dab2

Please sign in to comment.