Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

配置怎么算的,比如长视频用large的模型,多长时间的视频要用的大概什么样的配置【内存、GPU】 #38

Open
lwstudy opened this issue Apr 29, 2024 · 1 comment

Comments

@lwstudy
Copy link
Contributor

lwstudy commented Apr 29, 2024

有试过的么?比如半小时算长视频要识别的话32G内存+8G显存能用么【看了下faster-whisper Large-v2 模型的13分钟测试数据,4G左右的GPU+4G内存一分钟内完成识别】,不知道这里的faster-whisper Large-v3模型不知道是不是上个月huggingface发的faster-distil-whisper-large-v3,不然在github才v2。

@lwstudy
Copy link
Contributor Author

lwstudy commented Apr 29, 2024

; after update set , please restart the app
; ip:port
web_address=127.0.0.1:9977
;en or zh
lang=zh
; cpu or cuda
devtype=cuda
; int8 or float32 only gpu
cuda_com_type=int8
;Reducing these two numbers will use less graphics memory
beam_size=2
best_of=4
;vad set to false,use litter GPU memory,true is more
vad=true
;0 is use litter GPU,other is more
temperature=6
;false is litter GPU,ture is more
condition_on_previous_text=false
initial_prompt_zh=以下是普通话内容,请转录为中文简体。

【large-v3】
无论怎么配gpu都在2.2的样子。19分钟的视频70s的样子,结果三十分钟的就要到240-300s了

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant