Cache Aware Streaming, missing words at chunk seams. #8598

tomkiddl · 2024-03-06T06:14:25Z


Similar to this question: [https://github.com/NVIDIA/NeMo/discussions/4216#discussioncomment-3436321]

I'm using Online_ASR_Microphone_Demo_Cache_Aware_Streaming with stt_en_fastconformer_hybrid_large_streaming_multi.nemo.

Settings: greedy decoding, a context size of [70:13], and chunk_size = 1040 + 80.

The model often misses words that fall near a seam between chunks or that straddle one.
Is there a way to improve this without increasing the right context?
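For reference, here is a rough sketch of how the chunk size relates to the right context and where the chunk seams land in time. It assumes each encoder output frame covers 80 ms (8x subsampling over a 10 ms feature stride) and that the two numbers in chunk_size are milliseconds. The helper names `chunk_duration_ms` and `seam_times_ms` are made up for illustration and are not part of NeMo.

```python
# Assumption: one encoder output frame spans 80 ms
# (10 ms feature stride x 8x subsampling). Not read from the model config.
FRAME_MS = 80


def chunk_duration_ms(right_context_frames: int, frame_ms: int = FRAME_MS) -> int:
    """Chunk length = lookahead (right context) plus the current frame."""
    lookahead_ms = right_context_frames * frame_ms
    return lookahead_ms + frame_ms


def seam_times_ms(chunk_ms: int, total_ms: int) -> list[int]:
    """Timestamps (ms) where one chunk ends and the next begins."""
    return list(range(chunk_ms, total_ms, chunk_ms))


if __name__ == "__main__":
    chunk_ms = chunk_duration_ms(13)      # right context of 13 frames
    print(chunk_ms)                       # 13 * 80 + 80
    print(seam_times_ms(chunk_ms, 5000))  # seams within the first 5 s
```

Under these assumptions a right context of 13 gives 1040 + 80 = 1120 ms per chunk, so the seams in the first five seconds fall at 1120, 2240, 3360 and 4480 ms. Comparing those timestamps against where the dropped words occur in the audio is one way to confirm that the misses really cluster at the seams.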


Replies: 0 comments
