Excessive Hypothesis Words in Non-English Audios #206

rk-helper · 2024-09-26T10:25:19Z

Hello,

I’m experiencing an issue with transcription in non-English streams, particularly in Russian, where the number of hypothesis words generated is disproportionately large compared to confirmed words. The hypothesis words can be up to half of the confirmed words, which significantly impacts the accuracy and readability of the transcription. This issue is not present when transcribing English streams, where hypothesis words are more appropriately balanced.

Environment:

•	Model: whisper-large-v3 turbo 958mb
•	Device: MacBook Pro M3 Max (36GB RAM)

Video of issue in whisperkit: https://youtu.be/JWEHgKwogG8

The text was updated successfully, but these errors were encountered:

atiorh · 2024-10-16T05:28:22Z

I have replicated this in other languages as well. This requires an algorithmic improvement to the Eager Streaming Mode in order to break out of diverging hypotheses. We are investigating a fix for this!

atiorh self-assigned this Oct 16, 2024

atiorh added the triaged This issue has been looked at and prioritized by a maintainer label Oct 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Excessive Hypothesis Words in Non-English Audios #206

Excessive Hypothesis Words in Non-English Audios #206

rk-helper commented Sep 26, 2024

atiorh commented Oct 16, 2024

Excessive Hypothesis Words in Non-English Audios #206

Excessive Hypothesis Words in Non-English Audios #206

Comments

rk-helper commented Sep 26, 2024

atiorh commented Oct 16, 2024