-
Hi,

It seems to work at the beginning, but after some time it tends to repeat a lot of utterances or detected events. Does whisper.cpp require special parameters for long recordings?

Examples (the [Music] and (Pause.) entries below are wrong; there is speech at those points):

[00:55:14.980 --> 00:55:24.980] I'm going to be able to help you.
[00:23:21.000 --> 00:23:24.000] and that the baby was not in the room,
[00:04:25.000 --> 00:04:40.000] [Music]
[00:09:22.000 --> 00:09:24.000] (Pause.)

I see a lot of similar cases.
Replies: 1 comment 6 replies
-
Search this GitHub repository and the original Whisper one for "hallucinations", "duplicating", "repetition", "repeating", etc.; there are a ton of posts about this. Whisper pushed a potential fix a few weeks back, but it hasn't made its way into whisper.cpp yet, so I have no idea whether it helps.
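To get a rough idea of how much of a long transcript is affected, something like the sketch below could help. It is not part of whisper.cpp; it only assumes the output lines look like the ones quoted in the question (`[start --> end] text`), and the helper names (`parse_segments`, `repeated_texts`) and the `min_count` threshold are made up for illustration.

```python
import re
from collections import Counter

# Assumed line format, taken from the examples in the question:
# [HH:MM:SS.mmm --> HH:MM:SS.mmm] text
LINE_RE = re.compile(
    r"\[(\d{2}:\d{2}:\d{2}\.\d{3}) --> (\d{2}:\d{2}:\d{2}\.\d{3})\]\s*(.*)"
)


def parse_segments(transcript):
    """Return a list of (start, end, text) tuples for lines that match the format."""
    segments = []
    for line in transcript.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            start, end, text = match.groups()
            segments.append((start, end, text.strip()))
    return segments


def repeated_texts(segments, min_count=3):
    """Map each segment text (lowercased) seen at least min_count times to its count."""
    counts = Counter(text.lower() for _, _, text in segments if text)
    return {text: n for text, n in counts.items() if n >= min_count}


if __name__ == "__main__":
    sample = "\n".join([
        "[00:55:14.980 --> 00:55:24.980] I'm going to be able to help you.",
        "[00:55:24.980 --> 00:55:34.980] I'm going to be able to help you.",
        "[00:55:34.980 --> 00:55:44.980] I'm going to be able to help you.",
        "[00:23:21.000 --> 00:23:24.000] and that the baby was not in the room,",
    ])
    for text, n in repeated_texts(parse_segments(sample)).items():
        print(f"{n}x {text}")
```

This only counts exact repeats, so it will also flag legitimately repeated short phrases; treat the result as a hint about where to listen, not as proof of hallucination.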