Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug: Failed to read audio file #556

Open
misters2008 opened this issue Aug 27, 2024 · 3 comments
Open

Bug: Failed to read audio file #556

misters2008 opened this issue Aug 27, 2024 · 3 comments

Comments

@misters2008
Copy link

misters2008 commented Aug 27, 2024

Contact Details

makintosh_2008@ukr.net

What happened?

Hi, i ve downloaded whisper-large-v3.llamafile and added ".exe" to it.
It opened a command window and showed me "whisper server listening at http://127.0.0.1:8080"
i opened tis link in browser and saw an interface to upload an audio file and set a temperature.

i uploaded 10mb .mp3 audiofile and when i submitted it, it showed tha page with only a single phrase: "Failed to read .Wav file"

Meanwhile in the opened command window it is showing this:
"Received request: Howcut.mp3
ID3: converting to wav...
ID3: failed to open audio file: Invalid argument (we support .wav, .mp3, .flac, and .ogg)
error: failed to read WAV file"

how to tackle this?

Version

dont know how to check, since i donwloaded a single .llamafile and dodnt download a separate app. the opened llamafile.exe doesnt let me enter anything into the command line.

What operating system are you seeing the problem on?

Windows

Relevant log output

No response

@WilliamTambellini
Copy link

same issue here on linux.

@WilliamTambellini
Copy link

whisperfile-0.8.13 seems to work though:

 $ ./whisperfile-0.8.13 -m ggml-large-v3.bin -f jfk.wav 
whisper_init_from_file_with_params_no_state: loading model from 'ggml-large-v3.bin'
...
[00:00:00.300 --> 00:00:09.000]   And so, my fellow Americans, ask not what your country can do for you, ask what you
[00:00:09.000 --> 00:00:11.000]   can do for your country.

@gnmarten
Copy link

AFAIK, wav file needs to be in a specific format: convert via ffmpeg -i "your mp3 or whatever file" -ar 16000 -ac 2 -c:a pcm_s16le -y output.wav (I did not have success with the --convert flag within whisperfile itself)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants