First release with pre-built Docker images and support for Whisper v20231117 (https://github.com/openai/whisper/releases/tag/v20231117).
Quick start
You can pull the pre-built image
docker pull ghcr.io/lifeosm/whisper:latest # or v20231117
or build your own:
docker build -t whisper:local .
The image contains no models, so you need to download them first.
docker volume create whisper-models
docker run --rm -it \
--entrypoint python \
-v whisper-models:/root/.cache/whisper \
ghcr.io/lifeosm/whisper:latest \
-c 'import whisper; whisper.load_model("tiny")'
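If you need several models, the download step above can be repeated in a loop. The following is a minimal sketch, assuming the same image and volume names as above. The `download_models` function and the `DOCKER` and `IMAGE` override variables are hypothetical helpers introduced here for illustration; they are not shipped with the image. The `-it` flags are dropped so the loop also works without a TTY.

```shell
#!/bin/sh
# Hypothetical helper, not part of the image: downloads each named model
# into the whisper-models volume, one container run per model.
# Set DOCKER=echo to preview the commands without running them.
DOCKER="${DOCKER:-docker}"
IMAGE="${IMAGE:-ghcr.io/lifeosm/whisper:latest}"

download_models() {
  for model in "$@"; do
    $DOCKER run --rm \
      --entrypoint python \
      -v whisper-models:/root/.cache/whisper \
      "$IMAGE" \
      -c "import whisper; whisper.load_model(\"$model\")" || return 1
  done
}

# Usage: download_models tiny base
```

Each model is cached in the volume, so later runs that mount `whisper-models` should not need to download it again.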
Once the model is downloaded, you can run any Whisper command, e.g.,
docker run --rm -it \
-v whisper-models:/root/.cache/whisper \
-v "$(pwd)/audio.wav":/usr/src/audio.wav \
ghcr.io/lifeosm/whisper:latest \
--model tiny \
--task transcribe \
audio.wav
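To transcribe a whole directory of recordings, the command above can be wrapped in a loop. This is a sketch under stated assumptions: the `transcribe_dir` function, the `DOCKER` and `IMAGE` variables, and the `/usr/src/audio` mount point are hypothetical choices made for this example, and it assumes the container's working directory is `/usr/src`, as the relative `audio.wav` path above suggests. It passes Whisper's `--output_dir` option so the transcripts are written into the mounted directory rather than lost when the container is removed.

```shell
#!/bin/sh
# Hypothetical helper, not part of the image: transcribes every .wav file
# in a host directory, one container run per file.
# Set DOCKER=echo to preview the commands without running them.
DOCKER="${DOCKER:-docker}"
IMAGE="${IMAGE:-ghcr.io/lifeosm/whisper:latest}"

transcribe_dir() {
  # Bind mounts need an absolute host path, so resolve the directory first.
  dir="$(cd "$1" && pwd)" || return 1
  model="${2:-tiny}"
  for file in "$dir"/*.wav; do
    [ -e "$file" ] || continue   # skip the literal pattern when nothing matches
    $DOCKER run --rm \
      -v whisper-models:/root/.cache/whisper \
      -v "$dir":/usr/src/audio \
      "$IMAGE" \
      --model "$model" \
      --task transcribe \
      --output_dir audio \
      "audio/$(basename "$file")" || return 1
  done
}

# Usage: transcribe_dir ./recordings tiny
```

Mounting the directory under a subpath (`/usr/src/audio`) instead of over `/usr/src` itself avoids hiding anything the image may keep in its working directory.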
The complete list of options is printed by --help:
docker run --rm -it ghcr.io/lifeosm/whisper:latest --help
Mind the container's memory limit, since larger models need more RAM. For example, to run the medium model with an 8 GB limit:
docker run --rm -it \
-m 8g \
-v whisper-models:/root/.cache/whisper \
-v "$(pwd)/audio.wav":/usr/src/audio.wav \
ghcr.io/lifeosm/whisper:latest \
--model medium \
--task transcribe \
audio.wav