Releases: lifeosm/whisper

v20231117

25 Nov 18:06
c6f93d9

First release with pre-built Docker images and support for Whisper v20231117 (https://github.com/openai/whisper/releases/tag/v20231117).

Quick start

You can pull the pre-built image:

docker pull ghcr.io/lifeosm/whisper:latest # or v20231117

or build your own from the repository root:

docker build -t whisper:local .

The image contains no models, so you need to download one into a volume first:

docker volume create whisper-models
docker run --rm -it \
  --entrypoint python \
  -v whisper-models:/root/.cache/whisper \
  ghcr.io/lifeosm/whisper:latest \
    -c 'import whisper; whisper.load_model("tiny")'
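To fetch several models into the same volume, the command above can be wrapped in a small function. This is a minimal sketch, not part of the image: `download_model` and the `DRY_RUN` variable are hypothetical names, and `-it` is dropped on the assumption that the helper may run without a terminal.

```shell
# download_model is a hypothetical helper, not something shipped with the image.
# With DRY_RUN=1 it prints the docker command instead of running it.
download_model() {
  model="$1"
  # Build the same command as in the quick start, with the model name substituted.
  set -- docker run --rm \
    --entrypoint python \
    -v whisper-models:/root/.cache/whisper \
    ghcr.io/lifeosm/whisper:latest \
    -c "import whisper; whisper.load_model(\"$model\")"
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Example usage (uncomment to download both models):
# for m in tiny medium; do download_model "$m"; done
```

Running it with `DRY_RUN=1` first is a cheap way to check the generated command before any download starts.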

With the model downloaded, you can run the command you need, e.g.,

docker run --rm -it \
  -v whisper-models:/root/.cache/whisper \
  -v "$(pwd)/audio.wav:/usr/src/audio.wav" \
  ghcr.io/lifeosm/whisper:latest \
    --model tiny \
    --task transcribe \
    audio.wav
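Docker treats a bare name such as `audio.wav` in `-v` as a named volume rather than a host file, so the host side of the mount should be an absolute path. A minimal sketch of a wrapper that resolves the path for you follows. `transcribe` and `DRY_RUN` are hypothetical names, and the sketch assumes, as the command above implies, that the container's working directory is `/usr/src`.

```shell
# transcribe is a hypothetical wrapper, not something shipped with the image.
# Usage: transcribe MODEL FILE
# With DRY_RUN=1 it prints the docker command instead of running it.
transcribe() {
  model="$1"
  file="$2"
  # Resolve the host file to an absolute path so Docker bind-mounts it
  # instead of creating a named volume.
  dir="$(cd "$(dirname "$file")" && pwd)"
  name="$(basename "$file")"
  # Assumption: the image's working directory is /usr/src.
  set -- docker run --rm \
    -v whisper-models:/root/.cache/whisper \
    -v "$dir/$name:/usr/src/$name" \
    ghcr.io/lifeosm/whisper:latest \
    --model "$model" --task transcribe "$name"
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}
```

For example, `transcribe tiny ./recordings/audio.wav` would mount that file and pass `audio.wav` to the container.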

The complete list of options is available via --help:

docker run --rm -it ghcr.io/lifeosm/whisper:latest --help

Mind the container memory limit: larger models need more memory. For example, to run the medium model with an 8 GB limit, you could use the following command:

docker run --rm -it \
  -m 8g \
  -v whisper-models:/root/.cache/whisper \
  -v "$(pwd)/audio.wav:/usr/src/audio.wav" \
  ghcr.io/lifeosm/whisper:latest \
    --model medium \
    --task transcribe \
    audio.wav