Web-based annotation tool for media data.
- Uploading selected audio-files from directory. Currently, only
.wav
.mp3
files are supported. - Manual and auto transcribing for 20+ languages and dialects via Vosk
ℹ️ Auto annotation will use the first channel of
.mp3
. - Export of the speech regions as
zip
ofwavs
andregions.csv
orregions.json
-
Run app as docker image
docker-compose up -d
-
Install conda/miniconda/micromamba and node+npm
-
Clone repository, create python environment using conda manager and activate it
git clone https://github.com/ruslantau/media-annotator cd annotator conda env create -f backend/environment.yaml conda activate annotator
-
Run FastAPI backend
python backend/main.py
-
Install dependencies and run Nuxt frontend
cd frontend npm install npm run build npm run start
- add docker images and setup CI/CD
- extend the list of supported formats (
mp4,flac,avi,etc.
) - running auto annotation on selected region
- adding punctuation
- speaker diarisation