Skip to content

WhisperX containerization #19

@okulovsky

Description

@okulovsky

The rational is to have diarization implemented in the BrainBox, so the routine of the automatic extraction of voices of the character can be built.

WhisperX https://github.com/m-bain/whisperX needs to be containerized and implemented as a BrainBox decider. The most similar decider is VideoToImages.

The input should be video. The output should be

  1. the JSON with all the information about fragments WhisperX provides
  2. The fragments themselves as aac files.

The amount of fragments might be capped to avoid overusing of the disk space.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions