Skip to content

Audio syncing assistance for film/video editors with Python's SpeechRecognition

Notifications You must be signed in to change notification settings

JackyKLai/SlateInPost

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

SlateInPost

Audio syncing assistance for film/video editors with Python's SpeechRecognition

How It Works

SlateInPost transcribes all your media files and compares them, giving you a rough idea of which audio/video clips contain similar contents. It uses a Python package called SpeechRecognition, which requires all media files to be in the .wav format. It is recommended that you convert your files to the supported format in advance, but SlateInPost does provide a conversion option that depends on the installation of FFmpeg.

Getting Started

The app is still a work in progress, releases are for testing only. Releases are available here. You can also build it yourself with PyInstaller. To build the app, you will need SpeechRecognition, ffmpeg-python and PyQt5.

Usage

  1. To start, click on the Import Media Files button and select your files.
  2. SlateInPost outputs a .txt file with the information it collected. Click on the Save output to... button to tell it where the text file should be saved.
  3. Let SlateInPost know how similar the files need to be in order for them to be associated. The minimum is set at 30%.
  4. You can also choose which speech recognition engine to use. Options include Google Speech Recognition (free), Wit.ai, Bing, Houndify and IBM.
  5. As mentioned above, you can also ask SlateInPost to convert your files to .wav for you, but make sure you have FFmpeg installed properly. If you choose to go with this conversion option, you can also decide whether these new .wav files should be deleted.
  6. Action! ;) Audio transcriptions take quite a while to complete. Maybe take that time to go through your footage clip by clip like a good editor should!

About

Audio syncing assistance for film/video editors with Python's SpeechRecognition

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages