This script reduces some of the donkey work involved in preparing speech data for analysis and processing.
It does the following:
- Cleans up the recording by remove low-freqency noise.
- Automatically chunks the recording into separate phrases / repetitions based on regions of silence (using "To Textgrid ( silences)" function).
- Displays the automatic textgrid annotation to allow for manual correction of errors.
- Automatically labels each area of non-silence with a prefix + number(e.g. YNQ_1, BEAG_1)
- Prompts the user to correct any errors
- Backs up previous version of files.
- Saves Chopped up large sounds into "SmallFiles" directory.
Note, the "buffer" options in the menu refer to extra time which will be added to the beginning and end of each interval or short sound file. This is to make sure that the beginning and ends of each target utterance are not accidentally truncated.
Note also, this is a draft version. It has no "memory" (i.e., it will not remember you previous menu choices)
- Convert to plug-in format
- Add "memory"
- Add extra error handling
- Add better instructions!