forked from awslabs/open-data-registry
lab41-sri-voices.yaml
Name: Voices Obscured in Complex Environmental Settings (VOiCES)
Description: |
  VOiCES is a speech corpus recorded in acoustically challenging settings,
  using distant microphone recording. Speech was recorded in real rooms with various
  acoustic features (reverb, echo, HVAC systems, outside noise, etc.). Adversarial noise,
  either television, music, or babble, was concurrently played with clean speech.
  Data was recorded using multiple microphones strategically placed
  throughout the room. The corpus includes audio recordings, orthographic transcriptions,
  and speaker labels.
Documentation: https://voices18.github.io/
Contact: https://github.com/voices18/utilities/issues
UpdateFrequency: Data from two additional rooms will be added to the corpus in Fall 2018.
Tags:
  - aws-pds
  - machine learning
  - automatic speech recognition
  - speaker identification
  - denoising
  - speech processing
License: Creative Commons BY 4.0 (see [here](https://voices.lab41.org) for more details)
Resources:
  - Description: wav audio files, orthographic transcriptions, and speaker ID
    ARN: arn:aws:s3:::lab41openaudiocorpus
    Region: us-east-1
    Type: S3 Bucket
DataAtWork:
  Tutorials:
    - Title: Getting started with VOiCES data
      URL: https://github.com/voices18/utilities/blob/master/Using%20VOiCES%20corpus%20tutorial.ipynb
      AuthorName: M.A. Barrios
  Tools & Applications:
  Publications: