Child-ASR-Paper

A curated list of papers and resources for children's automatic speech recognition.

Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models
Shared-Adapters: A Novel Transformer-based Parameter Efficient Transfer Learning Approach For Children’s Automatic Speech Recognition
Mixed Children/Adult/Childrenized Fine-Tuning for Children’s ASR: How to Reduce Age Mismatch and Speaking Style Mismatch
Improving child speech recognition with augmented child-like speech
Self-Supervised Models for Phoneme Recognition: Applications in Children's Speech for Reading Learning
Children’s Speech Recognition through Discrete Token Enhancement

ICASSP 2024 - updated in 4/1/2024

Normalization and Data Augmentation

Arxiv 2024 - Evaluation of state-of-the-art ASR Models in Child-Adult Interactions
Journal of Electrical Systems 2024 - SVCGAN: Speaker Voice Conversion Generative Adversarial Network for Children's Speech Conversion and Recognition
JASA 2024 - ChildAugment: Data Augmentation Methods for Zero-Resource Children's Speaker Verification
Interspeech 2023 - Data augmentation for children ASR and child-adult speaker classification using voice conversion methods
ICASSP 2023 - Using Modified Adult Speech as Data Augmentation for Child Speech Recognition
Interspeech 2022 - Spectral Modification Based Data Augmentation for Improving End-to-End ASR for Children’s Speech
ICASSP 2022 - LPC Augment: An LPC-Based ASR Data Augmentation Algorithm for Low and Zero-Resource Children's Dialects
Speech Communication 2021 - Fundamental frequency feature warping for frequency normalization and data augmentation in child automatic speech recognition
ICASSP 2021 - Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition
Interspeech 2020 - Data Augmentation Using Prosody and False Starts to Recognize Non-native Children’s Speech
Interspeech 2020 - Voice Conversion Based Data Augmentation to Improve Children’s Speech Recognition in Limited Data Scenario
ASRU 2019 - Data Augmentation Based on Vowel Stretch for Improving Children's Speech Recognition
ASRU 2019 - GANs for Chidren: A Generative Data Augmentation Strategy for Children Speech Recognirion Interspeech 2019 - A Frequency Normalization Technique for Kindergarten Speech Recognition Inspired by the Role of fo in Vowel Perception
IEEE SPL 2019 - Significance of Pitch-Based Spectral Normalization for Children’s Speech Recognition
Interspeech 2016 - Improving Children’s Speech Recognition through Out-of-Domain Data Augmentation

Pretraining + Finetuning

Arxiv 2024 - Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings
IEEE/ACM TASLP 2024 - Effect of Modeling Glottal Activity Parameters on Zero-Shot Children's ASR
IEEE Access 2024 - Exploring Native and Non-Native English Child Speech Recognition With Whisper
Arxiv 2023 - Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Interspeech 2023 - Adaptation of Whisper models to child speech recognition
Under-review Speech Communication 2022 - Improving Children's Speech Recognition by Fine-tuning Self-supervised Adult Speech Representations
IEEE JSTSP 2022 - Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Interspeech 2022 - DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR
Interspeech 2022 - Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
ICASSP 2021 - Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-training and Its Application to Children's ASR
Computer Speech & Language 2020 - Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations
Interspeech 2019 - Improving ASR Systems for Children with Autism and Language Impairment Using Domain-Focused DNN Transfer Techniques
WOCCI 2016 - Improving DNN-Based Automatic Recognition of Non-native Children's Speech with Adult Speech

Special Sessions

Connecting Speech science and Speech technology for Children’s Speech
- Interspeech 2023, Interspeech 2024
MERLIon CCS Challenge: Language Identification on Code-Switched Child-Directed Speech
- Interspeech 2023
ETLT 2021: Shared Task on ASR for Non-Native Children's Speech
- Interspeech 2021
CSRC: Children Speech Recognition Challenge
- SLT 2021
Spoken Language Processing for Children's Speech
- Interspeech 2019

Contributing

This is an active repository and your contributions are always welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Child-ASR-Paper

Table of Contents

Datasets

Papers

Interspeech 2024

ICASSP 2024 - updated in 4/1/2024

Normalization and Data Augmentation

Pretraining + Finetuning

Other topics

Special Sessions

Contributing

About

Contributors 3

License

Diamondfan/Child-ASR-Paper

Folders and files

Latest commit

History

Repository files navigation

Child-ASR-Paper

Table of Contents

Datasets

Papers

Interspeech 2024

ICASSP 2024 - updated in 4/1/2024

Normalization and Data Augmentation

Pretraining + Finetuning

Other topics

Special Sessions

Contributing

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 3