GitHub - asmarufoglu/roboaudio: Analysis of Robot Ego-Noise impact on ASR models (Whisper) & Signal Processing solutions.

Motivation & Context

This repository was created as an exploratory project to bridge my background in signal processing (EEG / biosignals) with audio processing and ASR systems.

Rather than building a production-ready solution, the goal of this work is to:

understand how real-world, noise-heavy audio scenarios affect Transformer-based ASR models,
experiment with basic audio preprocessing techniques,
and gain hands-on experience in model evaluation and reporting, aligned with LLM & Speech-focused R&D roles.

Data Selection Note

The audio samples used in this study were not collected as a standardized dataset. Instead, a small number of scenario-driven recordings were selected from publicly available LTC (Jidoka) quadruped robot videos on YouTube.

These videos were intentionally chosen to:

simulate realistic operational noise (motor hum, footsteps),
keep the analysis controlled and interpretable,
and support qualitative error analysis rather than statistical benchmarking.

This design choice reflects the exploratory nature of the project.

Simple demo interface used to inspect ASR outputs and preprocessing effects during experiments.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
src		src
.gitignore		.gitignore
README.md		README.md
Research_Report.ipynb		Research_Report.ipynb
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Motivation & Context

Data Selection Note

About

Uh oh!

Releases

Packages

Languages

asmarufoglu/roboaudio

Folders and files

Latest commit

History

Repository files navigation

Motivation & Context

Data Selection Note

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages