Skip to content
View StefanoGiacomelli's full-sized avatar

Highlights

  • Pro

Block or report StefanoGiacomelli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
StefanoGiacomelli/README.md

Stefano Giacomelli

Ph.D. candidate in ICT working on Computational Auditory Scene Analysis (CASA) & Deep Learning for Audio.

Researcher | Developer | Electroacoustic Musician


My research focuses on neural audio representations, audio tagging, acoustic scene classification, sound event detection, and efficient inference. Alongside research, I am an electroacoustic music composer, and live performer working at the intersection between machine learning, sound, digital signal processing, and artistic practices β€” someone who designs, studies, and uses sound both as a scientific object and as an artistic material.


πŸ”­ Areas of Interest

  • Research: Computational Auditory Scene Analysis, Deep Learning, Machine Learning, Audio Tagging, Acoustic Scene Classification, Sound Event Detection, Neural Audio Embeddings, Music Information Retrieval, Digital Signal Processing.

  • Artistic Practice: Electroacoustic music composition, live performance, sound design, interactive and immersive audio systems, spatial sound, soundscapes.

πŸ’» Languages and Tools

Category Technologies
ML / DL PyTorch Lightning NumPy Numba CuPy SciPy Scikit-learn
DSP & Analysis LibRoSA Essentia TorchAudio Matplotlib Seaborn Pandas
Audio & Code Max/MSP Reaper JavaScript HTML CSS
Publishing LaTeX Overleaf Zotero Markdown MS_Office

βš™οΈ Selected Projects

  • e2panns β€” From Large Scale Audio Tagging to Real-Time Explainable Emergency Vehicle Siren Detection.
  • audioset-tools β€” A Python/PyTorch utility framework for taxonomy-aware AudioSet curation, dataset preparation, and reproducible audio research.
  • torch-audio-embeddings β€” A Standardized Framework for Deploying and inference with Audio Neural Networks embedding models.
  • StrumKANet β€” Kolmogorov Arnold Network (KAN) NAS Framework for Strumming Pattern Recognition.
  • torch_amt β€” MATLAB Auditory Modeling Toolbox porting (in PyTorch).

πŸ“ Selected Research & Outputs

Selected Publications

  • S. Giacomelli, M. Giordano, C. Rinaldi, and F. Graziosi, "AudioSet-Tools: A Python Framework for Taxonomy-Aware AudioSet Curation and Reproducible Audio Research," EURASIP Journal of Audio, Speech and Music Processing, vol. 2026, no. 2, 2026. DOI: 10.1186/s13636-025-00436-z

  • M. Giordano, S. Giacomelli, C. Rinaldi, and F. Graziosi, "Real-Time Emergency Vehicle Siren Detection with Efficient CNNs on Embedded Hardware," in 2025 IEEE 6th International Symposium on the Internet of Sounds (IS2), 2025, pp. 1-10. DOI: 10.1109/IS264627.2025.11284671

  • M. Pennese, S. Giacomelli, and C. Rinaldi, "A Kolmogorov Arnold Network NAS Framework for Strumming Pattern Recognition in Technology-Enhanced Pop/Rock Music Education," in 2025 IEEE 6th International Symposium on the Internet of Sounds (IS2), 2025, pp. 1-10. DOI: 10.1109/IS264627.2025.11284580

  • S. Giacomelli, M. Giordano, and C. Rinaldi, "The OCON model: an Old but Gold Solution for Distributable Supervised Classification," in 2024 IEEE Symposium on Computers and Communications (ISCC), 2024, pp. 1-7. DOI: 10.1109/ISCC61673.2024.10733621

Selected Datasets & Toolkits

⚑ Currently Working On

  • Audio deep learning models for scene/event analysis.
  • Efficient convolutional and transformers-based neural networks for audio inference.
  • Dataset curation and benchmarking pipelines.
  • Artistic research in electroacoustic performance and composition (electroacoustic feedback systems).
  • Room Acoustics and features extraction.
  • Sound source separation and Semantic sound scene segmentation.

πŸ“Š Analytics

GitHub Stats Top Languages

πŸ“« Contacts

Pinned Loading

  1. audioset-tools audioset-tools Public

    This repository provides code and examples for "S. Giacomelli et al. - AudioSet-Tools: A Python Framework for Taxonomy-Aware AudioSet Curation and Reproducible Audio Research"

    Jupyter Notebook 2

  2. torch_amt torch_amt Public

    Differentiable, GPU-accelerated PyTorch implementations of Computational Auditory models from the MATLAB Auditory Modeling Toolbox (AMT).

    Python 1

  3. torch-audio-embeddings torch-audio-embeddings Public

    Torch-Audio-Embeddings (TAE) Official Repository

    Python 1

  4. e2panns e2panns Public

    E2PANNs official code repository

    Jupyter Notebook