PAM_P3S

Authors

Pierre Chouteau, Henri Desvallées, Louis Lalay, Mathilde Lefebvre, Ivan Meresman-Higgs, Théo Nguyen

Introduction

This project is part of the ATIAM master's programme. Its aim is to separate the instruments of a quintet as if each one had been recorded alone, even though they were in fact recorded playing together. This technique is called source separation. An effort was made to link signal processing with acoustics and sound recording in order to make better-informed choices, mainly when choosing and placing the microphones.

You can read our report SourceSeparation and hear some of our results on our website.

Abstract

Music Source Separation (MSS) is the process of separating individual audio signals from a mixed recording containing multiple sound sources, such as different musical instruments, vocals, and ambient noise. Its applications include remixing, transcription, and music recommendation. In the context of real acoustic recordings, the separation task is particularly challenging due to the complexity and variability of acoustic instruments and of recording conditions such as room acoustics and microphone directivity. We propose the use of Non-negative Matrix Factorization (NMF) algorithms for this task and, in our multi-channel setting, we aim to implement efficient, conditioned versions of this algorithm applied to musical recordings made in a known and controlled context. We investigate ways of informing the algorithm by conditioning it on temporal and spectral information from the instruments, recorded specifically for this purpose at the time of the session. To this end, we carried out a professional-level recording of a chamber music quintet.

We compared our results with other state-of-the-art algorithms, obtaining comparable results on benchmark datasets, and carried out a subjective evaluation according to the MUSHRA protocol, in which our algorithm performs well. We observe a strong effect of the processing of the recording, which helps or hinders the separation depending on the instrument. Our approach confirms the versatility of the FastMNMF algorithm and the possibility of extending such algorithms further.

Keywords: Music, Source separation, Non-Negative Matrix Factorization, Live Recording, Acoustics
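To make the informed-NMF idea concrete, here is a minimal, single-channel sketch in NumPy. It is only an illustration of the general NMF principle (Euclidean cost, multiplicative updates), not the multichannel FastMNMF implementation used in this project; the spectral templates passed as `W_init` are a hypothetical stand-in for the per-instrument information mentioned above.

```python
import numpy as np

def nmf(V, K, n_iter=200, W_init=None, eps=1e-10):
    """Factorize a non-negative spectrogram V (freq x time) as V ~ W @ H.

    W_init may hold spectral templates taken from isolated instrument
    excerpts, which "informs" the factorization (illustrative only).
    """
    F, T = V.shape
    rng = np.random.default_rng(0)
    W = np.abs(W_init).astype(float) if W_init is not None else rng.random((F, K)) + eps
    H = rng.random((K, T)) + eps
    for _ in range(n_iter):
        # Multiplicative updates for the Euclidean (Frobenius) cost.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ (H @ H.T) + eps)
    return W, H

# Toy example: factorize a random magnitude "spectrogram" into 5 components
# (e.g. one per instrument of the quintet).
V = np.random.default_rng(1).random((513, 400))
W, H = nmf(V, K=5)
print(W.shape, H.shape)  # (513, 5) (5, 400)
```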

Install

To run our scripts, you will need a dedicated environment, which requires miniconda (or anaconda). If you do not already have it, you can install it from the official website.

  • Clone the GitHub repository:
git clone https://github.com/PierreChouteau/PAM_P3S.git
  • Create a virtual env with python 3.10.8:
conda create -n P3S python=3.10.8
  • Activate the environment:
conda activate P3S
  • Go into the repository and install the dependencies:
cd PAM_P3S
pip install -r requirement.txt
  • Install the src package:
pip install -e .
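  • (Optional) Check that the installation succeeded. This assumes the editable install above exposes a Python package named src, as the previous step suggests:
python -c "import src"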