Releases: theMoro/dcase25task4
Releases · theMoro/dcase25task4
Model Checkpoints
In this release, we upload checkpoints of our tagger and separator models. Specifically, we upload a checkpoint for our best sound event detection model and checkpoints from all models used in the ablation study of the separation model. See the README.md file of this repository on details of how you can load and evaluate these checkpoints.
Available checkpoints:
- m2d_sed.ckpt: Our sound event detection model
- audiosep_sed.ckpt: AudioSep-SED model using TimeFiLM and Embedding Injection
- audiosep_sed_no-timefilm.ckpt: AudioSep-SED model using only Embedding Injection, no TimeFiLM
- audiosep_sed_no-embedding-injection.ckpt: AudioSep-SED model using only TimeFiLM, no Embedding Injection
- audiosep_sed_s2sed-not-trainable.ckpt: AudioSep-SED model using TimeFiLM and Embedding Injection, but Stage 2 SED model is frozen
- audiosep_sed_dprnn.ckpt: AudioSep-SED model using TimeFiLM, Embedding Injection and DPRNN
- audiosep_sed_iter2.ckpt: AudioSep-SED model using TimeFiLM, Embedding Injection, DPRNN and iterative refinement with a maximum of 2 iterations during training
- audiosep_sed_iter3.ckpt: AudioSep-SED model using TimeFiLM, Embedding Injection, DPRNN and iterative refinement with a maximum of 3 iterations during training
- audiosep_sed_iter4.ckpt: AudioSep-SED model using TimeFiLM, Embedding Injection, DPRNN and iterative refinement with a maximum of 4 iterations during training