Releases · theMoro/dcase25task4

In this release, we upload checkpoints of our tagger and separator models. Specifically, we upload a checkpoint for our best sound event detection model and checkpoints from all models used in the ablation study of the separation model. See the README.md file of this repository on details of how you can load and evaluate these checkpoints.

Available checkpoints:

m2d_sed.ckpt: Our sound event detection model
audiosep_sed.ckpt: AudioSep-SED model using TimeFiLM and Embedding Injection
audiosep_sed_no-timefilm.ckpt: AudioSep-SED model using only Embedding Injection, no TimeFiLM
audiosep_sed_no-embedding-injection.ckpt: AudioSep-SED model using only TimeFiLM, no Embedding Injection
audiosep_sed_s2sed-not-trainable.ckpt: AudioSep-SED model using TimeFiLM and Embedding Injection, but Stage 2 SED model is frozen
audiosep_sed_dprnn.ckpt: AudioSep-SED model using TimeFiLM, Embedding Injection and DPRNN
audiosep_sed_iter2.ckpt: AudioSep-SED model using TimeFiLM, Embedding Injection, DPRNN and iterative refinement with a maximum of 2 iterations during training
audiosep_sed_iter3.ckpt: AudioSep-SED model using TimeFiLM, Embedding Injection, DPRNN and iterative refinement with a maximum of 3 iterations during training
audiosep_sed_iter4.ckpt: AudioSep-SED model using TimeFiLM, Embedding Injection, DPRNN and iterative refinement with a maximum of 4 iterations during training

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Releases: theMoro/dcase25task4

Model Checkpoints

Uh oh!