This is the official repository for Interspeech 2024 paper Text-aware Speech Separation for Multi-talker Keyword Spotting. The implementaion of the front-end model is based on ESPnet, which is currently available here in egs2/librimix/enh_kws1
. For the KWS backend, We directly apply the default setup of MDTC from WeKws examples/hey_snips/s0
.
I apologize that the email address of the primary author is wrong, which should be haoyu.li.cs@sjtu.edu.cn instead of haoyu.li@sjtu.edu.cn. Feel free to mail to me if you have any question!