Hi,
I found there are totally 116,405 unbalanced train videos for Musical instrument videos on AudioSet, which is much more than the amount of clips you mentioned in your paper. So, can you provide the video ids for those video for traning and testing in the experiment? Many thanks!