First, I checked the link and it is available; please check your network connection. Second, the 512-d xvector describes utterance-level speaker information rather than frame-level information.
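To make the utterance-level vs. frame-level point concrete: in the x-vector architecture as it is usually described, a statistics pooling layer collapses all frame-level activations of an utterance into a single vector (per-dimension mean and standard deviation), and the embedding is read from a layer after that pooling step. That is why one .wav file gives one 512-d vector instead of one vector per MFCC frame. Below is a minimal toy sketch of the pooling step only. It is not Kaldi's implementation; the function name `stats_pool` and the input values are made up for illustration, and the real network applies learned layers both before and after pooling.

```python
import math

def stats_pool(frames):
    """Collapse a list of frame-level vectors into one utterance-level vector:
    per-dimension mean followed by per-dimension (population) std."""
    n = len(frames)
    dim = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    stds = [
        math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / n)
        for d in range(dim)
    ]
    return means + stds

# Toy input: 4 frames of 3-dim features (hypothetical values, not real MFCCs).
frames = [[1.0, 2.0, 3.0], [3.0, 2.0, 1.0], [1.0, 2.0, 3.0], [3.0, 2.0, 1.0]]
pooled = stats_pool(frames)

# However many frames go in, a single fixed-size vector comes out.
print(len(frames), "frames ->", len(pooled), "values")
```

The output size depends only on the feature dimension, not on the number of frames, so a 26-minute file and a 3-second file would both yield one vector of the same length.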
Hi,
Thanks for your quick guide to ivector and xvector extraction. First of all, model_3000h is not available at this link: "https://pan.baidu.com/s/1kZHcOKQVkv_26emvG69IZg". So I tried to use the pretrained model available here http://www.kaldi-asr.org/models/m7 but got stuck on an error.
Second, I was able to extract an xvector from my .wav file and got a 512-dimensional vector. My question is: what does this 512-d vector describe? I was supposed to get xvectors in place of MFCC frames, right? My .wav file is 26 minutes long and comes from the AMI corpus.