Inference Speed #2

rishikksh20 · 2023-10-09T09:42:23Z

Hi @cantabile-kwok ,
I have also implemented UniCATS's vec2wav but that model is too slow, so I am curious to know the inference speed of this model. Actually, I am interested in integrating CTX-vec2wav with GPT-based AR txt2vec to create a fast prompt-based TTS model.

Also, do you have any plan to release CTX-txt2vec model anytime soon?
Thanks

cantabile-kwok · 2023-10-09T10:13:21Z

Hi, thanks for the interest!

From a previous log on GPU, the following speed was reported:

11.94it/s, RTF=0.0106

May I know the speed of your implemented model if it is too slow? For the above speed, I think that should be OK in regular cases.

The CTX-text2vec is a bit harder to open-source, but we will get our hands on it soon (probably will finish in this month, but I can't be 100% certain). Please stay tuned if you are interested : )

cantabile-kwok · 2023-11-21T07:06:37Z

@rishikksh20 Hi Rishikesh, for your information, the acoustic model part (text2vec) is now finished at https://github.com/cantabile-kwok/UniCATS-CTX-text2vec.

rishikksh20 · 2023-11-21T09:37:03Z

Thanks @cantabile-kwok I already following that repo. Will check end to end training

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inference Speed #2

Inference Speed #2

rishikksh20 commented Oct 9, 2023

cantabile-kwok commented Oct 9, 2023

cantabile-kwok commented Nov 21, 2023

rishikksh20 commented Nov 21, 2023

Inference Speed #2

Inference Speed #2

Comments

rishikksh20 commented Oct 9, 2023

cantabile-kwok commented Oct 9, 2023

cantabile-kwok commented Nov 21, 2023

rishikksh20 commented Nov 21, 2023