Replies: 2 comments
-
Try to replace the VGGish features with zeros. Note that this is neither supported nor documented. Also, check out the discussion in #34 – you might want to collaborate with the author of that issue.
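For reference, a minimal sketch of what that workaround could look like, assuming the single-video prediction script reads the audio features from a `.npy` file; the file path and the segment count below are placeholders, not values taken from the repo:

```python
import numpy as np

# VGGish yields one 128-dimensional embedding per ~0.96 s audio segment.
# Pick a segment count that roughly matches the video's duration;
# the value here is only illustrative.
num_segments = 31        # e.g. roughly a 30-second clip
vggish_dim = 128         # size of a single VGGish embedding

# All-zero "audio" features for a silent video.
zero_audio = np.zeros((num_segments, vggish_dim), dtype=np.float32)

# Save the zero features where the prediction script expects the VGGish
# feature file (the path below is a placeholder).
np.save('sample/my_video_vggish.npy', zero_audio)
```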
-
Thanks a lot for your prompt reply. I will give it a try based on your advice. And again, I am deeply impressed by your patient and kind help for everyone interested in this work. 👍 All the best!
-
Hi @v-iashin,
Thanks for sharing your wonderful work and the detailed instructions on its usage! I want to caption my own video with the provided pre-trained model, but the video has no audio track. So I wonder if I can directly follow the instructions in the "Single Video Prediction" section of the README. My concerns mainly lie in (1) the feature extraction module (VGGish): should I skip the VGGish features, or run the extraction anyway (an error may occur?) even though the video has no audio; and (2) can I use the pre-trained model directly, or do I have to retrain the model without audio information? (I just want to build a small application, and retraining is time-consuming.)
Thanks and best regards!