-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
Thanks for your job! I want to ask that in your paper, you depict that you concatenate the zt which is the noised latent after linear projection in EF-Net, but in the inference code, what you actually do is concatenate the latent_image_input, which is first + 0 + end. What are the actual settings in training, to concatenate the noisy ground truth image or only the start and end frame with 0 padding?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels