Allow Wan21_HuMo extra_conds to pass concat_latent_image #11567
+15
−14
This is the same code change as this older PR. I am resubmitting it after rebasing the changes onto a more recent version of ComfyUI.
To re-summarize: this small change lets concat_latent_image pass through extra_conds in the HuMo model code so that the model can be used for I2V. With the change in place, I2V can be done in base ComfyUI by chaining a WanImageToVideo node with a WanHuMoImageToVideo node.
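To illustrate the idea of the pass-through, here is a minimal, self-contained sketch. It is not the ComfyUI code and not the actual diff: the class names BaseVideoModelSketch and HuMoSketch, the "prompt" key, and the plain-list values are all hypothetical stand-ins. Only the names extra_conds, concat_latent_image, and the general notion of forwarding an optional start-image latent come from the description above.

```python
from typing import Any, Dict


class BaseVideoModelSketch:
    """Hypothetical stand-in for a parent model class (not ComfyUI code)."""

    def extra_conds(self, **kwargs: Any) -> Dict[str, Any]:
        # Collect the conditioning entries the parent knows about.
        out: Dict[str, Any] = {}
        prompt = kwargs.get("prompt")
        if prompt is not None:
            out["prompt"] = prompt
        return out


class HuMoSketch(BaseVideoModelSketch):
    """Hypothetical subclass that adds audio conditioning."""

    def extra_conds(self, **kwargs: Any) -> Dict[str, Any]:
        out = super().extra_conds(**kwargs)

        audio_embed = kwargs.get("audio_embed")
        if audio_embed is not None:
            out["audio_embed"] = audio_embed

        # The pass-through: forward the start-image latent when an upstream
        # node supplied one, and leave the dict untouched otherwise.
        concat_latent_image = kwargs.get("concat_latent_image")
        if concat_latent_image is not None:
            out["concat_latent_image"] = concat_latent_image
        return out


# Usage: the start-image latent only shows up when it was provided.
with_start = HuMoSketch().extra_conds(audio_embed=[0.1], concat_latent_image=[[1.0]])
without_start = HuMoSketch().extra_conds(audio_embed=[0.1])
```

Because the entry is only forwarded when it is present, a workflow that never supplies a start image would behave the same as before in this sketch.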
Even though the HuMo model was not designed for direct I2V, it still works passably, and there are many circumstances in which it is useful. Because the WanHuMoImageToVideo node has no input for a start image, only users who already know about the chaining technique are likely to discover this capability. I have already developed several useful workflows that take advantage of this technique.
Attached is a very simple HuMo I2V workflow that works only when this change is in place. I tested this workflow on a fresh ComfyUI installation with no custom nodes and only this code change applied. It performs a simple I2V HuMo generation from a start image, then performs one continuation with a second sampler, using the last frame of the first generation as the start image for the second. There is definitely a contrast artifact in the second generation; this is a known limitation of HuMo I2V.
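The continuation pattern described above (generate a clip, then reuse its last frame as the next start image) can be sketched in a few lines. This is only an illustration of the control flow, not the attached workflow: chain_generations and fake_generate are hypothetical names, and frames are represented as plain integers rather than images.

```python
from typing import Callable, List


def chain_generations(
    generate: Callable[[int], List[int]],
    start_image: int,
    segments: int,
) -> List[int]:
    """Run `generate` repeatedly, seeding each run with the previous last frame."""
    video: List[int] = []
    current = start_image
    for _ in range(segments):
        clip = generate(current)  # one I2V generation from the current start image
        video.extend(clip)
        current = clip[-1]        # last frame becomes the next start image
    return video


def fake_generate(start: int) -> List[int]:
    # Hypothetical stand-in for a sampler: a 3-frame clip beginning at the start image.
    return [start, start + 1, start + 2]


# One initial generation plus one continuation, as in the attached workflow.
frames = chain_generations(fake_generate, start_image=0, segments=2)
```

In this sketch the boundary frame appears twice, once as the end of the first clip and once as the start of the second; whether to drop the duplicate is a choice left to the workflow.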
ComfyUI_00023_.mp4
Workflow: droz_HuMoImageToVideo_I2VBaseComfyUI_v1.json
Start Image:

Audio source: https://commons.wikimedia.org/wiki/File:Found-von_Goethe.ogg