[Feature] Add Cosmos2 i2v pipeline #837
base: main
Conversation
_class_name: str = "AutoencoderKLWan"
_diffusers_version: str = "0.34.0.dev0"
_name_or_path: str = ""
These fields should be popped/removed in the loader, so they can be removed here. You can refer to how Wan's VAE config is defined.
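A minimal sketch of the loader-side cleanup being suggested, assuming a hypothetical `strip_hub_metadata` helper; the key names come from the diff above, everything else is illustrative:

```python
# Hypothetical helper: drop Hugging Face hub metadata keys from the raw
# JSON config before mapping it onto the arch config dataclass, so fields
# like _class_name never need to be declared on the config itself.
HUB_METADATA_KEYS = ("_class_name", "_diffusers_version", "_name_or_path")

def strip_hub_metadata(raw_config: dict) -> dict:
    cleaned = dict(raw_config)
    for key in HUB_METADATA_KEYS:
        cleaned.pop(key, None)  # ignore keys that are absent
    return cleaned
```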
def __post_init__(self):
    self.blend_num_frames = (self.tile_sample_min_num_frames -
                             self.tile_sample_stride_num_frames) * 2
(No newline at end of file)
Missing newline char at end of file.
@dataclass
class CosmosVideoConfigFixed(CosmosVideoConfig):
Why is this needed?
class CosmosVideoConfigFixed(CosmosVideoConfig):
    """Fixed Cosmos Video Config that matches original Cosmos2 Video2World configuration."""

    def update_model_arch(self, config: dict) -> None:
Did you align this against diffusers or the original Cosmos implementation?
fastvideo/configs/sample/cosmos.py
Outdated
@dataclass
class CosmosTeaCacheParams(CacheParams):
This can be removed for now, but you should add defaults for the Cosmos sampling params. Refer to Wan's for an example.
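A rough sketch of what a Cosmos sampling-params default could look like, loosely following the Wan pattern mentioned above; the class name and numeric defaults are placeholders, not values taken from the PR:

```python
from dataclasses import dataclass

@dataclass
class Cosmos2SamplingParams:
    # Placeholder defaults; the real values should come from the
    # Cosmos2 Video2World reference configuration.
    num_inference_steps: int = 35
    guidance_scale: float = 7.0
    num_frames: int = 93
    height: int = 704
    width: int = 1280
    negative_prompt: str = ""
```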
fastvideo/layers/layernorm.py
Outdated
if self.has_weight:
    self.weight = nn.Parameter(self.weight)

def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
Can you rename this to forward_diffusers?
Force-pushed from ca83c7e to 8b89aff
return x.flatten(-2)

def apply_rotary_emb(
Maybe we should consider making this cosmos2 specific?
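One possible shape for a cosmos2-scoped variant, kept next to the Cosmos2 transformer instead of in a shared layers module; the function name is hypothetical, and the rotation convention here is the standard interleaved real-valued form, which may differ from what the PR actually uses:

```python
import torch

def apply_rotary_emb_cosmos2(x: torch.Tensor, freqs_cos: torch.Tensor,
                             freqs_sin: torch.Tensor) -> torch.Tensor:
    # Rotate interleaved (even, odd) channel pairs: standard real-valued RoPE.
    x_even, x_odd = x.unflatten(-1, (-1, 2)).unbind(-1)
    rotated = torch.stack((-x_odd, x_even), dim=-1).flatten(-2)
    return x * freqs_cos + rotated * freqs_sin
```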
return imgs

def get_timestep_embedding(
Maybe also make this architecture-specific.
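Similarly, a sketch of a cosmos2-scoped timestep embedding; this is the standard sinusoidal formulation, and the name, sin/cos ordering, and scaling are assumptions that would need to match the checkpoint:

```python
import math
import torch

def get_timestep_embedding_cosmos2(timesteps: torch.Tensor,
                                   embedding_dim: int,
                                   max_period: float = 10000.0) -> torch.Tensor:
    # Standard sinusoidal embedding: half the channels are cos, half are sin.
    half_dim = embedding_dim // 2
    exponent = -math.log(max_period) * torch.arange(
        half_dim, dtype=torch.float32, device=timesteps.device) / half_dim
    freqs = torch.exp(exponent)
    args = timesteps.float()[:, None] * freqs[None, :]
    return torch.cat([torch.cos(args), torch.sin(args)], dim=-1)
```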
bs, seq_len, _ = hidden_states.shape
num_seqs = bs
n, c = self.n_heads, self.d_model // self.total_num_heads
#n, c = self.n_heads, self.d_model // self.total_num_heads
delete this
qkv, _ = self.qkv_proj(hidden_states)
# Projection of 'own' hidden state (self-attention). No GQA here.
q, k, v = qkv.split(self.inner_dim, dim=-1)
#q, k, v = qkv.split(self.inner_dim, dim=-1)
delete this
timesteps_array = np.linspace(self._sigma_to_t(self.sigma_max),
                              self._sigma_to_t(self.sigma_min),
                              num_inference_steps)
t_max = self._sigma_to_t(self.sigma_max)
maybe make a separate version of this scheduler
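A small sketch of how the timestep grid from the diff could live in a Cosmos-specific scheduler's set_timesteps instead of being patched into the shared scheduler; the helper name and signature are illustrative only:

```python
from typing import Callable
import numpy as np

def cosmos_timestep_grid(sigma_to_t: Callable[[float], float],
                         sigma_max: float, sigma_min: float,
                         num_inference_steps: int) -> np.ndarray:
    # Mirrors the diff above: linearly spaced timesteps between t(sigma_max)
    # and t(sigma_min), intended for a separate Cosmos-specific scheduler
    # subclass rather than a change to the shared class.
    t_max = sigma_to_t(sigma_max)
    t_min = sigma_to_t(sigma_min)
    return np.linspace(t_max, t_min, num_inference_steps)
```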
# Peiyuan: using GPU seed will cause A100 and H100 to generate different results...
batch.generator = [
    torch.Generator("cpu").manual_seed(seed) for seed in seeds
    torch.Generator(device="cpu").manual_seed(seed) for seed in seeds
revert please
maybe also separate these
make arch specific
test both
self.world_size = self.fastvideo_args.num_gpus
self.shutting_down = False

# Initialize CUDA before setting up multiprocessing to ensure
remove
Force-pushed from 275f0b2 to 1b7bfe5
No description provided.