
Add LoRA support for Cosmos Predict 2.5 and fix pipeline to match official Cosmos repo #13664

Open
terarachang wants to merge 15 commits into huggingface:main from terarachang:cosmos_predict_2.5_lora_clean

Conversation

@terarachang

What this PR does

Adds LoRA fine-tuning support for Cosmos Predict 2.5 (nvidia/Cosmos-Predict2.5-2B) and fixes the pipeline to match the official Cosmos reference implementation.

LoRA support

  • CosmosLoraLoaderMixin in src/diffusers/loaders/lora_pipeline.py for loading/saving LoRA weights on CosmosTransformer3DModel (usage sketch below)
  • Training script examples/cosmos/train_cosmos_predict25_lora.py using accelerate + peft
  • Inference script examples/cosmos/eval_cosmos_predict25_lora.py
  • Added CosmosLoraLoaderMixin to docs/source/en/api/loaders/lora.md
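A minimal usage sketch (not from the PR): loading trained LoRA weights through the new mixin. The output path and adapter name are placeholders; `DiffusionPipeline.from_pretrained` resolves the concrete Cosmos pipeline class from the checkpoint.

```python
import torch
from diffusers import DiffusionPipeline

# Checkpoint id comes from the PR description; the LoRA path is illustrative.
pipe = DiffusionPipeline.from_pretrained(
    "nvidia/Cosmos-Predict2.5-2B", torch_dtype=torch.bfloat16
)
pipe.load_lora_weights("path/to/lora_output", adapter_name="cosmos_lora")  # via CosmosLoraLoaderMixin
pipe.to("cuda")
```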

Fixes to match the official Cosmos repo

  • Fix conditional_frame_timestep scaling by timestep_scale=0.001 (sketched below, together with several other fixes)
  • Auto-cast AdaLN and the DiT final layer to fp32 for training
  • Deterministic VAE encode (take the mode of the latent distribution instead of sampling)
  • Flash Attention 2 as the default attention implementation for the text encoder
  • Seed-invariant noise via numpy sampling
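For illustration only, minimal sketches of several of these fixes; all variable names, shapes, and the module-name filter are assumptions, not the PR's exact code:

```python
import numpy as np
import torch
import torch.nn as nn
from diffusers.models.autoencoders.vae import DiagonalGaussianDistribution

# 1) Conditioning-frame timestep scaling: multiply by timestep_scale before
#    feeding the timestep to the transformer.
timestep_scale = 0.001
cond_frame_timestep = torch.full((1,), 1000.0)
scaled_timestep = cond_frame_timestep * timestep_scale  # -> 1.0

# 2) Keep AdaLN and the final layer in fp32 during training
#    (module names and the filter are hypothetical).
transformer = nn.ModuleDict({"adaln": nn.Linear(16, 32), "final_layer": nn.Linear(32, 16)})
for name, module in transformer.named_modules():
    if "adaln" in name or "final_layer" in name:
        module.to(torch.float32)

# 3) Deterministic VAE encode: take the mode of the latent distribution
#    instead of drawing a sample.
dist = DiagonalGaussianDistribution(torch.randn(1, 8, 4, 4))  # mean/logvar packed on dim 1
latents = dist.mode()  # rather than dist.sample()

# 4) Seed-invariant noise: sample via numpy so a fixed seed reproduces the
#    same noise regardless of torch version or device.
seed, latent_shape = 0, (1, 16, 8, 44, 80)
noise = torch.from_numpy(
    np.random.default_rng(seed).standard_normal(latent_shape).astype(np.float32)
)
```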

Test plan

  • All repository consistency checks pass (check_copies, check_dummies, check_support_list)

@yiyixuxu (Collaborator) left a comment:


thanks, i left a question

Outdated comment threads:
  • src/diffusers/models/transformers/transformer_cosmos.py
  • src/diffusers/pipelines/cosmos/pipeline_cosmos2_5_predict.py (4 threads)
@terarachang force-pushed the cosmos_predict_2.5_lora_clean branch from 0cc6351 to c8513c2 (May 5, 2026 19:25)

- device = sample.device
- sigma_t, sigma_s0 = self.sigmas[self.step_index + 1].to(device), self.sigmas[self.step_index].to(device)
+ sigma_t, sigma_s0 = self.sigmas[self.step_index + 1], self.sigmas[self.step_index]
Collaborator:

ohh I think the change here might not be intended, no?
It seems to have reverted https://github.com/huggingface/diffusers/pull/13489/changes


@yiyixuxu (Collaborator) left a comment:


thanks, looks good to me once we revert the change in the scheduler

@yiyixuxu requested a review from sayakpaul (May 5, 2026 23:06)

@sayakpaul (Member) left a comment:


I left some questions and suggestions. LMK if anything is unclear.

Context (docs/source/en/api/loaders/lora.md):

[[autodoc]] loaders.lora_pipeline.CosmosLoraLoaderMixin

## KandinskyLoraLoaderMixin
[[autodoc]] loaders.lora_pipeline.KandinskyLoraLoaderMixin
Member:

Figures should be hosted somewhere else.


@classmethod
@validate_hf_hub_args
def lora_state_dict(
Member:

We should be able to also use a "# Copied from ..." comment here?

# Copied from diffusers.loaders.lora_pipeline.StableDiffusionLoraLoaderMixin.lora_state_dict
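For context, a sketch of where the suggested marker would sit; the signature is abbreviated and the class scaffolding is illustrative:

```python
from diffusers.loaders.lora_base import LoraBaseMixin
from huggingface_hub.utils import validate_hf_hub_args


class CosmosLoraLoaderMixin(LoraBaseMixin):
    @classmethod
    @validate_hf_hub_args
    # Copied from diffusers.loaders.lora_pipeline.StableDiffusionLoraLoaderMixin.lora_state_dict
    def lora_state_dict(cls, pretrained_model_name_or_path_or_dict, **kwargs):
        ...
```

check_copies then enforces that the marked method body stays in sync with the referenced implementation.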

        else:
            return state_dict

    def load_lora_weights(
Member:

Same as above.

            safe_serialization=safe_serialization,
        )

    def fuse_lora(
Member:

Same as above.

Comment on lines +6096 to +6118:

        network_alphas = {}
        for k in list(state_dict.keys()):
            if "alpha" in k:
                alpha_value = state_dict.get(k)
                if (torch.is_tensor(alpha_value) and torch.is_floating_point(alpha_value)) or isinstance(
                    alpha_value, float
                ):
                    network_alphas[k] = state_dict.pop(k)
                else:
                    raise ValueError(
                        f"The alpha key ({k}) seems to be incorrect. If you think this error is unexpected, please open an issue."
                    )

        if return_alphas or return_lora_metadata:
            return cls._prepare_outputs(
                state_dict,
                metadata=metadata,
                alphas=network_alphas,
                return_alphas=return_alphas,
                return_metadata=return_lora_metadata,
            )
        else:
            return state_dict
Member:

Do we need this setup in cosmos?
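For context, a sketch (with made-up keys, shapes, and rank) of the PEFT-style state dict that loop filters: "alpha" entries are popped into network_alphas while the LoRA A/B weights stay in place.

```python
import torch

# Hypothetical PEFT-formatted LoRA state dict; rank 8 is arbitrary.
state_dict = {
    "transformer.blocks.0.attn.to_q.lora_A.weight": torch.randn(8, 1024),
    "transformer.blocks.0.attn.to_q.lora_B.weight": torch.randn(1024, 8),
    "transformer.blocks.0.attn.to_q.alpha": torch.tensor(8.0),
}

# Equivalent of the loop above: separate scaling factors from weights.
network_alphas = {k: state_dict.pop(k) for k in list(state_dict) if "alpha" in k}
print(network_alphas)  # {'transformer.blocks.0.attn.to_q.alpha': tensor(8.)}
```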

            components=components, lora_scale=lora_scale, safe_fusing=safe_fusing, adapter_names=adapter_names, **kwargs
        )

    def unfuse_lora(self, components: list[str] = ["transformer"], **kwargs):
Member:

Same as above.

@terarachang (Author):

Our implementation doesn't need it. But if users trained multiple LoRA adapters they may want to use this method. Would you suggest I remove unfuse_lora?
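For reference, a sketch of the multi-adapter flow the author describes; the checkpoint id comes from the PR description, while the LoRA path and adapter name are placeholders:

```python
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "nvidia/Cosmos-Predict2.5-2B", torch_dtype=torch.bfloat16
)
pipe.load_lora_weights("path/to/style_lora", adapter_name="style")  # placeholder path
pipe.fuse_lora(components=["transformer"], lora_scale=1.0)
# ... run inference with the fused weights ...
pipe.unfuse_lora(components=["transformer"])  # restore base weights before swapping adapters
```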

Member:

I am not suggesting to remove it. I am suggesting to supplement it with a "# Copied from ..." statement.


Labels

documentation (Improvements or additions to documentation), examples, loaders, lora, models, pipelines, schedulers, size/L (PR with diff > 200 LOC)


3 participants