🎨 Stable Diffusion Pipeline

A hands-on implementation of Stable Diffusion v1.4 inference with custom DDIM sampling, classifier-free guidance, and inpainting — built piece by piece to understand what's actually happening under the hood.

🖼️ Inpainting Result

Full walkthrough in in-painting.ipynb

Mask-based latent blending: the original image is preserved outside the mask, and new content is diffused inside it.

🚀 What This Project Does

Generate images from text prompts using a custom-built sampling pipeline, with working inpainting on top.

🛠️ What I Built From Scratch

DDIM Sampler — complete noise scheduling and denoising loop
Classifier-Free Guidance — custom conditional/unconditional steering
VAE Interface — latent encoding/decoding with proper scaling
CLIP Text Pipeline — tokenization and embedding extraction
Inpainting Logic — mask-based latent blending (in-painting.ipynb)

🤝 What's Integrated

UNet Backbone — UNet2DConditionModel from 🤗 Diffusers (pre-trained weights)

💡 Why This Approach

Initial work focused on injecting weights into a fully custom UNet architecture. 684/686 layers loaded successfully, but architectural mismatches (GEGLU vs GELU activations, upsampling order) prevented coherent outputs. Rather than paper over the issue, the pragmatic call was to use the proven Diffusers UNet as a stable backbone while keeping every other component custom — quality without sacrificing what was learned.

See stable-diffusion.ipynb for that experiment.

🏗️ Architecture

Inference Loop

Custom UNet vs. Diffusers UNet

Prompt: "an astronaut riding a horse" — 35 steps each

My Custom UNet (weight injection attempt)	Diffusers UNet (final pipeline)

Garbled / incoherent output	Coherent, prompt-following output

📦 Features

✅ Text-to-image generation
✅ Configurable steps and guidance scale
✅ Custom DDIM sampling loop
✅ Inpainting with custom masks (in-painting.ipynb)

🔧 Usage

python inference.py -c "your prompt" -s 50 -g 7.5
```<img width="512" height="512" alt="__results___9_12" src="https://github.com/user-attachments/assets/c5168e58-7eee-426e-99fd-4c7c80b13542" />

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
outputs		outputs
src		src
.gitignore		.gitignore
.python-version		.python-version
CompVis.ipynb		CompVis.ipynb
README.md		README.md
config.py		config.py
config.yaml		config.yaml
in-painting.ipynb		in-painting.ipynb
inference.py		inference.py
modules.ipynb		modules.ipynb
pyproject.toml		pyproject.toml
stable-diffusion.ipynb		stable-diffusion.ipynb
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎨 Stable Diffusion Pipeline

🖼️ Inpainting Result

🚀 What This Project Does

🛠️ What I Built From Scratch

🤝 What's Integrated

💡 Why This Approach

🏗️ Architecture

Inference Loop

Custom UNet vs. Diffusers UNet

📦 Features

🔧 Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎨 Stable Diffusion Pipeline

🖼️ Inpainting Result

🚀 What This Project Does

🛠️ What I Built From Scratch

🤝 What's Integrated

💡 Why This Approach

🏗️ Architecture

Inference Loop

Custom UNet vs. Diffusers UNet

📦 Features

🔧 Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages