Add StableDiffusion3InstructPix2PixPipeline #11378

xduzhangjiayu · 2025-04-22T00:12:30Z

What does this PR do?

Add StableDiffusion3InstructPix2PixPipeline
Would you like to give a review? Many thanks~ @yiyixuxu @asomoza

asomoza · 2025-04-22T12:36:02Z

Hi and thanks for your contribution. Is there a model for this pipeline so I can test it?

xduzhangjiayu · 2025-04-22T15:38:10Z

Hi, thanks for the reply!
You can use the model I trained myself from https://huggingface.co/CaptainZZZ/sd3-instructpix2pix/tree/main; you only need to replace the original transformer in the official SD3 with it. I have already tested it and the result is reasonable. Or, for better performance, you can refer to another powerful model from https://huggingface.co/BleachNick/SD3_UltraEdit_freeform/tree/main/transformer
@asomoza
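
Editor's sketch of the "replace the original transformer" step described above. It is a minimal illustration, not the author's code: the helper name `load_with_swapped_transformer` is hypothetical, and it relies only on the general diffusers convention that `from_pretrained` accepts already-loaded components as keyword arguments. Whether the fine-tuned repository needs extra arguments such as a `subfolder` is not stated in the thread, so anything of that kind would have to go through `load_kwargs`.

```python
def load_with_swapped_transformer(
    pipeline_cls, transformer_cls, base_repo, transformer_repo, **load_kwargs
):
    """Load `base_repo`, but use the transformer weights from `transformer_repo`.

    Hypothetical helper: both classes are passed in so that the function
    itself does not import diffusers or torch.
    """
    # Load only the fine-tuned transformer first.
    transformer = transformer_cls.from_pretrained(transformer_repo, **load_kwargs)
    # A component passed as a keyword argument takes the place of the one
    # stored in the base repository.
    return pipeline_cls.from_pretrained(
        base_repo, transformer=transformer, **load_kwargs
    )
```

In practice `pipeline_cls` would be the pipeline class from this PR, `transformer_cls` would be `SD3Transformer2DModel` from diffusers, `base_repo` the official SD3 checkpoint, and `transformer_repo` something like `CaptainZZZ/sd3-instructpix2pix`.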

asomoza · 2025-04-23T22:32:30Z

I ran a test but got a bad result:

import torch

from diffusers.pipelines.stable_diffusion_3.pipeline_stable_diffusion_3_instruct_pix2pix import (
    StableDiffusion3InstructPix2PixPipeline,
)
from diffusers.utils import load_image


resolution = 1024
image = load_image("https://hf.co/datasets/diffusers/diffusers-images-docs/resolve/main/mountain.png").resize(
    (resolution, resolution)
)
edit_instruction = "Turn sky into a cloudy one"

pipe = StableDiffusion3InstructPix2PixPipeline.from_pretrained(
    "BleachNick/SD3_UltraEdit_freeform", torch_dtype=torch.float16
)

pipe.enable_model_cpu_offload()

edited_image = pipe(
    prompt=edit_instruction,
    image=image,
    height=resolution,
    width=resolution,
    guidance_scale=7.5,
    image_guidance_scale=1.5,
    num_inference_steps=30,
).images[0]

edited_image.save("edited_image.png")

and I get this image:

I also tried swapping the transformer model for yours but got the same result.

I don't have the time to look into this right now; can you solve the issue? Also, ideally, if this works, you will need to add the corresponding doc page. To be able to load it like the example in the docstring (from diffusers import StableDiffusion3InstructPix2PixPipeline), you will also need to add it to the __init__.py of the pipelines and to the main diffusers one.

But still, the priority here should be to make it work and to demo an example with it.
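
Editor's sketch: one quick way to confirm that the docstring-style import works once the class has been added to the `__init__.py` files. The helper `is_exported` is hypothetical and uses only the standard library; it approximates `from module import name` by importing the module and checking for the attribute.

```python
import importlib


def is_exported(module_name, attribute):
    """Return True if the module can be imported and exposes `attribute`."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # The package itself is missing or fails to import.
        return False
    return hasattr(module, attribute)
```

For this PR the check would be `is_exported("diffusers", "StableDiffusion3InstructPix2PixPipeline")`, which should only return True after the class is registered at the top level.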

xduzhangjiayu · 2025-04-24T01:31:10Z

Hi @asomoza
Sorry I didn't mention before that the model was trained on 512×512 images, so it is better to use 512×512 as the input.
I changed the following code and used my transformer model:

resolution = 512
image = load_image("https://hf.co/datasets/diffusers/diffusers-images-docs/resolve/main/mountain.png").resize(
    (resolution, resolution)
)
edit_instruction = "Turn sky into a sunny one"

And I got the image below:

And could you please tell me where I should add the doc page? Thanks~
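
Editor's sketch of the resolution fix discussed above: resize the input to the resolution the model was trained on and pass the same value as `height` and `width`. The helper name `prepare_edit_inputs` is hypothetical; the guidance and step values are simply the ones used earlier in this thread, not recommended settings.

```python
TRAINED_RESOLUTION = 512  # training resolution stated in this thread


def prepare_edit_inputs(image, instruction, resolution=TRAINED_RESOLUTION):
    """Resize `image` to a square and build matching pipeline call kwargs.

    `image` only needs a PIL-style `resize((width, height))` method.
    """
    resized = image.resize((resolution, resolution))
    return {
        "prompt": instruction,
        "image": resized,
        # Keep the requested output size in sync with the resized input.
        "height": resolution,
        "width": resolution,
        # Values copied from the test script earlier in the thread.
        "guidance_scale": 7.5,
        "image_guidance_scale": 1.5,
        "num_inference_steps": 30,
    }
```

The result can be unpacked into the pipeline call, for example `pipe(**prepare_edit_inputs(image, "Turn sky into a sunny one")).images[0]`.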

asomoza · 2025-04-24T15:21:58Z

@xduzhangjiayu the code I used was the one that's in the docstring of the pipeline, so probably it's better to change it there.

> And could you please tell me where I should add the doc page?

The docs live inside docs/source/en; you can learn from other implementations like this one.

But looking at the quality of the model and the work involved, I would recommend moving this pipeline to the community examples. It would be easier to do, and we can move it to core later if it gets popular; this way you don't have to write docs and we can merge it faster.

xduzhangjiayu · 2025-04-29T07:05:10Z

Hi @asomoza
I agree, and I have already moved the pipeline to community. What else do I need to write for this PR? Thanks~

asomoza · 2025-04-29T07:41:15Z

@xduzhangjiayu thanks, can you please add a small description and a functional snippet of code to run it in the README file?
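
Editor's sketch of how a community pipeline is typically loaded, as background for the README snippet requested above. `DiffusionPipeline.from_pretrained` accepts a `custom_pipeline` argument; the file name used as the default below is an assumption about what this PR adds under examples/community, and the helper `load_community_pipeline` is hypothetical.

```python
# Assumed name of the community pipeline file (without ".py"); not confirmed
# anywhere in this thread.
COMMUNITY_PIPELINE = "pipeline_stable_diffusion_3_instruct_pix2pix"


def load_community_pipeline(
    base_repo, custom_pipeline=COMMUNITY_PIPELINE, loader=None, **kwargs
):
    """Load `base_repo` using a community pipeline implementation.

    `loader` defaults to `DiffusionPipeline.from_pretrained`; it is
    injectable so the function can be exercised without diffusers installed.
    """
    if loader is None:
        # Imported lazily so that importing this module stays cheap.
        from diffusers import DiffusionPipeline

        loader = DiffusionPipeline.from_pretrained
    return loader(base_repo, custom_pipeline=custom_pipeline, **kwargs)
```

A call such as `load_community_pipeline("BleachNick/SD3_UltraEdit_freeform", torch_dtype=torch.float16)` would mirror the model used in the test script earlier in the thread.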

xduzhangjiayu · 2025-04-29T09:27:28Z

Hi @asomoza
Done! Please check~

asomoza · 2025-04-29T11:05:19Z

We don't use the same directory for hosting images, so I took the liberty of uploading your images to the Hub here:

https://huggingface.co/datasets/diffusers/docs-images/resolve/main/StableDiffusion3InstructPix2Pix/mountain.png
https://huggingface.co/datasets/diffusers/docs-images/resolve/main/StableDiffusion3InstructPix2Pix/edited.png

can you delete the images from this PR and use these links instead please?

xduzhangjiayu · 2025-04-29T12:38:16Z

@asomoza Done

HuggingFaceDocBuilderDev · 2025-04-29T12:55:50Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

asomoza

thanks, just some minor suggestions

asomoza · 2025-04-29T13:25:49Z

examples/community/README.md

@@ -86,6 +86,7 @@ PIXART-α Controlnet pipeline | Implementation of the controlnet model for pixar
 | Perturbed-Attention Guidance |StableDiffusionPAGPipeline is a modification of StableDiffusionPipeline to support Perturbed-Attention Guidance (PAG).|[Perturbed-Attention Guidance](#perturbed-attention-guidance)|[Notebook](https://github.com/huggingface/notebooks/blob/main/diffusers/perturbed_attention_guidance.ipynb)|[Hyoungwon Cho](https://github.com/HyoungwonCho)|
 | CogVideoX DDIM Inversion Pipeline | Implementation of DDIM inversion and guided attention-based editing denoising process on CogVideoX. | [CogVideoX DDIM Inversion Pipeline](#cogvideox-ddim-inversion-pipeline) | - | [LittleNyima](https://github.com/LittleNyima) |
 | FaithDiff Stable Diffusion XL Pipeline | Implementation of [(CVPR 2025) FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolutionUnleashing Diffusion Priors for Faithful Image Super-resolution](https://arxiv.org/abs/2411.18824) - FaithDiff is a faithful image super-resolution method that leverages latent diffusion models by actively adapting the diffusion prior and jointly fine-tuning its components (encoder and diffusion model) with an alignment module to ensure high fidelity and structural consistency. | [FaithDiff Stable Diffusion XL Pipeline](#faithdiff-stable-diffusion-xl-pipeline) | [![Hugging Face Models](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Models-blue)](https://huggingface.co/jychen9811/FaithDiff) | [Junyang Chen, Jinshan Pan, Jiangxin Dong, IMAG Lab, (Adapted by Eliseu Silva)](https://github.com/JyChen9811/FaithDiff) |
+| Stable Diffusion 3 InstructPix2Pix Pipeline | Implementation of Stable Diffusion 3 InstructPix2Pix Pipeline | [Stable Diffusion 3 InstructPix2Pix Pipeline](#stable-diffusion-3-instructpix2pix-pipeline) | [![Hugging Face Models]()](https://huggingface.co/BleachNick/SD3_UltraEdit_freeform) [![Hugging Face Models]()](https://huggingface.co/CaptainZZZ/sd3-instructpix2pix) | [Jiayu Zhang](https://github.com/xduzhangjiayu) and [Haozhe Zhao](https://github.com/HaozheZhao)|


Suggested change

-| Stable Diffusion 3 InstructPix2Pix Pipeline | Implementation of Stable Diffusion 3 InstructPix2Pix Pipeline | [Stable Diffusion 3 InstructPix2Pix Pipeline](#stable-diffusion-3-instructpix2pix-pipeline) | [![Hugging Face Models]()](https://huggingface.co/BleachNick/SD3_UltraEdit_freeform) [![Hugging Face Models]()](https://huggingface.co/CaptainZZZ/sd3-instructpix2pix) | [Jiayu Zhang](https://github.com/xduzhangjiayu) and [Haozhe Zhao](https://github.com/HaozheZhao)|
+| Stable Diffusion 3 InstructPix2Pix Pipeline | Implementation of Stable Diffusion 3 InstructPix2Pix Pipeline | [Stable Diffusion 3 InstructPix2Pix Pipeline](#stable-diffusion-3-instructpix2pix-pipeline) | [![Hugging Face Models](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Models-blue)](https://huggingface.co/BleachNick/SD3_UltraEdit_freeform) [![Hugging Face Models](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Models-blue)](https://huggingface.co/CaptainZZZ/sd3-instructpix2pix) | [Jiayu Zhang](https://github.com/xduzhangjiayu) and [Haozhe Zhao](https://github.com/HaozheZhao)|

asomoza · 2025-04-29T13:34:12Z

examples/community/README.md

+![Original image](https://huggingface.co/datasets/diffusers/docs-images/resolve/main/StableDiffusion3InstructPix2Pix/mountain.png)
+![Edited image](https://huggingface.co/datasets/diffusers/docs-images/resolve/main/StableDiffusion3InstructPix2Pix/edited.png)


Suggested change

-### Result
-![Original image](https://huggingface.co/datasets/diffusers/docs-images/resolve/main/StableDiffusion3InstructPix2Pix/mountain.png)
-![Edited image](https://huggingface.co/datasets/diffusers/docs-images/resolve/main/StableDiffusion3InstructPix2Pix/edited.png)
+|Original|Edited|
+|---|---|
+|![Original image](https://huggingface.co/datasets/diffusers/docs-images/resolve/main/StableDiffusion3InstructPix2Pix/mountain.png)|![Edited image](https://huggingface.co/datasets/diffusers/docs-images/resolve/main/StableDiffusion3InstructPix2Pix/edited.png)


xduzhangjiayu · 2025-04-30T02:07:21Z

@asomoza OK, done

asomoza · 2025-04-30T07:24:18Z

@bot /style

github-actions · 2025-04-30T07:25:16Z

Style fixes have been applied. View the workflow run here.

asomoza · 2025-04-30T10:12:56Z

thanks!

xduzhangjiayu and others added 4 commits April 20, 2025 18:50

upload StableDiffusion3InstructPix2PixPipeline

f90adbb

Merge branch 'huggingface:main' into sd3_p2p

94be186

Merge branch 'main' into sd3_p2p

296a86e

Merge branch 'main' into sd3_p2p

07fd208

xduzhangjiayu added 2 commits April 23, 2025 08:35

Merge branch 'main' into sd3_p2p

7b5bffa

Merge branch 'main' into sd3_p2p

d5a7542

Merge branch 'main' into sd3_p2p

8adeab8

xduzhangjiayu and others added 2 commits April 29, 2025 14:45

Merge branch 'main' into sd3_p2p

d6f72d8

Move to community

055d3bb

xduzhangjiayu added 3 commits April 29, 2025 17:00

Add readme

5068384

Fix images

912027c

remove images

194082e

Change image url

d9be254

Merge branch 'main' into sd3_p2p

8ca42e7

asomoza reviewed Apr 29, 2025

xduzhangjiayu added 2 commits April 30, 2025 09:56

fix

ec55dbe

Merge branch 'sd3_p2p' of https://github.com/xduzhangjiayu/diffusers into sd3_p2p

e67c57f

Merge branch 'main' into sd3_p2p

0df2130

Apply style fixes

1537115

asomoza and others added 3 commits April 30, 2025 03:38

Merge branch 'main' into sd3_p2p

9735713

Merge branch 'main' into sd3_p2p

95b2af1

Merge branch 'main' into sd3_p2p

c8dc6f1

asomoza approved these changes Apr 30, 2025

asomoza merged commit 8cd7426 into huggingface:main Apr 30, 2025
8 of 9 checks passed
