Refactor: Flexible model architecture for dit models (Flux & SD3) #490

stduhpf · 2024-11-29T01:30:10Z

(Built on top of #455)

Motivations:

I stumbled upon this: https://huggingface.co/TencentARC/flux-mini, and thought it would be nice to run it on sdcpp.

The number of variants Flux and MMDiT (SD3.x) models supported is starting to get a bit overwhelming (3 each so far), and if people keep making these kinds of distillations or self merges, it would be impossible to support them all individually.

With this PR, the number of layers for each kind of block and the presence of some optional features is inferred from the tensor names in the model file when initializing the model runner, making it a lot more flexible.

New models supported with these chages (examples):

Flux Mini 3.2B (sucks at text): https://huggingface.co/TencentARC/flux-mini,
Flux Heavy 17B: https://huggingface.co/city96/Flux.1-Heavy-17B

leejet · 2024-11-30T06:19:17Z

Thank you for your contribution.

…#490) * Refactor: wtype per tensor * Fix default args * refactor: fix flux * Refactor photmaker v2 support * unet: refactor the refactoring * Refactor: fix controlnet and tae * refactor: upscaler * Refactor: fix runtime type override * upscaler: use fp16 again * Refactor: Flexible sd3 arch * Refactor: Flexible Flux arch * format code --------- Co-authored-by: leejet <[email protected]>

stduhpf added 11 commits November 25, 2024 12:57

Refactor: wtype per tensor

6cbcbe0

Fix default args

ee674a5

refactor: fix flux

b465f13

Refactor photmaker v2 support

cb46146

unet: refactor the refactoring

371d81f

Refactor: fix controlnet and tae

04ca926

refactor: upscaler

38f5685

Refactor: fix runtime type override

170663f

upscaler: use fp16 again

8e7fbf8

Refactor: Flexible sd3 arch

e7eabd3

Refactor: Flexible Flux arch

4080c29

stduhpf mentioned this pull request Nov 29, 2024

Refactor: wtype per tensor from file instead of global #455

Closed

format code

5d501cd

leejet merged commit 7ce63e7 into leejet:master Nov 30, 2024
7 of 9 checks passed

stduhpf deleted the refactor-dit-setup branch January 1, 2025 15:48

iwr-redmond mentioned this pull request Jan 23, 2025

[FEATURE] Update stable-diffusion-cpp submodule NexaAI/nexa-sdk#358

Closed

This was referenced Mar 27, 2025

Unable to inference using Segmind Tiny SD model #603

Open

Flux-mini Support Request #638

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor: Flexible model architecture for dit models (Flux & SD3) #490

Refactor: Flexible model architecture for dit models (Flux & SD3) #490

stduhpf commented Nov 29, 2024 •

edited

Loading

leejet commented Nov 30, 2024

Refactor: Flexible model architecture for dit models (Flux & SD3) #490

Refactor: Flexible model architecture for dit models (Flux & SD3) #490

Conversation

stduhpf commented Nov 29, 2024 • edited Loading

Motivations:

New models supported with these chages (examples):

leejet commented Nov 30, 2024

stduhpf commented Nov 29, 2024 •

edited

Loading