What does this PR do?

This PR fixes the `--train_text_encoder` flag in `train_dreambooth_sd3.py`, which previously failed during training because `encode_prompt` and `_encode_prompt_with_t5` did not handle the cases where `tokenizers` or `text_input_ids` is `None`.

When `--train_text_encoder` is enabled, the training pipeline pre-tokenizes the prompts once and passes `text_input_ids_list` directly, to avoid retokenizing on every step. The original code, however, assumed `tokenizers` was always available, causing crashes such as:

```
AttributeError: 'NoneType' object has no attribute 'encode'
```

or

```
ValueError: text_input_ids must be provided...
```
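For context, a minimal sketch of that pre-tokenization pattern (helper names and call arguments are illustrative, not the exact script code):

```python
# Pre-tokenize the instance prompt once, before the training loop
# (tokenize_prompt / encode_prompt names follow the script; arguments are simplified).
tokens_one = tokenize_prompt(tokenizer_one, args.instance_prompt)
tokens_two = tokenize_prompt(tokenizer_two, args.instance_prompt)
tokens_three = tokenize_prompt(tokenizer_three, args.instance_prompt)

for batch in train_dataloader:
    # Passing None tokenizers tells encode_prompt to fall back to the
    # precomputed ids instead of retokenizing on every step.
    prompt_embeds, pooled_prompt_embeds = encode_prompt(
        text_encoders=[text_encoder_one, text_encoder_two, text_encoder_three],
        tokenizers=[None, None, None],
        prompt=args.instance_prompt,
        text_input_ids_list=[tokens_one, tokens_two, tokens_three],
    )
```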

This PR:

✅ Makes `_encode_prompt_with_t5` and `encode_prompt` robust to a `None` tokenizer (see the sketch after this list) by:

  • Accepting precomputed `text_input_ids` as a fallback
  • Raising a clear `ValueError` when the tokenizer is missing and no `text_input_ids` are provided
  • Preserving the batch-size logic even when `prompt` is `None`

✅ Ensures the training and inference code paths stay compatible
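A simplified sketch of the resulting fallback logic (the real helper also handles dtype casting and joint-attention details; treat this as illustrative, not the exact diff):

```python
def _encode_prompt_with_t5(
    text_encoder,
    tokenizer,
    max_sequence_length,
    prompt=None,
    num_images_per_prompt=1,
    device=None,
    text_input_ids=None,
):
    prompt = [prompt] if isinstance(prompt, str) else prompt

    if tokenizer is not None:
        # Inference path: tokenize the prompt on the fly.
        text_inputs = tokenizer(
            prompt,
            padding="max_length",
            max_length=max_sequence_length,
            truncation=True,
            return_tensors="pt",
        )
        text_input_ids = text_inputs.input_ids
    elif text_input_ids is None:
        # Training path: --train_text_encoder passes precomputed ids instead.
        # Fail loudly here rather than crashing deep inside the text encoder.
        raise ValueError("text_input_ids must be provided when the tokenizer is not specified.")

    # The batch size survives a None prompt because the precomputed ids carry it.
    batch_size = len(prompt) if prompt is not None else text_input_ids.shape[0]

    prompt_embeds = text_encoder(text_input_ids.to(device))[0]

    # Duplicate embeddings per requested image, keeping the batch dimension consistent.
    _, seq_len, _ = prompt_embeds.shape
    prompt_embeds = prompt_embeds.repeat(1, num_images_per_prompt, 1)
    prompt_embeds = prompt_embeds.view(batch_size * num_images_per_prompt, seq_len, -1)
    return prompt_embeds
```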

Fixes #8507
