Enhance LLaVA dataset processing with optional text preprocessing #1000

wanghao9610 · 2025-03-09T02:47:24Z

Dear XTuner Contributors,

Thank you for open-sourcing the MLLM code. I noticed that the process_hf_dataset function used by LLaVADataset takes several minutes to preprocess the text data every time the program starts. In contrast, some other repositories (e.g., the original LLaVA and LLaVA-Next) preprocess text data on the fly during training.

To address this, this PR makes the upfront text preprocessing in LLaVA dataset processing optional. When it is disabled, the program no longer has to preprocess all text data at startup, which removes the multi-minute wait before training begins.
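The idea can be sketched as follows. This is a minimal illustration, not the code in this PR: the class LazyTextDataset, the flag name preprocess_text, and the toy tokenizer are all hypothetical stand-ins, and the real LLaVADataset may be structured differently. The sketch also works on a deep copy of each record, which is one way to avoid mutating the raw data when samples are processed on demand.

```python
import copy


def toy_tokenize(text):
    """Stand-in tokenizer (assumption): maps each word to its length."""
    return [len(word) for word in text.split()]


class LazyTextDataset:
    """Hypothetical dataset used for illustration; not XTuner's LLaVADataset."""

    def __init__(self, records, tokenize, preprocess_text=True):
        self.records = records
        self.tokenize = tokenize
        self.preprocess_text = preprocess_text
        if preprocess_text:
            # Eager path: pay the full tokenization cost once, at startup.
            self.records = [self._encode(record) for record in records]

    def _encode(self, record):
        # Work on a deep copy so the caller's raw record is never mutated.
        item = copy.deepcopy(record)
        item["input_ids"] = self.tokenize(item["conversation"])
        return item

    def __len__(self):
        return len(self.records)

    def __getitem__(self, index):
        if self.preprocess_text:
            return self.records[index]
        # Lazy path: tokenize only the sample that is being requested.
        return self._encode(self.records[index])


raw = [
    {"conversation": "describe the image"},
    {"conversation": "what color is the car"},
]
eager = LazyTextDataset(raw, toy_tokenize, preprocess_text=True)
lazy = LazyTextDataset(raw, toy_tokenize, preprocess_text=False)
```

Both modes return the same sample for a given index. The lazy mode skips the startup pass and instead tokenizes each sample every time it is fetched, so it trades startup time for a small per-sample cost during training.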

I kindly request that you review my code and consider merging this PR.

Best regards,

wanghao9610

wanghao9610 · 2025-03-09T07:37:33Z

@LZHgrla @HIT-cwh This PR includes updates to the popular LLaVADataset. Could you kindly review it at your earliest convenience? Thanks!

wanghao9610 added 2 commits March 9, 2025 10:39

Enhance LLaVA dataset processing with optional text preprocessing

c3e862a

Use deep copy to prevent data mutation in LLaVA dataset processing

6d14081

Add auto resume feature for training checkpoints

ceb9e7f
