enable batch_size auto for model eval #2675

vkuzo · 2025-08-04T18:30:31Z

Summary:

Enables passing `--batch_size auto` to the model eval script.

On Llama 3.1 8B with the wikitext and hellaswag tasks, this reduces the runtime from
13 minutes to 4.5 minutes on my machines (roughly a 2.9x speedup).
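
For readers wondering why `auto` can help: a small fixed batch size may leave accelerator memory unused, while an automatic mode probes for a larger batch that still fits. The sketch below illustrates one common strategy (start large, halve on an out-of-memory error). It is not taken from this PR or from lm_eval; `find_max_batch_size`, `fake_forward`, and the simulated memory limit are hypothetical names used only for illustration.

```python
from typing import Callable


def find_max_batch_size(try_batch: Callable[[int], None], start: int = 64) -> int:
    """Return the first batch size, halving down from `start`, that try_batch accepts.

    Hypothetical helper: `try_batch` is expected to run one forward pass at the
    given batch size and raise MemoryError if it does not fit.
    """
    batch_size = start
    while batch_size >= 1:
        try:
            try_batch(batch_size)
            return batch_size
        except MemoryError:
            batch_size //= 2  # halve and retry with a smaller batch
    raise RuntimeError("even batch size 1 does not fit in memory")


def fake_forward(batch_size: int, limit: int = 24) -> None:
    """Stand-in for a real forward pass: anything above `limit` 'runs out of memory'."""
    if batch_size > limit:
        raise MemoryError(f"batch size {batch_size} exceeds simulated limit {limit}")


# With a simulated limit of 24, the search tries 64, then 32, then settles on 16.
best = find_max_batch_size(fake_forward, start=64)
print(best)
```

A real implementation would catch the framework's own out-of-memory exception instead of `MemoryError` and would likely probe with the longest sequence in the dataset.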

Test Plan:

with-proxy time python benchmarks/_models/eval_hf_models.py --model_id meta-llama/Llama-3.1-8B --tasks wikitext hellaswag --batch_size auto

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]

vkuzo · 2025-08-04T18:30:32Z

Stack from ghstack (oldest at bottom):

-> enable batch_size auto for model eval #2675

pytorch-bot · 2025-08-04T18:30:34Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2675

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

ghstack-mergeability-check and Check labels failing with 'Resource not accessible by integration'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary:

Enables passing `--batch_size auto` to the model eval script.

On Llama 3.1 8B with the wikitext and hellaswag tasks, this reduces the runtime from 13 minutes to 4.5 minutes on my machines (roughly a 2.9x speedup).

Test Plan:

```
with-proxy time python benchmarks/_models/eval_hf_models.py --model_id meta-llama/Llama-3.1-8B --tasks wikitext hellaswag --batch_size auto
```

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 9a91dd6
ghstack-comment-id: 3151916665
Pull Request resolved: #2675

[ghstack-poisoned]

Summary:

Enables passing `--batch_size auto` to the model eval script.

On Llama 3.1 8B with the wikitext and hellaswag tasks, this reduces the runtime from 13 minutes to 4.5 minutes on my machines (roughly a 2.9x speedup).

Test Plan:

```
with-proxy time python benchmarks/_models/eval_hf_models.py --model_id meta-llama/Llama-3.1-8B --tasks wikitext hellaswag --batch_size auto
```

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 91c5dd0
ghstack-comment-id: 3151916665
Pull Request resolved: #2675

jerryzh168

int still works right?

[ghstack-poisoned]

Summary:

Enables passing `--batch_size auto` to the model eval script.

On Llama 3.1 8B with the wikitext and hellaswag tasks, this reduces the runtime from 13 minutes to 4.5 minutes on my machines (roughly a 2.9x speedup).

Test Plan:

```
with-proxy time python benchmarks/_models/eval_hf_models.py --model_id meta-llama/Llama-3.1-8B --tasks wikitext hellaswag --batch_size auto
```

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: d4da3ce
ghstack-comment-id: 3151916665
Pull Request resolved: #2675

vkuzo · 2025-08-05T11:35:51Z

> int still works right?

yes, it does
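
One way a `--batch_size` flag can accept both an integer and the literal `auto` is a custom argparse `type` function. The sketch below is an illustration only, not the code from `eval_hf_models.py`; `batch_size_arg`, `build_parser`, and the default of 1 are assumptions made for the example.

```python
import argparse
from typing import Union


def batch_size_arg(value: str) -> Union[int, str]:
    """Accept the literal string 'auto' or a positive integer (hypothetical helper)."""
    if value == "auto":
        return value
    try:
        parsed = int(value)
    except ValueError:
        raise argparse.ArgumentTypeError(
            f"expected a positive int or 'auto', got {value!r}"
        )
    if parsed < 1:
        raise argparse.ArgumentTypeError(f"batch size must be >= 1, got {parsed}")
    return parsed


def build_parser() -> argparse.ArgumentParser:
    """Build a minimal parser with only the flag discussed here."""
    parser = argparse.ArgumentParser(description="Illustrative eval CLI")
    parser.add_argument(
        "--batch_size",
        type=batch_size_arg,
        default=1,  # assumed default, chosen for the example only
        help="positive integer or 'auto'",
    )
    return parser


# Both spellings parse; the int form is converted, 'auto' is passed through as a string.
args_auto = build_parser().parse_args(["--batch_size", "auto"])
args_int = build_parser().parse_args(["--batch_size", "8"])
print(args_auto.batch_size, args_int.batch_size)
```

The parsed value can then be forwarded unchanged to whatever evaluation backend accepts either form.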

vkuzo added 2 commits August 4, 2025 11:03

Update

bf78f5d

[ghstack-poisoned]

Update

2601060

[ghstack-poisoned]

vkuzo mentioned this pull request Aug 4, 2025

fix lm_eval import in eval_hf_models.py #2674

Merged

meta-cla bot added the CLA Signed label Aug 4, 2025

vkuzo added the topic: not user facing label Aug 4, 2025

Update

63801d1

[ghstack-poisoned]

vkuzo requested review from andrewor14 and jerryzh168 August 4, 2025 18:51

jerryzh168 approved these changes Aug 4, 2025

Update

e4b349f

[ghstack-poisoned]

vkuzo changed the base branch from gh/vkuzo/101/head to main August 5, 2025 11:35

vkuzo merged commit 18edd01 into main Aug 5, 2025
39 of 49 checks passed
