Added evaluation script for qualcomm LlamaModel #11663

rohansjoshi · 2025-06-14T00:49:33Z

Summary:
Script for evaluating models which follow qualcomm's LlamaModel definition, on lm eval harness tasks such as WikiText

Results for WikiText evaluation task:

Model Name	max_seq_len	word_perplexity
Llama 1B Instruct	128	34.82890030691187
Llama 1B Instruct	512	22.919538703371582

Differential Revision: D76634688

pytorch-bot · 2025-06-14T00:49:37Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11663

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit d4f2c9e with merge base 56392aa ():

NEW FAILURE - The following job has failed:

Lint / lintrunner / linux-job (gh)
RuntimeError: Command docker exec -t b432530c983f21d5f2037dbca211ceb695271704fcc1375a017998d9d78a7742 /exec failed with exit code 127

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-06-14T00:50:01Z

This pull request was exported from Phabricator. Differential Revision: D76634688

github-actions · 2025-06-14T00:50:34Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

facebook-github-bot · 2025-06-14T00:56:17Z

This pull request was exported from Phabricator. Differential Revision: D76634688

Summary: Pull Request resolved: pytorch#11663 Script for evaluating models which follow qualcomm's LlamaModel definition, on lm eval harness tasks such as WikiText Results for WikiText evaluation task: | Model Name | max_seq_len | word_perplexity |----------|----------|----------| | Llama 1B Instruct | 128 | 34.82890030691187 | | Llama 1B Instruct | 512 | 22.919538703371582 | Differential Revision: D76634688

cccclai

Thank you for adding the eval scripts. @shewu-quic @haowhsu-quic fyi we're adding eval scripts here and trying to improve the accuracy for ptq

haowhsu-quic · 2025-06-15T15:13:01Z

Thank you for adding the eval scripts. @shewu-quic @haowhsu-quic fyi we're adding eval scripts here and trying to improve the accuracy for ptq

Thank you, this is very helpful!

facebook-github-bot · 2025-06-15T16:50:27Z

This pull request was exported from Phabricator. Differential Revision: D76634688

Summary: Pull Request resolved: pytorch#11663 Script for evaluating models which follow qualcomm's LlamaModel definition, on lm eval harness tasks such as WikiText Results for WikiText evaluation task: | Model Name | max_seq_len | word_perplexity |----------|----------|----------| | Llama 1B Instruct | 128 | 34.82890030691187 | | Llama 1B Instruct | 512 | 22.919538703371582 | Reviewed By: cccclai Differential Revision: D76634688

facebook-github-bot · 2025-06-16T15:25:36Z

This pull request was exported from Phabricator. Differential Revision: D76634688

Differential Revision: D76634688 Pull Request resolved: pytorch#11663

rohansjoshi requested a review from cccclai as a code owner June 14, 2025 00:49

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 14, 2025

facebook-github-bot added the fb-exported label Jun 14, 2025

rohansjoshi force-pushed the export-D76634688 branch from 246afd1 to 74603d2 Compare June 14, 2025 00:56

cccclai approved these changes Jun 14, 2025

View reviewed changes

rohansjoshi force-pushed the export-D76634688 branch from 74603d2 to 7596337 Compare June 15, 2025 16:50

rohansjoshi force-pushed the export-D76634688 branch from 7596337 to d4f2c9e Compare June 16, 2025 15:25

facebook-github-bot merged commit 057558f into pytorch:main Jun 16, 2025
101 of 104 checks passed

abhinaykukkadapu pushed a commit to abhinaykukkadapu/executorch that referenced this pull request Jun 17, 2025

Added evaluation script for qualcomm LlamaModel

105da68

Differential Revision: D76634688 Pull Request resolved: pytorch#11663

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added evaluation script for qualcomm LlamaModel #11663

Added evaluation script for qualcomm LlamaModel #11663

Uh oh!

rohansjoshi commented Jun 14, 2025

Uh oh!

pytorch-bot bot commented Jun 14, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Jun 14, 2025

Uh oh!

github-actions bot commented Jun 14, 2025

Uh oh!

facebook-github-bot commented Jun 14, 2025

Uh oh!

cccclai left a comment •

edited

Loading

Uh oh!

haowhsu-quic commented Jun 15, 2025

Uh oh!

facebook-github-bot commented Jun 15, 2025

Uh oh!

facebook-github-bot commented Jun 16, 2025

Uh oh!

Uh oh!

Uh oh!

Added evaluation script for qualcomm LlamaModel #11663

Added evaluation script for qualcomm LlamaModel #11663

Uh oh!

Conversation

rohansjoshi commented Jun 14, 2025

Uh oh!

pytorch-bot bot commented Jun 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11663

❌ 1 New Failure

Uh oh!

facebook-github-bot commented Jun 14, 2025

Uh oh!

github-actions bot commented Jun 14, 2025

This PR needs a release notes: label

Uh oh!

facebook-github-bot commented Jun 14, 2025

Uh oh!

cccclai left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

haowhsu-quic commented Jun 15, 2025

Uh oh!

facebook-github-bot commented Jun 15, 2025

Uh oh!

facebook-github-bot commented Jun 16, 2025

Uh oh!

Uh oh!

Uh oh!

pytorch-bot bot commented Jun 14, 2025 •

edited

Loading

This PR needs a `release notes:` label

cccclai left a comment •

edited

Loading