Conversation


@gshtras commented on Aug 12, 2024

Adding the changes required to support Llama 3.1:
vllm-project#6553
vllm-project#6693
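
For context, the two upstream PRs above add the Llama 3.1 rope-scaling scheme, which rescales the rotary inverse frequencies based on their wavelength. Below is a minimal, illustrative sketch of that scaling recipe; the function name, signature, and defaults are assumptions for illustration and do not mirror the exact vLLM implementation.

```python
import math

import torch


def scale_llama3_inv_freq(
    inv_freq: torch.Tensor,
    factor: float = 8.0,
    low_freq_factor: float = 1.0,
    high_freq_factor: float = 4.0,
    original_max_position: int = 8192,
) -> torch.Tensor:
    """Rescale rotary inverse frequencies following the published Llama 3.1 recipe."""
    low_freq_wavelen = original_max_position / low_freq_factor
    high_freq_wavelen = original_max_position / high_freq_factor
    wavelen = 2 * math.pi / inv_freq

    # Interpolation weight for wavelengths that fall between the two thresholds.
    smooth = (original_max_position / wavelen - low_freq_factor) / (
        high_freq_factor - low_freq_factor
    )
    interpolated = (1 - smooth) * inv_freq / factor + smooth * inv_freq

    # Short wavelengths (high frequencies) are kept as-is, long wavelengths are
    # divided by the scaling factor, and the band in between is blended smoothly.
    return torch.where(
        wavelen < high_freq_wavelen,
        inv_freq,
        torch.where(wavelen > low_freq_wavelen, inv_freq / factor, interpolated),
    )
```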

@gshtras requested a review from shajrawi on August 12, 2024 at 17:24

@shajrawi left a comment

ship it

@gshtras merged commit dd1a208 into main on Aug 12, 2024
@gshtras deleted the llama3.1 branch on August 12, 2024 at 20:45
@JArnoldAMD mentioned this pull request on Aug 26, 2024
shajrawi pushed a commit that referenced this pull request Aug 26, 2024
* Add support for a rope extension method (vllm-project#6553)

* [BugFix] Fix RoPE error in Llama 3.1 (vllm-project#6693)

---------

Co-authored-by: Simon Mo <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
shajrawi pushed a commit that referenced this pull request Aug 27, 2024
* optimizations for process output step

* Llama3.1 (#129)

* Add support for a rope extension method (vllm-project#6553)

* [BugFix] Fix RoPE error in Llama 3.1 (vllm-project#6693)

---------

Co-authored-by: Simon Mo <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>

* Update hipblaslt and FA revs to match what was used for MLPerf

* Switch to "unified docker" with a ROCm 6.2 base image

This base image includes current libraries, so there is no need for us to rebuild hipblaslt, RCCL, and Flash Attention.

---------

Co-authored-by: Shomy <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: Simon Mo <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>