perplexity : faster HellaSwag via batching #5017

ggerganov · 2024-01-18T12:00:49Z

This PR speeds up HellaSwag computation in the perplexity tool by batching both the endings and the tasks into a single `llama_batch`.

For GPUs with plenty of FLOPS, adding `-c 1024` or even `-c 2048` might further improve performance.

By default we evaluate 1 task at a time, but for small tasks it is useful to batch them together. This can be controlled with the `--parallel` argument.
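The idea of batching several small tasks together can be illustrated with a minimal, standalone sketch. This is not the code from this PR: the `Task` struct, the `group_tasks` function, and the greedy strategy are assumptions made for illustration only. It assumes each task knows how many tokens it needs and that tasks are grouped until either a token budget (the context size) or a maximum task count (the `--parallel` value) is reached.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical description of one task: only its token count matters here.
struct Task {
    int required_tokens; // tokens needed for the context plus all endings
};

// Greedily group consecutive tasks into batches (illustrative only).
// Returns half-open index ranges [first, last) into `tasks`.
std::vector<std::pair<std::size_t, std::size_t>> group_tasks(
        const std::vector<Task> & tasks, int n_ctx, int max_tasks) {
    std::vector<std::pair<std::size_t, std::size_t>> groups;
    std::size_t i0 = 0;
    while (i0 < tasks.size()) {
        std::size_t i1 = i0;
        int n_cur = 0;
        // keep adding tasks while they fit the token budget and the task limit
        while (i1 < tasks.size() &&
               static_cast<int>(i1 - i0) < max_tasks &&
               n_cur + tasks[i1].required_tokens <= n_ctx) {
            n_cur += tasks[i1].required_tokens;
            ++i1;
        }
        if (i1 == i0) {
            // the task does not fit on its own (or max_tasks < 1):
            // emit it alone so the loop always advances; the caller must handle it
            ++i1;
        }
        groups.emplace_back(i0, i1);
        i0 = i1;
    }
    return groups;
}
```

With `max_tasks` set to 1 every task lands in its own group, which mirrors the default of evaluating one task at a time. Raising `n_ctx` or `max_tasks` lets more small tasks share a group, at the cost of larger batches.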

ggml-ci

* perplexity : faster HellaSwag ggml-ci
* perplexity : clean-up ggml-ci
* perplexity : no need for decode_helper ggml-ci
* perplexity : add comments
* perplexity : option to specify max batched tasks via `n_parallel`
* perplexity : remove HellaSwag restruction for n_batch

ggerganov added 7 commits January 18, 2024 13:43

perplexity : faster HellaSwag

baa5279

ggml-ci

perplexity : clean-up

4351c46

ggml-ci

perplexity : no need for decode_helper

af30901

ggml-ci

Merge branch 'master' into gg/hellaswag-batched

0e4e58f

ggml-ci

perplexity : add comments

30ebd94

perplexity : option to specify max batched tasks via n_parallel

64d173b

perplexity : remove HellaSwag restruction for n_batch

9df62c2

ggerganov merged commit ad19812 into master Jan 18, 2024

ggerganov deleted the gg/hellaswag-batched branch January 18, 2024 13:33

ikawrakow mentioned this pull request Jan 18, 2024

HellaSwag: speed up by parallelizing log-prob evaluation #5020

Merged

ggerganov mentioned this pull request Jan 18, 2024

perplexity : faster Winogrande via batching #5024

Merged
