
[Sprint 2][Performance Fine Tuning] Hugging Face Transformer Models #12467

@mcr229

Description

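| Model Name | Source | Type | Status | % Ops Lowered | % Time in XNNPACK | Inference Time | Model Load Time | Performance Table |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| BERT | Huggingface Transformers | Quantized | | | | | | |
| BERT | Huggingface Transformers | Floating Point | | | | | | |
| DistilBERT | Huggingface Transformers | Quantized | | | | | | |
| DistilBERT | Huggingface Transformers | Floating Point | | | | | | |
| RoBERTa | Huggingface Transformers | Quantized | | | | | | |
| RoBERTa | Huggingface Transformers | Floating Point | | | | | | |
| Gemma3 | Huggingface Transformers | Quantized | | | | | | |
| Gemma3 | Huggingface Transformers | Floating Point | | | | | | |
| Gemma3n | Huggingface Transformers | Quantized | | | | | | |
| Gemma3n | Huggingface Transformers | Floating Point | | | | | | |
| Llama | Huggingface Transformers | Quantized | | | | | | |
| Llama | Huggingface Transformers | Floating Point | | | | | | |
| Olmo | Huggingface Transformers | Quantized | | | | | | |
| Olmo | Huggingface Transformers | Floating Point | | | | | | |
| Phi4 | Huggingface Transformers | Quantized | | | | | | |
| Phi4 | Huggingface Transformers | Floating Point | | | | | | |
| Qwen3 | Huggingface Transformers | Quantized | | | | | | |
| Qwen3 | Huggingface Transformers | Floating Point | | | | | | |
| SmolLM | Huggingface Transformers | Quantized | | | | | | |
| SmolLM | Huggingface Transformers | Floating Point | | | | | | |
| BART | Huggingface Transformers | Quantized | | | | | | |
| BART | Huggingface Transformers | Floating Point | | | | | | |
| T5 | Huggingface Transformers | Quantized | | | | | | |
| T5 | Huggingface Transformers | Floating Point | | | | | | |
| ResNet | Huggingface Transformers | Quantized | | | | | | |
| ResNet | Huggingface Transformers | Floating Point | | | | | | |
| ViT | Huggingface Transformers | Quantized | | | | | | |
| ViT | Huggingface Transformers | Floating Point | | | | | | |
| Yolos | Huggingface Transformers | Quantized | | | | | | |
| Yolos | Huggingface Transformers | Floating Point | | | | | | |
| SAM | Huggingface Transformers | Quantized | | | | | | |
| SAM | Huggingface Transformers | Floating Point | | | | | | |
| Wav2vec2 | Huggingface Transformers | Quantized | | | | | | |
| Wav2vec2 | Huggingface Transformers | Floating Point | | | | | | |
| Whisper | Huggingface Transformers | Quantized | | | | | | |
| Whisper | Huggingface Transformers | Floating Point | | | | | | |
| CLAP | Huggingface Transformers | Quantized | | | | | | |
| CLAP | Huggingface Transformers | Floating Point | | | | | | |
| Seamless M4T | Huggingface Transformers | Quantized | | | | | | |
| Seamless M4T | Huggingface Transformers | Floating Point | | | | | | |
| CLIP | Huggingface Transformers | Quantized | | | | | | |
| CLIP | Huggingface Transformers | Floating Point | | | | | | |
| Smolvlm | Huggingface Transformers | Quantized | | | | | | |
| Smolvlm | Huggingface Transformers | Floating Point | | | | | | |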

Metadata


Labels: triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Status: In progress
