You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please refer to [this tutorial](https://pytorch.org/executorch/main/llm/llama-demo-android.html) to for full instructions on building the Android LLAMA Demo App.
226
+
224
227
# What is coming next?
225
228
## Quantization
226
229
- Enabling FP16 model to leverage smaller groupsize for 4-bit quantization.
0 commit comments