-
Notifications
You must be signed in to change notification settings - Fork 11.8k
Support for OpenELM of Apple #6868
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Nice to see the LLaMA* idea implemented in these models: |
Not sure if anyone is working on this yet but i'm happy to pick it up |
Okay update: It's more difficult as I first expected it and this is a new codebase so it's stumped me a bit. ![]() Still attempting it but i don't think i'm gonna be done any time soon |
@joshcarp |
Yeah, so basically, i think I can't figure out how to calculate the kqv offsets for the kqv tensor every layer. I'm using this as a reference: https://huggingface.co/apple/OpenELM-270M/blob/main/modeling_openelm.py If anyone else wants to implement this feel free |
If anyone can help out: #6986 |
Does this work? Did anyone try this? |
Have you checked the files? |
Is it possible to have support forr openelm |
Prerequisites
Please answer the following questions for yourself before submitting an issue.
Feature Description
Support for OpenELM of Apple
https://huggingface.co/apple/OpenELM-3B-Instruct/tree/main
The text was updated successfully, but these errors were encountered: