-
Notifications
You must be signed in to change notification settings - Fork 7.9k
Pull requests: karpathy/nanoGPT
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Replacing DDP with FSDP since it has sharing capability train.py
#626
opened Aug 4, 2025 by
mostafamos
Loading…
feat: implement ALiBi (Attention with Linear Biases) for length extra…
#623
opened Jul 16, 2025 by
nimarez
Loading…
fix: Add trust_remote_code=True to openwebtext loading for compatibility with latest datasets library
#614
opened Jun 9, 2025 by
niloydebbarma-code
Loading…
fix: minor typo in multi-node training info (train.py)
#597
opened Mar 14, 2025 by
ramanakshay
Loading…
RoPE implementation with a shakespeare-char-rope test
#590
opened Jan 26, 2025 by
albertvucinovic
Loading…
Refactored code from different base based on leyan_branch
#588
opened Jan 9, 2025 by
cesposo
Loading…
Add the Quantized model and also a Demo of the Quantized model
#587
opened Jan 9, 2025 by
Ruhaan838
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.