Pull requests: pytorch/torchtitan
Provide load_seed_checkpoint_only option
CLA Signed
This label is managed by the Meta Open Source bot.
#1800
opened Oct 5, 2025 by
fegin
Add a loss comparison script
CLA Signed
#1799
opened Oct 5, 2025 by
fegin
[RFC] Lift freqs_cis as an input of models
CLA Signed
#1797
opened Oct 4, 2025 by
fegin
add option to disable ft checkpoints
CLA Signed
fb-exported
meta-exported
#1795
opened Oct 3, 2025 by
tushar00jain
JointGraph-based Training Prototype
CLA Signed
#1794
opened Oct 3, 2025 by
SherlockNoMad
Draft
[wip][simplefsdp] fix simplefsdp gradient_divide_factor
CLA Signed
#1793
opened Oct 3, 2025 by
ruisizhang123
Draft
Enable ROCm CI support
ciflow/rocm
CLA Signed
module: rocm
#1786
opened Oct 2, 2025 by
akashveramd
[debug don't merge] pr to reproduce error in aot_eager+post_grad_custom_post_pass
CLA Signed
#1785
opened Oct 2, 2025 by
ruisizhang123
[RFC] Refactor attention and make attention mask an argument to the model
CLA Signed
#1776
opened Sep 30, 2025 by
fegin
[ap] Knobs to enable reorder/bucketing/async_tp passes
CLA Signed
#1772
opened Sep 30, 2025 by
IvanKobzarev
[mxfp8 MoE training] Support mxfp8 all to all in expert parallel
CLA Signed
#1765
opened Sep 26, 2025 by
danielvegamyhre
Adding config options for deterministic execution
CLA Signed
#1761
opened Sep 26, 2025 by
githubsgi
gpt-oss model enablement
CLA Signed
#1754
opened Sep 24, 2025 by
wwwjn
improve profiler
CLA Signed
#1753
opened Sep 24, 2025 by
tushar00jain
Draft
handle unable to load ft checkpoint
CLA Signed
#1752
opened Sep 24, 2025 by
tushar00jain
Draft
[DONT REVIEW] Debug Async TP CI
CLA Signed
#1751
opened Sep 24, 2025 by
fegin
Add support for AC budget API
CLA Signed
#1731
opened Sep 21, 2025 by
tohskai
[DONT REVIEW] debug ac(fsdp) in llama and deepseek
CLA Signed
[torchtitan][replicate] experimenting new replicate integration with torchtitan
CLA Signed
#1714
opened Sep 15, 2025 by
anshul-si
[Do not merge] Reproduce AC(FSDP(moe.experts)) composibility issue
CLA Signed