Pull requests: pytorch/torchtitan

Pull requests list

[SimpleFSDP] Add support for hsdp+tp CLA Signed
#1343 opened Jun 26, 2025 by ruisizhang123
Only calls destroy_process_group if the trainer exist successfully CLA Signed
#1342 opened Jun 26, 2025 by fegin
[DSV3] Apply TP on DSV3 CLA Signed
#1341 opened Jun 26, 2025 by wwwjn
Always ignore freqs_cis CLA Signed
#1338 opened Jun 25, 2025 by fegin
missing dependency in pyproject for tyro CLA Signed
#1335 opened Jun 24, 2025 by wesleytruong
[WIP] Refactor Tokenizer -> BaseTokenizer CLA Signed
#1333 opened Jun 24, 2025 by H-Huang Draft
[kernels][blackwell] add cutlass/cute group gemm forward for blackwell CLA Signed
#1327 opened Jun 22, 2025 by lessw2020
[WIP] expert parallel dp2ep CLA Signed
#1324 opened Jun 21, 2025 by tianyu-l Draft
Support finetuning from a pretrained model CLA Signed
#1321 opened Jun 20, 2025 by vwxyzjn
[float8] add _auto_filter_for_recipe for float8 training CLA Signed
#1319 opened Jun 18, 2025 by danielvegamyhre
Support different tokenizers CLA Signed
#1318 opened Jun 18, 2025 by H-Huang
[not for land] testing out float8 128_1_128_128 blockwise scaling CLA Signed
#1317 opened Jun 18, 2025 by vkuzo
Do not submit: Multinode training seems to be working CLA Signed
#1314 opened Jun 17, 2025 by ahmadsharif1 Draft
Do not submit: Multinode is working with multiple controllers CLA Signed
#1313 opened Jun 17, 2025 by ahmadsharif1 Draft
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks CLA Signed
#1304 opened Jun 16, 2025 by hann-wang
Finetune from pre-trained models CLA Signed
#1300 opened Jun 15, 2025 by vwxyzjn
[not for land] Use new AC CLA Signed
#1294 opened Jun 13, 2025 by soulitzer
WIP: Try to use monarch to run torchtitan. CLA Signed
#1288 opened Jun 12, 2025 by ahmadsharif1 Draft
Titan changes to use DCP ZOC instead of titan default Async + Pinned Memory CLA Signed
#1287 opened Jun 12, 2025 by Saiteja64
DO NOT SUBMIT: WIP: Try to use monarch to run torchtitan. CLA Signed
#1286 opened Jun 12, 2025 by ahmadsharif1 Draft
[deepseek][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl (forward,backward) CLA Signed
#1276 opened Jun 8, 2025 by lessw2020
[deepseek][blackwell] add Cutlass cute dsl blackwell dense based looping group gemm CLA Signed
#1274 opened Jun 8, 2025 by lessw2020
[llama4] enable expert parallel on the same device mesh as tp (tp2ep) CLA Signed
#1269 opened Jun 6, 2025 by hann-wang
Enable ROCm CI support. ciflow/rocm CLA Signed module: rocm
#1260 opened Jun 4, 2025 by akashveramd Draft
[WIP][Blackwell Kernels] Blackwell group gemm and dense gemms with Python Cutlass CLA Signed
#1256 opened Jun 3, 2025 by lessw2020