NVIDIA / TensorRT-LLM Public

Fork 1.8k
Star 11.9k

Pull requests: NVIDIA/TensorRT-LLM

396 Open 4,818 Closed

Pull requests list

[None][chore] Remove duplicate log outputs in test_perf.py

#8418 opened Oct 16, 2025 by hyukn

1 task done

[None][chore] replace print_colored_debug with logger_debug

#8417 opened Oct 16, 2025 by Superjomn

1 task done

[TRTLLM-8638][fix] waive llam4 tests on H20

#8416 opened Oct 16, 2025 by xinhe-nv

1 task done

[None][chore] Align RPC performance with existing GenerationExecutor

#8415 opened Oct 16, 2025 by Superjomn

1 task

[https://nvbugs/5501820][fix] Add requirements for numba-cuda version to WAR mem corruption (#7992)

#8414 opened Oct 16, 2025 by pengbowang-nv • Draft

1 task

[None][bug] Set NCCL_GRAPH_REGISTER to false to avoid hang

#8413 opened Oct 16, 2025 by Tabrizian

1 task done

[TRTLLM-8480][chore] clean create_py_executor API

#8412 opened Oct 16, 2025 by QiJune

1 task done

[https://nvbugs/5451280][fix] Reduce memory fraction problem by warmu…

#8410 opened Oct 16, 2025 by liji-nv

1 task done

[None][bug] Set NCCL_GRAPH_REGISTER to false to avoid hang

#8409 opened Oct 16, 2025 by Tabrizian

1 task done

[https://nvbugs/5496705][fix] Fix high IPC overhead with logprobs enabled by removing duplicate sends

#8406 opened Oct 16, 2025 by nvxuanyuc

[TRTLLM-8535][feat] Support DeepSeek V3.2 with FP8 GEMM + BF16 KV cache

#8405 opened Oct 16, 2025 by chang-l • Draft

1 task done

[None][infra] Update CI allowed list 2025_10_15

#8403 opened Oct 15, 2025 by yuanjingx87

1 task

[None][feat] Update load_weights method to include mapping parameter in checkpoint loaders

#8402 opened Oct 15, 2025 by Funatiq • Draft

1 task

[https://nvbugs/5502901][fix] Set max_seq_len and max_batch_size in TestNemotronUltra test cases to prevent OOM

#8399 opened Oct 15, 2025 by amitz-nv

1 task

[TRTLLM-8436][feat] batched sampling and top-k logprobs improvements

#8398 opened Oct 15, 2025 by ixlmar • Draft

1 task done

[https://nvbugs/5437384][test] fix trtllm-llmapi-launch multi tests with single launch

#8397 opened Oct 15, 2025 by Superjomn

1 task done

[None][chore] Cleanup KV cache manager

#8396 opened Oct 15, 2025 by Funatiq • Draft

1 task

[none][feat] Support nano-v2-vlm with multiple PRs

#8395 opened Oct 15, 2025 by Wanli-Jiang • Draft

[TRTLLM-8669][infra] Use artifactory mirror for install python

#8394 opened Oct 15, 2025 by ZhanruiSunCh

1 task

[None][feat] Add fmha_v2 kernel for head_dim=80 and sm=100 to support VLM

#8392 opened Oct 15, 2025 by Wanli-Jiang

1 task done

[https://nvbugs/5540138][fix] Fix shape error when duplicating kv.

#8390 opened Oct 15, 2025 by Tracin

1 task

[None][fix] improve mpirun hang issues

#8385 opened Oct 15, 2025 by xinhe-nv • Draft

1 task

Ampere xqa swa specdec

#8383 opened Oct 15, 2025 by jhaotingc

1 task

[TRTLLM-8464][infra] Use public triton 3.5.0

#8382 opened Oct 15, 2025 by ZhanruiSunCh

1 task

[None][fix] fix error when processing batches containing both text and mm data

Label: Community want to contribute (PRs initiated from Community)

#8381 opened Oct 15, 2025 by Nekofish-L

1 task done
