Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Re-position PyTorch arch as v1.0
#3956 opened Apr 29, 2025 by laikhtewari Loading…
chore: remove release branch codeowners from main
#3954 opened Apr 29, 2025 by tburt-nv Loading…
fix: ModelRunnerCpp num_return_sequences
#3951 opened Apr 29, 2025 by Funatiq Loading…
test: Add fp8kv to DS-v3-lite integration tests.
#3950 opened Apr 29, 2025 by bobboli Loading…
chore: bump version to 0.20.0rc2
#3949 opened Apr 29, 2025 by ZhanruiSunCh Loading…
infra: Fix pipeline step error in post merge
#3948 opened Apr 29, 2025 by ZhanruiSunCh Loading…
fix: Move all casters to customCasters.
#3945 opened Apr 29, 2025 by dcampora Loading…
fix cache transfer buffer
#3942 opened Apr 29, 2025 by chuangz0 Loading…
chore: Remove duplicated get_sm_version.
#3935 opened Apr 29, 2025 by yuxianq Loading…
feat: NIXL interface integration
#3934 opened Apr 29, 2025 by Shixiaowei02 Loading…
chore: revert PR 3751
#3933 opened Apr 29, 2025 by byshiue Loading…
[NVBUG 5247699]Fix mixtral fp4 llmapi bug.
#3929 opened Apr 29, 2025 by Tracin Loading…
unwaive disagg tests
#3925 opened Apr 29, 2025 by chuangz0 Loading…
feat:[AutoDeploy] E2E build example for llama4 VLM
#3922 opened Apr 28, 2025 by Fridah-nv Loading…
Llama4: Fix PP
#3920 opened Apr 28, 2025 by v-shobhit Draft
test: skip tests on b200
#3913 opened Apr 28, 2025 by xinhe-nv Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.