Skip to content

TensorRT-LLM v0.18.2 release #3611

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Apr 16, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@ TensorRT-LLM
[![python](https://img.shields.io/badge/python-3.10-green)](https://www.python.org/downloads/release/python-31012/)
[![cuda](https://img.shields.io/badge/cuda-12.8.1-green)](https://developer.nvidia.com/cuda-downloads)
[![trt](https://img.shields.io/badge/TRT-10.9.0-green)](https://developer.nvidia.com/tensorrt)
[![version](https://img.shields.io/badge/release-0.18.1-green)](./tensorrt_llm/version.py)
[![version](https://img.shields.io/badge/release-0.18.2-green)](./tensorrt_llm/version.py)
[![license](https://img.shields.io/badge/license-Apache%202-blue)](./LICENSE)

[Architecture](./docs/source/architecture/overview.md)   |   [Performance](./docs/source/performance/perf-overview.md)   |   [Examples](./examples/)   |   [Documentation](./docs/source/)   |   [Roadmap](https://docs.google.com/presentation/d/1gycPmtdh7uUcH6laOvW65Dbp9F1McUkGDIcAyjicBZs/edit?usp=sharing)
Expand Down
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
9f9942768fd5b0cf5ed19860ad539dc9 libtensorrt_llm_ucx_wrapper.so
d2efc6043262c896e262e8d8b97055af0f1f8b47 commit
edf502396e4443f284a5fae6044402478cf457c1 commit
Git LFS file not shown
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
e383212a40dca932c7b77bf4544dab80 libtensorrt_llm_ucx_wrapper.so
d2efc6043262c896e262e8d8b97055af0f1f8b47 commit
edf502396e4443f284a5fae6044402478cf457c1 commit
Git LFS file not shown
Git LFS file not shown
6 changes: 3 additions & 3 deletions cpp/tensorrt_llm/executor/aarch64-linux-gnu/version.txt
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
61ab1a6d4c62ee2a648f6daa5083c4de libtensorrt_llm_executor_static.a
2f2bc67944c45ce0965704da43c9b1c4 libtensorrt_llm_executor_static.pre_cxx11.a
d2efc6043262c896e262e8d8b97055af0f1f8b47 commit
1146671822817c690387dc77d775b8c7 libtensorrt_llm_executor_static.a
8f7cb0047a0c2690497a97911a60ed6d libtensorrt_llm_executor_static.pre_cxx11.a
edf502396e4443f284a5fae6044402478cf457c1 commit
Git LFS file not shown
Git LFS file not shown
6 changes: 3 additions & 3 deletions cpp/tensorrt_llm/executor/x86_64-linux-gnu/version.txt
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
e5da8cc2936606dfb49f4417d6961060 libtensorrt_llm_executor_static.a
ad5dfb89c2d719d99d67346828e92e25 libtensorrt_llm_executor_static.pre_cxx11.a
d2efc6043262c896e262e8d8b97055af0f1f8b47 commit
34a5173ddebafd3f1621af2717a92f54 libtensorrt_llm_executor_static.a
34eacc123dc995815fbd1e68ec98f78b libtensorrt_llm_executor_static.pre_cxx11.a
edf502396e4443f284a5fae6044402478cf457c1 commit
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
f3143205203b038b9dca6dd32cf02f59 libtensorrt_llm_nvrtc_wrapper.so
d2efc6043262c896e262e8d8b97055af0f1f8b47 commit
edf502396e4443f284a5fae6044402478cf457c1 commit
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
770ca93818f3f04837a67353e3f71fbc libtensorrt_llm_nvrtc_wrapper.so
d2efc6043262c896e262e8d8b97055af0f1f8b47 commit
edf502396e4443f284a5fae6044402478cf457c1 commit
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
6bf0ba4e9b8b1152a21316243d30bec6 libtensorrt_llm_internal_cutlass_kernels_static.a
96f8a359c84a78ba415f4d98ef1c4e1d libtensorrt_llm_internal_cutlass_kernels_static.pre_cxx11.a
d2efc6043262c896e262e8d8b97055af0f1f8b47 commit
edf502396e4443f284a5fae6044402478cf457c1 commit
Git LFS file not shown
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
0b3322f5047dd4ee549211c2d15483c4 libtensorrt_llm_internal_cutlass_kernels_static.a
502d4901fad6e648b8858051017c4cf2 libtensorrt_llm_internal_cutlass_kernels_static.pre_cxx11.a
d2efc6043262c896e262e8d8b97055af0f1f8b47 commit
4de75ffa1ff225422ba27f367175448f libtensorrt_llm_internal_cutlass_kernels_static.a
e91d6c762f26c0b158eba8f376914e6e libtensorrt_llm_internal_cutlass_kernels_static.pre_cxx11.a
edf502396e4443f284a5fae6044402478cf457c1 commit
6 changes: 6 additions & 0 deletions docs/source/release-notes.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,12 @@
All published functionality in the Release Notes has been fully tested and verified with known limitations documented. To share feedback about this release, access our [NVIDIA Developer Forum](https://forums.developer.nvidia.com/).


## TensorRT-LLM Release 0.18.2

### Key Features and Enhancements
- This update addresses known security issues. For the latest NVIDIA Vulnerability Disclosure Information visit https://www.nvidia.com/en-us/security/.


## TensorRT-LLM Release 0.18.1

### Key Features and Enhancements
Expand Down
2 changes: 1 addition & 1 deletion examples/baichuan/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.15.0
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/bloom/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/chatglm/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
protobuf
Expand Down
2 changes: 1 addition & 1 deletion examples/commandr/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets==2.14.6
evaluate~=0.4.1
rouge_score~=0.1.2
2 changes: 1 addition & 1 deletion examples/dbrx/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/deepseek_v1/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.6
evaluate~=0.4.1
rouge_score~=0.1.2
2 changes: 1 addition & 1 deletion examples/draft_target_model/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
rouge_score~=0.1.2
sentencepiece>=0.1.99
Expand Down
2 changes: 1 addition & 1 deletion examples/eagle/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
rouge_score~=0.1.2
SentencePiece~=0.1.99
Expand Down
2 changes: 1 addition & 1 deletion examples/falcon/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
transformers>=4.31.0
datasets~=2.14.5
evaluate~=0.4.1
Expand Down
2 changes: 1 addition & 1 deletion examples/gemma/requirements.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
# WAR the new posting of "nvidia-cudnn-cu12~=9.0".
# "jax[cuda12_pip]~=0.4.19" specifies "nvidia-cudnn-cu12>=8.9" but actually requires "nvidia-cudnn-cu12~=8.9".
nvidia-cudnn-cu12~=8.9; platform_machine == "x86_64"
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
flax~=0.8.0
# jax[cuda12_pip]~=0.4.19; platform_system != "Windows"
jax~=0.4.19; platform_system == "Windows"
Expand Down
2 changes: 1 addition & 1 deletion examples/gpt/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/gptj/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
rouge_score~=0.1.2
2 changes: 1 addition & 1 deletion examples/gptneox/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
rouge_score~=0.1.2
evaluate~=0.4.1
2 changes: 1 addition & 1 deletion examples/grok/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
-f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets==2.14.6
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/internlm/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets==2.14.5
rouge_score~=0.1.2
sentencepiece>=0.1.99
Expand Down
2 changes: 1 addition & 1 deletion examples/jais/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/llama/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
transformers>=4.43.0
datasets==2.14.6
evaluate~=0.4.1
Expand Down
2 changes: 1 addition & 1 deletion examples/lookahead/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
rouge_score~=0.1.2
sentencepiece>=0.1.99
Expand Down
2 changes: 1 addition & 1 deletion examples/mamba/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
transformers>=4.39.0
datasets~=2.14.5
evaluate
Expand Down
2 changes: 1 addition & 1 deletion examples/medusa/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
rouge_score~=0.1.2
sentencepiece>=0.1.99
Expand Down
2 changes: 1 addition & 1 deletion examples/mixtral/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
transformers==4.38.2
accelerate==0.25.0
2 changes: 1 addition & 1 deletion examples/mpt/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
rouge_score~=0.1.2
2 changes: 1 addition & 1 deletion examples/nemotron/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
nemo-toolkit[all]==2.0.0rc1
megatron-core==0.8.0
datasets~=2.14.5
Expand Down
2 changes: 1 addition & 1 deletion examples/opt/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
rouge_score~=0.1.2
2 changes: 1 addition & 1 deletion examples/phi/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/prompt_lookup/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
--extra-index-url https://pypi.nvidia.com
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
rouge_score~=0.1.2
sentencepiece~=0.1.99
Expand Down
2 changes: 1 addition & 1 deletion examples/quantization/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets>=2.14.4
nemo-toolkit[all]==2.0.0rc1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/qwen/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.16.0
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/qwenvl/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.16.0
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/recurrentgemma/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
git+https://github.com/google-deepmind/recurrentgemma.git@8a32e365
flax>=0.8.2
jax~=0.4.23
Expand Down
2 changes: 1 addition & 1 deletion examples/redrafter/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.14.5
rouge_score~=0.1.2
sentencepiece>=0.1.99
Expand Down
2 changes: 1 addition & 1 deletion examples/skywork/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets~=2.16.1
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/smaug/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
datasets==2.14.6
evaluate~=0.4.1
rouge_score~=0.1.2
Expand Down
2 changes: 1 addition & 1 deletion examples/whisper/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
tensorrt_llm==0.18.1
tensorrt_llm==0.18.2
tiktoken
datasets
kaldialign
Expand Down
Loading