
Conversation

wsttiger
Collaborator

Add TensorRT Decoder Plugin for Quantum Error Correction

Overview

This PR introduces a TensorRT-based decoder plugin that uses NVIDIA TensorRT to accelerate neural-network inference for quantum error correction (QEC) decoding.

Key Features

  • TensorRT Integration: Full TensorRT runtime integration with support for both ONNX model loading and pre-built engine loading (a rough sketch of the two paths follows this list)
  • Flexible Precision Support: Configurable precision modes (fp16, bf16, int8, fp8, tf32, best) with automatic hardware capability detection
  • Memory Management: Efficient CUDA memory allocation and stream-based execution
  • Parameter Validation: Comprehensive input validation with clear error messages
  • Python Utilities: ONNX to TensorRT engine conversion script for model preprocessing
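
The following sketch is illustrative only and is not the plugin source: it shows how the two loading paths above could be wired to the TensorRT C++ API. The function name, shape, and all variable names are assumptions.

#include <NvInfer.h>
#include <NvOnnxParser.h>
#include <fstream>
#include <iterator>
#include <memory>
#include <string>
#include <vector>

// Illustrative sketch of the two model-loading paths; the runtime is passed
// in because it must outlive the engine it deserializes.
nvinfer1::ICudaEngine *make_engine(const std::string &path, bool from_onnx,
                                   nvinfer1::IRuntime &runtime,
                                   nvinfer1::ILogger &logger) {
  if (from_onnx) {
    // "onnx_load_path": parse the ONNX graph and build an engine on the fly.
    std::unique_ptr<nvinfer1::IBuilder> builder(
        nvinfer1::createInferBuilder(logger));
    std::unique_ptr<nvinfer1::INetworkDefinition> network(
        builder->createNetworkV2(0));
    std::unique_ptr<nvonnxparser::IParser> parser(
        nvonnxparser::createParser(*network, logger));
    parser->parseFromFile(path.c_str(),
                          static_cast<int>(nvinfer1::ILogger::Severity::kWARNING));
    std::unique_ptr<nvinfer1::IBuilderConfig> config(
        builder->createBuilderConfig());
    std::unique_ptr<nvinfer1::IHostMemory> plan(
        builder->buildSerializedNetwork(*network, *config));
    return runtime.deserializeCudaEngine(plan->data(), plan->size());
  }
  // "engine_load_path": deserialize a pre-built engine file directly.
  std::ifstream in(path, std::ios::binary);
  std::vector<char> blob((std::istreambuf_iterator<char>(in)),
                         std::istreambuf_iterator<char>());
  return runtime.deserializeCudaEngine(blob.data(), blob.size());
}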

Technical Implementation

  • Core Decoder Class: trt_decoder, implementing the decoder interface with a TensorRT backend
  • Hardware Detection: Automatic GPU capability detection for optimal precision selection (illustrated after this list)
  • Error Handling: Robust error handling with graceful fallbacks and informative error messages
  • Plugin Architecture: CMake-based plugin system with conditional TensorRT linking
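
As a rough illustration of capability-based precision selection (the actual rules in trt_decoder.cpp may differ; the compute-capability thresholds below are general GPU-architecture facts, and the fallback policy is an assumption):

#include <cuda_runtime_api.h>
#include <string>

// Hypothetical policy: pick a precision the current GPU can actually run.
// FP8 needs SM 8.9+ (Ada/Hopper), BF16 needs SM 8.0+ (Ampere); FP16 is
// broadly available on the GPUs supported by TensorRT 10.
std::string select_precision(const std::string &requested) {
  cudaDeviceProp prop{};
  cudaGetDeviceProperties(&prop, /*device=*/0);
  int sm = prop.major * 10 + prop.minor;
  if (requested == "best")
    return sm >= 89 ? "fp8" : (sm >= 80 ? "bf16" : "fp16");
  if (requested == "fp8" && sm < 89)
    return "fp16"; // graceful fallback rather than a hard failure
  if (requested == "bf16" && sm < 80)
    return "fp16";
  return requested;
}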

Files Added/Modified

  • libs/qec/include/cudaq/qec/trt_decoder_internal.h - Internal API declarations
  • libs/qec/lib/decoders/plugins/trt_decoder/trt_decoder.cpp - Main decoder implementation
  • libs/qec/lib/decoders/plugins/trt_decoder/CMakeLists.txt - Plugin build configuration
  • libs/qec/python/cudaq_qec/plugins/tensorrt_utils/build_engine_from_onnx.py - Python utility
  • libs/qec/unittests/test_trt_decoder.cpp - Comprehensive unit tests
  • Updated CMakeLists.txt files for integration

Testing

  • ✅ All 8 unit tests passing
  • Parameter validation tests (a representative case is sketched after this list)
  • File loading utility tests
  • Edge case handling tests
  • Error condition tests
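
The individual tests are not reproduced in this description; purely as an illustration, a parameter-validation case could look like the following. The test name, the tensor construction, and the expected exception type are assumptions, not copied from test_trt_decoder.cpp.

#include "cudaq/qec/trt_decoder_internal.h"
#include <gtest/gtest.h>

TEST(TRTDecoderTest, RejectsMissingModelPath) {
  // Placeholder parity check matrix; shape and type chosen for illustration.
  cudaqx::tensor<uint8_t> H({2, 4});
  cudaqx::heterogeneous_map params; // no onnx_load_path or engine_load_path
  params.insert("precision", "fp16");
  // Constructing the decoder without a model source should fail loudly.
  EXPECT_THROW(trt_decoder(H, params), std::runtime_error);
}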

Usage Example

// Load from an ONNX model (H is the code's parity check matrix)
cudaqx::heterogeneous_map params;
params.insert("onnx_load_path", "model.onnx");
params.insert("precision", "fp16");
auto decoder = std::make_unique<trt_decoder>(H, params);

// Or load a pre-built engine
params.clear();
params.insert("engine_load_path", "model.trt");
auto engine_decoder = std::make_unique<trt_decoder>(H, params);
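
After construction, decoding goes through the common decoder interface. Assuming the usual decode() entry point, a syndrome passed as a vector of doubles, and the common decoder_result field names, a call might look like:

// Detector/syndrome measurements for one round (placeholder values).
std::vector<double> syndrome = {1.0, 0.0, 0.0, 1.0};
auto result = decoder->decode(syndrome);
// result.converged and result.result would carry the convergence flag and the
// predicted error estimate, per the usual decoder_result layout.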

Dependencies

  • TensorRT 10.13.3.9+
  • CUDA 12.0+
  • NVIDIA GPU with appropriate compute capability

Performance Benefits

  • GPU-accelerated inference for QEC decoding
  • Optimized precision selection based on hardware capabilities
  • Efficient memory usage with CUDA streams (see the sketch after this list)
  • Reduced latency compared to CPU-based decoders
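
To make the stream-based execution point concrete, here is a condensed, illustrative sequence using the TensorRT 10 execution API. Buffer setup, tensor names, and sizes are placeholders, not the plugin's actual code.

#include <NvInfer.h>
#include <cuda_runtime_api.h>

// Assumes device buffers d_in/d_out and host buffers h_in/h_out of
// in_bytes/out_bytes were allocated during decoder construction.
void infer_async(nvinfer1::IExecutionContext *context, cudaStream_t stream,
                 void *d_in, void *d_out, const void *h_in, void *h_out,
                 size_t in_bytes, size_t out_bytes) {
  cudaMemcpyAsync(d_in, h_in, in_bytes, cudaMemcpyHostToDevice, stream);
  context->setTensorAddress("input", d_in);   // tensor names are placeholders
  context->setTensorAddress("output", d_out);
  context->enqueueV3(stream);                 // run inference on the stream
  cudaMemcpyAsync(h_out, d_out, out_bytes, cudaMemcpyDeviceToHost, stream);
  cudaStreamSynchronize(stream);              // block only when results are needed
}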

This implementation provides a production-ready TensorRT decoder plugin that can significantly accelerate quantum error correction workflows while maintaining compatibility with the existing CUDA-Q QEC framework.


copy-pr-bot bot commented Sep 29, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

- Add trt_decoder class implementing TensorRT-accelerated inference
- Support both ONNX model loading and pre-built engine loading
- Include precision configuration (fp16, bf16, int8, fp8, tf32, best)
- Add hardware platform detection for capability-based precision selection
- Implement CUDA memory management and stream-based execution
- Add Python utility script for ONNX to TensorRT engine conversion
- Update CMakeLists.txt to build TensorRT decoder plugin
- Add comprehensive parameter validation and error handling
Signed-off-by: Scott Thornton <[email protected]>
import tensorrt as trt


def build_engine(onnx_file,
Collaborator


Is this file exposed as part of the wheel such that regular users will be able to use this file?

@wsttiger
Collaborator Author

/ok to test fb16b36


copy-pr-bot bot commented Oct 16, 2025

/ok to test fb16b36

@wsttiger, there was an error processing your request: E2

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/2/

@wsttiger
Collaborator Author

/ok to test c9e563f
