10.8.0 GA
For more information, see the TensorRT 10.8.0 release notes.
Key Features and Updates:
- Demo changes
- demoDiffusion
- Added Image-to-Image support for Flux-1.dev and Flux.1-schnell pipelines.
- Added ControlNet support for FLUX.1-Canny-dev and FLUX.1-Depth-dev pipelines. Native FP8 quantization is also supported for these pipelines.
- Added support for ONNX model export only mode. See --onnx-export-only.
- Added FP16, BF16, FP8, and FP4 support for all Flux Pipelines.
- demoDiffusion
- Plugin changes
- Added SM 100 and SM 120 support to bertQKVToContextPlugin. This enables demo/BERT on Blackwell GPUs.
- Sample changes
- Added a new
sampleEditableTimingCache
to demonstrate how to build an engine with the desired tactics by modifying the timing cache. - Deleted the
sampleAlgorithmSelector
sample. - Fixed
sampleOnnxMNIST
by updating the correct INT8 dynamic range.
- Added a new
- Parser changes
- Added support for
FLOAT4E2M1
types for quantized networks. - Added support for dynamic axes and improved performance of
CumSum
operations. - Fixed the import of local functions when their input tensor names aliased one from an outside scope.
- Added support for
Pow
ops with integer-typed exponent values.
- Added support for
- Fixed issues