10.8.0 GA

For more information, see the TensorRT 10.8.0 release notes.

Key Features and Updates:

Demo changes
- demoDiffusion
  - Added Image-to-Image support for Flux-1.dev and Flux.1-schnell pipelines.
  - Added ControlNet support for FLUX.1-Canny-dev and FLUX.1-Depth-dev pipelines. Native FP8 quantization is also supported for these pipelines.
  - Added support for ONNX model export only mode. See --onnx-export-only.
  - Added FP16, BF16, FP8, and FP4 support for all Flux Pipelines.
Plugin changes
- Added SM 100 and SM 120 support to bertQKVToContextPlugin. This enables demo/BERT on Blackwell GPUs.
Sample changes
- Added a new sampleEditableTimingCache to demonstrate how to build an engine with the desired tactics by modifying the timing cache.
- Deleted the sampleAlgorithmSelector sample.
- Fixed sampleOnnxMNIST by updating the correct INT8 dynamic range.
Parser changes
- Added support for FLOAT4E2M1 types for quantized networks.
- Added support for dynamic axes and improved performance of CumSum operations.
- Fixed the import of local functions when their input tensor names aliased one from an outside scope.
- Added support for Pow ops with integer-typed exponent values.
Fixed issues
- Fixed segmentation of boolean constant nodes - 4224.
- Fixed accuracy issue when multiple optimization profiles were defined 4250.

Provide feedback