Check out NVIDIA TensorRT Model Optimizer v0.15, the latest release! The toolkit includes optimization techniques such as quantization and sparsity to speed up inference for generative AI models. #NVIDIA #TensorRT #AI
https://developer.nvidia.com/blog/nvidia-tensorrt-model-optimizer-v0-15-boosts-inference-performance-and-expands-model-support/