TensorRT Optimization #293
Is there any TensorRT optimised inference available?
StyleTTS2 consists of several models and components that work together to generate audio. To optimize it with TensorRT, you first need to export each model separately from PyTorch to ONNX. Once converted, you can either optimize only the most expensive component or convert the whole pipeline.

From my experience with ablation studies, the decoder is the most resource-intensive component in StyleTTS2. If you aim for partial optimization, converting just the decoder from PyTorch to ONNX and running it in TensorRT already gives a significant speedup. Alternatively, converting all the models to ONNX and running them in ONNX Runtime with the TensorRT execution provider also yields noticeable performance gains. I have tested this approach and it is feasible; a rough sketch is below.
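For reference, a minimal sketch of what the per-component export and TensorRT execution-provider setup could look like. The decoder input names, shapes, and the `model.decoder` handle are illustrative assumptions, not the repo's exact API:

```python
import torch
import onnxruntime as ort

# Sketch: export the StyleTTS2 decoder to ONNX, then run it through
# ONNX Runtime with the TensorRT execution provider.
# `model` is assumed to be the loaded StyleTTS2 model bundle; the input
# names and shapes below are placeholders for illustration.
model.decoder.eval()

asr = torch.randn(1, 512, 100)   # aligned text features (assumed shape)
f0  = torch.randn(1, 200)        # predicted F0 curve
n   = torch.randn(1, 200)        # predicted energy
s   = torch.randn(1, 128)        # style vector

torch.onnx.export(
    model.decoder,
    (asr, f0, n, s),
    "decoder.onnx",
    input_names=["asr", "f0", "energy", "style"],
    output_names=["audio"],
    dynamic_axes={
        "asr": {2: "frames"},
        "f0": {1: "frames"},
        "energy": {1: "frames"},
        "audio": {1: "samples"},
    },
    opset_version=17,
)

# TensorRT EP first; ONNX Runtime falls back to CUDA/CPU for any
# node TensorRT does not support.
session = ort.InferenceSession(
    "decoder.onnx",
    providers=[
        "TensorrtExecutionProvider",
        "CUDAExecutionProvider",
        "CPUExecutionProvider",
    ],
)

audio = session.run(
    None,
    {
        "asr": asr.numpy(),
        "f0": f0.numpy(),
        "energy": n.numpy(),
        "style": s.numpy(),
    },
)[0]
```

The same export-then-session pattern can be repeated for the other components if you want the full pipeline on the TensorRT execution provider.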
Hi @UmerrAhsan. Could you please share the overall latencies of your ONNX model?
Hi @nityanandmathur. I ran the decoder model and the predictor's text encoder model in TensorRT, which reduced my latency by over 50%. I also cached the style vectors from the diffusion sampler and the style encoder beforehand. With that, a single short sentence runs in under 100 ms.
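As a side note, the style caching described above can be as simple as computing the reference style once and reusing it for every sentence. A hypothetical sketch, where the `compute_style` helper and the inference call are assumptions based on the repo's demo flow rather than exact code:

```python
import torch

# Hypothetical sketch: cache the style vector so the diffusion sampler and
# style encoder run once per reference clip instead of once per sentence.
_style_cache = {}

def get_style(ref_path, compute_style_fn):
    """Return a cached style vector for a reference clip, computing it on first use."""
    if ref_path not in _style_cache:
        with torch.no_grad():
            _style_cache[ref_path] = compute_style_fn(ref_path)
    return _style_cache[ref_path]

# Usage (names assumed): every later sentence with the same reference skips
# the diffusion / style-encoder pass, so only the TensorRT-optimized decoder
# and predictor text encoder run per sentence.
# s_ref = get_style("reference.wav", compute_style)
# wav   = inference(text, s_ref)
```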