Open
Labels
enhancement, non-stale
Description
Describe the solution you'd like
With the recent TensorRT-LLM support for Whisper, and now that PyTriton supports TensorRT-LLM, it would be great to get examples of efficient client and server code, as well as examples of decoupled mode.
Describe alternatives you've considered
I've experimented with WhisperS2T coupled with FastAPI and PyTriton, and both perform well. It would be great to get a more involved example, like here and here.