Releases: triton-inference-server/model_analyzer

Release 1.37.0 corresponding to NGC container 24.02

01 Mar 01:12
5366d6f
v1.37.0

Update README.md for 24.02 (#830)

Release 1.36.0 corresponding to NGC container 24.01

30 Jan 01:17
4e45d78

New Features and Improvements

  • Model Analyzer now correctly loads and optimizes ensemble models
  • Model Analyzer now correctly works with SSL via gRPC
  • Model Analyzer now handles the case of optimizing a model on a remote Triton server without requiring a local GPU

Release 1.35.0 corresponding to NGC container 23.12

20 Dec 01:03
697390a

Known issues

  • Model Analyzer cannot analyze or optimize ensemble model configs because of a bug in the way composing models are loaded
  • Model Analyzer does not work with SSL via gRPC

Release 1.34.0 corresponding to NGC container 23.11

01 Dec 23:31
c24efcc
v1.34.0

Update README and version for 1.34.0 / 23.11 (#788)

Release 1.33.0 corresponding to NGC container 23.10

27 Oct 01:24
47ee88d
v1.33.0

Update README and versions for 23.10 branch (#772)

Release 1.32.0 corresponding to NGC container 23.09

27 Sep 21:36
83aa535
  • Remote mode now has the same capabilities as other modes
  • Supports profiling in both brute and quick search modes

Release 1.31.0 corresponding to NGC container 23.08

30 Aug 17:33
4b45d2d
  • Added Quick Start guides for Ensemble and BLS models

Release 1.30.0 corresponding to NGC container 23.07

31 Jul 15:53
6b985b9
  • Implemented periodic checkpointing
  • Added support for custom docker args
  • Added detection and handling of an invalid metrics URL
  • Profile will now automatically create the default detailed reports

Release 1.29.0 corresponding to NGC container 23.06

30 Jun 01:35
fe04987
  • request-rate-range can now be searched in brute mode
  • Perf Analyzer (PA) errors are now captured in a log file
  • Added detection for Triton Server launch failures
  • Added cpu_only option for ensemble composing models
  • Added binary concurrency search to quick search mode
  • Added binary parameter search to brute search mode

Release 1.28.0 corresponding to NGC container 23.05

25 Jul 22:56
2ec9299