NVIDIA Triton
Intro¶
In this meeting, we discover NVIDIA Triton, an open-source tool that allows for the deployment of machine learning models.
With a simple configuration of a Docker image, it's possible to create a machine learning inference service capable of doing everything we might desire: serving models from all major Python frameworks, automatic scaling, dynamic batching, online model updates, pipelines, metrics, and more.