NVIDIA Triton

Intro

In this meetup, we explore NVIDIA Triton, an open-source inference server for deploying machine learning models.

With a simple Docker image configuration, we can set up a machine learning inference service that covers most of what a production deployment needs: serving models from all major Python frameworks, automatic scaling, dynamic batching, online model updates, pipelines, metrics, and more.
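
To give a feel for what talking to such a service looks like, here is a minimal sketch of a client that sends a request over Triton's HTTP endpoint, which follows the KServe v2 inference protocol. It is not taken from the meetup material. The server address, the model name `my_model`, and the input tensor name `INPUT0` are all assumptions chosen for illustration; a real model declares its own names, shapes, and datatypes in its configuration.

```python
import json
from urllib import request

# Assumptions for illustration only: a Triton container listening on the
# default HTTP port, and a hypothetical model with one FP32 input tensor.
TRITON_URL = "http://localhost:8000"
MODEL_NAME = "my_model"   # hypothetical model name
INPUT_NAME = "INPUT0"     # hypothetical input tensor name


def build_infer_request(rows):
    """Build a KServe v2 inference payload for a batch of equal-length float vectors."""
    batch = len(rows)
    width = len(rows[0])
    # The protocol accepts tensor data flattened in row-major order.
    flat = [float(x) for row in rows for x in row]
    return {
        "inputs": [
            {
                "name": INPUT_NAME,
                "shape": [batch, width],
                "datatype": "FP32",
                "data": flat,
            }
        ]
    }


def infer(rows, url=TRITON_URL, model=MODEL_NAME):
    """POST the payload to the model's infer endpoint and return the parsed JSON reply."""
    body = json.dumps(build_infer_request(rows)).encode("utf-8")
    req = request.Request(
        f"{url}/v2/models/{model}/infer",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req) as resp:
        return json.loads(resp.read())
```

Calling `infer([[1.0, 2.0], [3.0, 4.0]])` would send a batch of two vectors, provided a server is actually running with a model that matches these assumed names. Building the payload is kept separate from the network call so it can be inspected without a server.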

Material

GitHub

Meetup video