← Back to the schedule

MLflow: Infrastructure for a Complete Machine Learning Life

Calendar icon

Thursday 15th

Time icon

11:30 | 12:10

Location icon

Theatre 19


Keywords defining the session:

- Machine Learning

- Apache Spark

Takeaway points of the session:

- MLflow concepts and abstractions for models, experiments, and project

- Understand aspects of MLflow APIs


ML development brings many new complexities beyond the traditional software development lifecycle. Unlike in traditional software development, ML developers want to try multiple algorithms, tools and parameters to get the best results, and they need to track this information to reproduce work. In addition, developers need to use many distinct systems to productionize models. To address these problems, many companies are building custom “ML platforms” that automate this lifecycle, but even these platforms are limited to a few supported algorithms and to each company’s internal infrastructure.

In this talk, we will present MLflow, a new open source project from Databricks that aims to design an open ML platform where organizations can use any ML library and development tool of their choice to reliably build and share ML applications. MLflow introduces simple abstractions to package reproducible projects, track results, and encapsulate models that can be used with many existing tools, accelerating the ML lifecycle for organizations of any size.