MLflow
MLflow is an open-source platform that helps you manage the end-to-end machine learning lifecycle, from tracking experiments to registering and deploying models efficiently within Studio.
What is MLflow?
MLflow provides four core functionalities:
Experiment Tracking: Log and compare parameters, metrics, and outputs across multiple model training runs
Project Packaging: Package data science code into reproducible formats for consistent execution
Model Registry: Store, version, and manage ML models with stage transitions (e.g. Staging, Production)
Model Deployment: Deploy models seamlessly for batch or real-time inference
MLflow in Studio
Studio integrates MLflow directly within your development environment, enabling you to:
- Track experiments systematically for each notebook or script run
- Save and manage models centrally in the shared Jovyan volume
- View, compare, and analyse experiments within the MLflow UI
- Register models for team usage and promote them to production confidently
Example: The MLflow launch interface within Studio, accessible from your apps list.
Key Features
1 Experiment Tracking
Log all training details such as:
- Parameters: Hyperparameters or configuration values used for each run
- Metrics: Model performance results like accuracy or loss
- Artifacts: Outputs such as model files, plots, or evaluation reports
- Models: Save trained models with metadata for reuse or deployment
2 Model Registry
Manage your models with:
- Version Control: Track multiple versions of the same model seamlessly
- Stage Transitions: Move models through stages like Staging, Production, or Archived
- Team Collaboration: Allow team members to view, promote, or roll back models as needed
Example: The Model Registry interface showing models, versions, and stage statuses.
3 Integrated Deployment
Registered models can be used for:
- Batch Inference: Run predictions in notebooks or pipelines
- App Integration: Embed models in Streamlit or Dash apps for interactive use cases
- Automated Pipelines: Combine with Airflow for scheduled retraining and deployment workflows
Security and Environment
MLflow runs within your activated Python environment in Studio, ensuring compatibility
Models and artifacts are securely stored in the shared Jovyan volume
Access is controlled based on your workspace permissions and roles
Summary
MLflow enables you to:
- Track and compare your machine learning experiments
- Manage models efficiently with versioning and stage transitions
- Deploy models seamlessly into pipelines, notebooks, or apps
- Collaborate with your team to promote the best performing models to production
- Maintain full reproducibility and visibility for every stage of your ML workflow
Explore the next section on Connections and Secrets to integrate your ML workflows securely with external databases and deployment environments.