The roadmap provides a high level overview of key areas that will likely span multiple releases.
The roadmap provides links to critical user journeys(CUJs) that we want to deliver. A CUJ shows how a user would accomplish some critical task (for example build, train, and deploy a model).
Kubeflow does a major release at the end of every quarter. Minor releases occur as needed to fix important bugs.
For detailed information about what will be in a release look for the issues taged "area/X.Y.Z".
If you are a member of the Kubeflow org you can use these search queries
- Issues for 0.4.0
We are working diligently to get Kubeflow to its first major version release 1.0 and plan to have this ready in early half of 2019. This will be a significant milestone for the project. Here are some critical areas for the release:
-
Stabilized APIs for training (TFJob/PyTorch operators) and serving.
-
Robust support for monitoring and logging.
-
Scale and load testing.
-
Integration with hyperparameter tuning with Katib.
The features in this enterprise readiness theme focus on better integration with existing enterprise infrastructure and support for secure data access. Some of the highlights in the area include:
- Multi User Kubeflow Deployments
- Isolation of environments within a cluster.
- RBAC and IAM integrations.
- Support for multi-tenancy.
- Hybrid/Multi-cluster deployments.
- Support for POSIX filesystems.
- Issues
We have heard from our users and based on the feedback we are continuing to improve the deployment experience of Kubeflow. Here are some areas we are working on:
- A uniform CLI / UI based deployment experience based on a common backend kubeflow/kubeflow#1419.
- Simplified UI Driven deployment.
- Support for upgrading existing Kubeflow deployments.
- Simplified deployment for local Kubeflow using Minikube and Microk8s.
- Deployment related issues
Continue to improve development experience for Data Scientists and ML Practitioners using Kubeflow.
- Notebooks driven interface for developing ML workflows and pipelines.
- CUJ illustrating Build/Train/Deploy from Notebook
- Jupyter related issues
- Fairing project
- Minimize the need for switching contexts out of the notebook / development environment for launching / tracking jobs.
- Provide a seamless experience for local development connected with cloud/on-prem execution environment.
Continue to build and incorporate additional components enabling advanced ML workflows.
- Katib integration to work with TFJob or PyTorch operators for hyperparameter tuning kubeflow/katib#39.
- Make all new and updated TFX components available.
- Feature engineering and feature management support.
- Model management and deployment support.
With a growing community of developers across Kubeflow there is a need to build/support tools and engineering practices that will enable faster development and reliable releases.
- Support for release workflows.
- Scalable testing across platforms: GPU Testing, Different base images, multiple H/W and Cloud platforms.
- Upgrade testing.
- Testing Issues
- Build/Release issues