Skip to content

Latest commit

 

History

History
88 lines (60 loc) · 4.74 KB

ROADMAP.md

File metadata and controls

88 lines (60 loc) · 4.74 KB

Kubeflow 2019 roadmap

The roadmap provides a high level overview of key areas that will likely span multiple releases.

The roadmap provides links to critical user journeys(CUJs) that we want to deliver. A CUJ shows how a user would accomplish some critical task (for example build, train, and deploy a model).

Kubeflow does a major release at the end of every quarter. Minor releases occur as needed to fix important bugs.

For detailed information about what will be in a release look for the issues taged "area/X.Y.Z".

If you are a member of the Kubeflow org you can use these search queries

Kubeflow 1.0

We are working diligently to get Kubeflow to its first major version release 1.0 and plan to have this ready in early half of 2019. This will be a significant milestone for the project. Here are some critical areas for the release:

  • Stabilized APIs for training (TFJob/PyTorch operators) and serving.

  • Robust support for monitoring and logging.

  • Scale and load testing.

  • Integration with hyperparameter tuning with Katib.

Enterprise Readiness

The features in this enterprise readiness theme focus on better integration with existing enterprise infrastructure and support for secure data access. Some of the highlights in the area include:

  • Multi User Kubeflow Deployments
  • Isolation of environments within a cluster.
  • RBAC and IAM integrations.
  • Support for multi-tenancy.
  • Hybrid/Multi-cluster deployments.
  • Support for POSIX filesystems.
  • Issues

Deployment Experience

We have heard from our users and based on the feedback we are continuing to improve the deployment experience of Kubeflow. Here are some areas we are working on:

Development Experience

Continue to improve development experience for Data Scientists and ML Practitioners using Kubeflow.

  • Notebooks driven interface for developing ML workflows and pipelines.
  • Minimize the need for switching contexts out of the notebook / development environment for launching / tracking jobs.
  • Provide a seamless experience for local development connected with cloud/on-prem execution environment.

Advanced ML Platform

Continue to build and incorporate additional components enabling advanced ML workflows.

  • Katib integration to work with TFJob or PyTorch operators for hyperparameter tuning kubeflow/katib#39.
  • Make all new and updated TFX components available.
  • Feature engineering and feature management support.
  • Model management and deployment support.

Test Release Infrastructure

With a growing community of developers across Kubeflow there is a need to build/support tools and engineering practices that will enable faster development and reliable releases.

  • Support for release workflows.
  • Scalable testing across platforms: GPU Testing, Different base images, multiple H/W and Cloud platforms.
  • Upgrade testing.
  • Testing Issues
  • Build/Release issues