Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dask Deployment #7

Open
9 tasks
Jan-Willem opened this issue Dec 3, 2024 · 0 comments
Open
9 tasks

Dask Deployment #7

Jan-Willem opened this issue Dec 3, 2024 · 0 comments

Comments

@Jan-Willem
Copy link
Member

Objective

Deploy Dask on a Kubernetes cluster using the official Dask Helm chart, if available, or an alternative deployment method.


Requirements

  • Deploy Dask components as Kubernetes pods, including:
    • Dask Scheduler
    • Dask Workers
    • Dask Gateway (if applicable)
  • Ensure the Dask dashboard is accessible.
  • Verify that Dask can access Lustre storage.
  • Validate functionality by running a test computation task with example workflows.

Definition of Done

Deployment Specification

  • Deploy Dask using the official Helm chart (if available) or another recommended deployment method.

Infrastructure Setup

  • Dask Scheduler and Workers are running as Kubernetes pods.
  • Dask Gateway (if applicable) is deployed and operational.

Functional Validation

  • The Dask dashboard is accessible and functional.
  • A test computation task executes successfully using example workflows.

Storage Integration

  • Dask has verified access to Lustre storage.

Key Decision Points

  1. Helm Chart or Deployment Method:
    • Confirm the availability of an official Dask Helm chart and assess its suitability.
    • If unavailable, choose an alternative deployment method.
  2. Lustre Integration:
    • Evaluate methods for connecting Dask with Lustre storage.
  3. Scaling and Resource Management:
    • Define Kubernetes resource limits and scaling policies for Dask Scheduler and Workers.
  4. Networking Configuration:
    • Ensure efficient communication between the Dask Scheduler and Workers.

Artifacts

  • Dask Helm chart configuration files or equivalent deployment artifacts.
  • Documentation for deployment and integration with Kubernetes and Lustre.
  • Example computation workflows for validation.

Success Criteria

  • Dask is fully deployed and operational on the Kubernetes cluster.
  • The Dask dashboard is accessible to users.
  • Example workflows execute successfully without errors.
  • Dask can reliably access and utilize Lustre storage.

Potential Challenges

  • Network or configuration issues preventing Dask dashboard accessibility.
  • Resource constraints impacting the stability of Dask Scheduler or Workers.
  • Inefficiencies in communication between Scheduler and Workers on Kubernetes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant