[core] Moved namespaces and computes to database #3446

amitsrivastava · 2023-08-25T18:52:04Z

The existing implementation was connecting to external Altus and DWX
api's to fetch the available computes. This commit changes that by
moving the cluster config to database. Two new tables have been added
(a) beeswax_namespace to hold config for a namespace/dialect
(b) beeswax_compute to hold configs for individual compute clusters
linked to the namespaces.
This change currently support hive and impala clusters.

There is also a service-discovery component that keeps the list of
namespaces and computes updated in the corresponding tables.
sync_warehouses.py performs the service discovery in the CDW environ
by talking to kubernetes api's. It creates one Hive and one Impala
namespace and one compute for each virtual warehouse. The command is
supposed to run every minute and keep the list of warehouses updated.

There are related changes and fixes to the rest of the code to support
the query execution on different computes.

Change-Id: Ifd8dbc8d716dfe2000fbfa8121e39f2610051fa1

ranade1

looks good so far. if I get some code walkthrough will be helpful.

apps/beeswax/src/beeswax/server/dbms.py

ranade1 · 2023-08-29T20:58:59Z

desktop/core/src/desktop/management/commands/sync_warehouses.py

@@ -0,0 +1,203 @@
+#!/usr/bin/env python


how frequently does the sync warehouse happen? Is this script responsible for deleting VW which has gone away?

wing2fly

Please do downgrade test before merging.

apps/beeswax/src/beeswax/server/dbms.py

apps/jobbrowser/src/jobbrowser/apis/query_api.py

Harshg999

Nice work! Taking shape, few review comments to address.

apps/beeswax/src/beeswax/server/dbms.py

apps/jobbrowser/src/jobbrowser/apis/query_api.py

Harshg999 · 2023-09-05T07:21:32Z

desktop/core/src/desktop/api2.py


  response[interface] = namespaces
  response['status'] = 0

  return JsonResponse(response)

-
 @api_error_handler
 def get_context_computes(request, interface):


Update docstring

desktop/core/src/desktop/lib/computes/models.py

desktop/core/src/desktop/management/commands/sync_warehouses.py

Harshg999 · 2023-09-05T07:30:31Z

desktop/core/src/desktop/models.py

@@ -1800,7 +1800,7 @@ def get_config(self):
      ],
      'default_sql_interpreter': default_sql_interpreter,
      'cluster_type': self.cluster_type,
-      'has_computes': self.cluster_type in ('altus', 'snowball'), # or any grouped engine connectors
+      'has_computes': self.cluster_type in ('cdw', 'altus', 'snowball'), # or any grouped engine connectors


Let's remove altus and snowball? They are not used anywhere right?
@JohanAhlen Any idea?

I am thinking of creating a separate commit to remove the altus/snowball related code.

desktop/libs/notebook/src/notebook/connectors/base.py

amitsrivastava · 2023-09-14T19:41:48Z

Please do downgrade test before merging.

Done the downgrade test. Rollback worked fine.

ranade1

Looks good

The existing implementation was connecting to external Altus and DWX api's to fetch the available computes. This commit changes that by moving the cluster config to database. Two new tables have been added (a) `beeswax_namespace` to hold config for a namespace/dialect (b) `beeswax_compute` to hold configs for individual compute clusters linked to the namespaces. This change currently support hive and impala clusters. There is also a service-discovery component that keeps the list of namespaces and computes updated in the corresponding tables. `sync_warehouses.py` performs the service discovery in the CDW environ by talking to kubernetes api's. It creates one Hive and one Impala namespace and one compute for each virtual warehouse. The command is supposed to run every minute and keep the list of warehouses updated. There are related changes and fixes to the rest of the code to support the query execution on different computes. Change-Id: Ifd8dbc8d716dfe2000fbfa8121e39f2610051fa1

amitsrivastava requested review from wing2fly, JohanAhlen, ranade1, athithyaaselvam and Harshg999 August 25, 2023 18:52

ranade1 reviewed Aug 29, 2023

View reviewed changes

wing2fly approved these changes Aug 31, 2023

View reviewed changes

wing2fly reviewed Aug 31, 2023

View reviewed changes

apps/beeswax/src/beeswax/server/dbms.py Outdated Show resolved Hide resolved

apps/jobbrowser/src/jobbrowser/apis/query_api.py Outdated Show resolved Hide resolved

Harshg999 reviewed Sep 5, 2023

View reviewed changes

amitsrivastava force-pushed the dev/amit/compute-with-namespace-master-aug-25 branch 2 times, most recently from f81856f to 074dae5 Compare September 14, 2023 19:34

amitsrivastava force-pushed the dev/amit/compute-with-namespace-master-aug-25 branch 5 times, most recently from 7a1c082 to 3ea8782 Compare September 15, 2023 20:32

ranade1 approved these changes Sep 18, 2023

View reviewed changes

amitsrivastava force-pushed the dev/amit/compute-with-namespace-master-aug-25 branch from 3ea8782 to 7b326ed Compare September 18, 2023 22:13

amitsrivastava enabled auto-merge (rebase) September 18, 2023 22:14

amitsrivastava force-pushed the dev/amit/compute-with-namespace-master-aug-25 branch 3 times, most recently from 3c9a591 to acaeb5f Compare September 19, 2023 05:23

amitsrivastava force-pushed the dev/amit/compute-with-namespace-master-aug-25 branch from acaeb5f to 3594a73 Compare September 19, 2023 05:50

amitsrivastava merged commit 283676c into master Sep 19, 2023
3 checks passed

amitsrivastava deleted the dev/amit/compute-with-namespace-master-aug-25 branch September 19, 2023 06:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[core] Moved namespaces and computes to database #3446

[core] Moved namespaces and computes to database #3446

amitsrivastava commented Aug 25, 2023 •

edited

Loading

ranade1 left a comment

ranade1 Aug 29, 2023

wing2fly left a comment

Harshg999 left a comment

Harshg999 Sep 5, 2023

Harshg999 Sep 5, 2023

amitsrivastava Sep 14, 2023

amitsrivastava commented Sep 14, 2023

ranade1 left a comment

[core] Moved namespaces and computes to database #3446

[core] Moved namespaces and computes to database #3446

Conversation

amitsrivastava commented Aug 25, 2023 • edited Loading

ranade1 left a comment

Choose a reason for hiding this comment

ranade1 Aug 29, 2023

Choose a reason for hiding this comment

wing2fly left a comment

Choose a reason for hiding this comment

Harshg999 left a comment

Choose a reason for hiding this comment

Harshg999 Sep 5, 2023

Choose a reason for hiding this comment

Harshg999 Sep 5, 2023

Choose a reason for hiding this comment

amitsrivastava Sep 14, 2023

Choose a reason for hiding this comment

amitsrivastava commented Sep 14, 2023

ranade1 left a comment

Choose a reason for hiding this comment

amitsrivastava commented Aug 25, 2023 •

edited

Loading