Add parsec_advise_data_on_device for zpotrf_L #118

QingleiCao · 2024-06-07T13:39:06Z

These are the performance comparisons.

I switched the Cholesky test to Lower and will create an issue to track why Upper behaves differently.

src/zpotrf_L.jdf

src/zpotrf_wrapper.c

abouteiller · 2024-06-14T14:14:07Z

tests/testing_zpotrf.c

@@ -18,7 +18,8 @@ int main(int argc, char ** argv)
 {
    parsec_context_t* parsec;
    int iparam[IPARAM_SIZEOF];
-    dplasma_enum_t uplo = dplasmaUpper;


Don't forget to undo that at the end, once the Upper case is also done.


QingleiCao · 2024-06-14T18:58:22Z

I uploaded the performance results. BTW, I updated to the latest PaRSEC.

bosilca · 2024-06-18T15:27:30Z

I understand the performance benefits, but I have many concerns about the approach. We (or at least I) have claimed for a long time that JDF algorithms express the dataflow and are independent of other platform-related constraints. This PR is clearly at odds with that statement, and I'm not sure we want to go that route.

QingleiCao · 2024-06-21T15:42:58Z

As discussed, these changes will be moved to the wrapper to keep the JDF clean.

QingleiCao · 2024-09-05T14:14:14Z

How about adding this feature in parsec instead of in dplasma?

bosilca · 2024-09-05T18:59:28Z

Can you give us a hint on how that would look in parsec ?

QingleiCao · 2024-09-06T12:03:36Z

> Can you give us a hint on how that would look in parsec ?

I'm thinking of providing this feature in PaRSEC in the style of apply, with several predefined functions (like the operation in apply). Users could then call it from the wrapper or from other places, since this advice_device needs to be called only once (if my understanding is correct) and no change is needed in the JDF.

bosilca

Other programming paradigms call this a mapper. It would be nice to reuse a concept that is familiar to [at least some] users.
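To make the mapper idea from this exchange concrete, here is a minimal, self-contained sketch. It is not PaRSEC code: `tile_mapper_fn`, `gpu_grid_t`, `map_2d_block_cyclic` and `apply_mapper_lower` are hypothetical names invented for illustration, and the 2D block-cyclic policy is only one possible predefined mapper. In a real runtime, the loop body is where a per-tile advice call would presumably go.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical mapper signature: given tile coordinates (m, n), return the
 * index of the device that should preferably hold that tile. */
typedef int (*tile_mapper_fn)(int m, int n, void *ctx);

/* Hypothetical context for a 2D block-cyclic mapping over a grid of GPUs. */
typedef struct {
    int rows;            /* GPU grid rows */
    int cols;            /* GPU grid columns */
    const int *devices;  /* rows * cols device indices, row-major */
} gpu_grid_t;

/* One possible predefined mapper: 2D block-cyclic distribution of tiles. */
int map_2d_block_cyclic(int m, int n, void *ctx)
{
    const gpu_grid_t *g = (const gpu_grid_t *)ctx;
    return g->devices[(m % g->rows) * g->cols + (n % g->cols)];
}

/* Visit each tile of the lower triangle of an mt x mt tile matrix exactly
 * once and record the mapper's choice in preferred (mt * mt entries,
 * row-major). Entries above the diagonal are left untouched. */
void apply_mapper_lower(int mt, tile_mapper_fn mapper, void *ctx, int *preferred)
{
    assert(mapper != NULL && preferred != NULL);
    for (int m = 0; m < mt; m++) {
        for (int n = 0; n <= m; n++) {
            preferred[m * mt + n] = mapper(m, n, ctx);
        }
    }
}
```

Because the mapper is applied once, outside the algorithm description, the dataflow itself stays free of placement decisions, which is the concern raised earlier in the thread.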

bosilca · 2024-09-30T14:49:50Z

src/dplasmaaux.c

+#if defined(DPLASMA_HAVE_CUDA) || defined(DPLASMA_HAVE_HIP)
+
+/* Find all devices */
+void dplasma_find_nb_devices(int **dev_index, int *nb) {


This function has a generic name suggesting that it returns all devices, while internally it only selects the accelerators. Please rename it, and please also add support for ZE.

bosilca · 2024-09-30T14:50:26Z

src/dplasmaaux.c

+    if((*nb) == 0) {
+        char hostname[256];
+        gethostname(hostname, 256);
+        fprintf(stderr, "No CUDA device found on rank %d on %s\n",


We have proper methods in parsec to output warnings. Please use parsec_warning instead.

In general I don't think it is a good idea to return low-level memory upstream; the ownership is unclear. Since we are talking about a very small amount of memory here, I would suggest requiring the caller to pass an array of size parsec_nb_devices into this call, and having this function fill it in a single loop. All memory ownership then stays on the caller side.
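The ownership pattern suggested here could look roughly like the sketch below. It is deliberately independent of PaRSEC: `dev_type_t`, `warn_stub`, `is_accelerator` and `find_accelerators` are hypothetical stand-ins, the device table is passed in as a plain array rather than queried from the runtime, and `warn_stub` merely marks the spot where the real code would use parsec_warning.

```c
#include <assert.h>
#include <stdio.h>

/* Stand-in device types for illustration; not the PaRSEC definitions. */
typedef enum { DEV_CPU, DEV_CUDA, DEV_HIP, DEV_LEVEL_ZERO } dev_type_t;

/* Stand-in for the runtime's warning facility. */
void warn_stub(const char *msg)
{
    fprintf(stderr, "W: %s\n", msg);
}

/* Accelerators only: CUDA, HIP and Level Zero, not the CPU device. */
int is_accelerator(dev_type_t t)
{
    return t == DEV_CUDA || t == DEV_HIP || t == DEV_LEVEL_ZERO;
}

/* Fill the caller-provided array out (capacity nb_devices) with the indices
 * of all accelerator devices, in a single pass. Returns how many were found.
 * The function allocates nothing, so the caller owns all memory. */
int find_accelerators(const dev_type_t *types, int nb_devices, int *out)
{
    assert(types != NULL && out != NULL);
    int nb = 0;
    for (int i = 0; i < nb_devices; i++) {
        if (is_accelerator(types[i])) {
            out[nb++] = i;
        }
    }
    if (nb == 0) {
        warn_stub("no accelerator device found");
    }
    return nb;
}
```

Sizing the output array by the total device count means it can never overflow, at the cost of a few unused integers.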

bosilca · 2024-09-30T14:53:06Z

src/dplasmaaux.c

+		dplasma_find_nb_devices(&args->gpu_device_index, &args->nb_gpu_devices);
+
+		/* Calculate the nested grid for the multiple GPUs on one process
+         * gpu_rows >= gpu_cols and as square as possible */


The indentation is off here. Please fix.

bosilca · 2024-09-30T14:58:39Z

tests/testing_zpotrf.c

@@ -18,7 +18,8 @@ int main(int argc, char ** argv)
 {
    parsec_context_t* parsec;
    int iparam[IPARAM_SIZEOF];
-    dplasma_enum_t uplo = dplasmaUpper;


Why do you map the A matrix on lower? You don't need to alter this variable; instead, just call the function with dplasmaLower and add a comment explaining why you chose to do so.

QingleiCao requested a review from a team as a code owner June 7, 2024 13:39

bosilca approved these changes Jun 7, 2024


abouteiller reviewed Jun 14, 2024


QingleiCao force-pushed the qinglei/potrf_gpu_advice branch from eeb82ec to 811e173 Compare June 14, 2024 14:45

abouteiller mentioned this pull request Jun 14, 2024

Provide API to map real device ID to parsec ID and back ICLDisco/parsec#660

Open

QingleiCao force-pushed the qinglei/potrf_gpu_advice branch 2 times, most recently from a4b81a9 to f676e32 Compare June 14, 2024 18:53

QingleiCao force-pushed the qinglei/potrf_gpu_advice branch from f676e32 to a597442 Compare June 14, 2024 19:00

QingleiCao force-pushed the qinglei/potrf_gpu_advice branch 2 times, most recently from c67b4b5 to 6743a5a Compare September 5, 2024 02:03

QingleiCao force-pushed the qinglei/potrf_gpu_advice branch 2 times, most recently from 9bfc9fc to 78f8740 Compare September 27, 2024 12:12

Update PaRSEC

d0c2327

QingleiCao force-pushed the qinglei/potrf_gpu_advice branch 3 times, most recently from e0a77ea to 252d978 Compare September 27, 2024 17:22

Add advice device support in dplasma

726d4ab

QingleiCao force-pushed the qinglei/potrf_gpu_advice branch from 252d978 to 726d4ab Compare September 27, 2024 17:24

QingleiCao requested review from bosilca, abouteiller and therault September 27, 2024 17:28

bosilca reviewed Sep 30, 2024
