From fa15b257515b7b47b81db59f12cb6e5a7c1af828 Mon Sep 17 00:00:00 2001
From: gha3mi <36126354+gha3mi@users.noreply.github.com>
Date: Thu, 14 Dec 2023 18:50:52 +0000
Subject: [PATCH] =?UTF-8?q?Deploying=20to=20gh-pages=20from=20@=20gha3mi/f?=
 =?UTF-8?q?ormatmul@428ee68a2eac5a766e7270514f2b579a4ff2425c=20?=
 =?UTF-8?q?=F0=9F=9A=80?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 index.html | 30 +++++------
 interface/matmul.html | 2 +-
 interface/matmul_blas.html | 2 +-
 interface/matmul_opts.html | 2 +-
 lists/files.html | 2 +-
 lists/modules.html | 2 +-
 lists/procedures.html | 2 +-
 lists/programs.html | 2 +-
 module/formatmul.html | 2 +-
 module/formatmul_benchmark.html | 2 +-
 module/formatmul_opts.html | 2 +-
 proc/compute_block_ranges.html | 2 +-
 proc/gemm_mat_mat_rel.html | 2 +-
 proc/gemv_mat_vec_rel.html | 2 +-
 proc/impure_mm_12.html | 2 +-
 proc/impure_mm_13.html | 2 +-
 proc/mat_mat.html | 2 +-
 proc/mat_vec.html | 2 +-
 proc/matmul_mat_mat_rel_opts.html | 2 +-
 proc/matmul_mat_vec_rel_opts.html | 2 +-
 proc/mm_10.html | 2 +-
 proc/mm_11.html | 2 +-
 proc/mm_12.html | 2 +-
 proc/mm_13.html | 2 +-
 proc/mm_9.html | 2 +-
 proc/mm_mnp.html | 2 +-
 proc/mm_mpn.html | 2 +-
 proc/mm_nmp.html | 2 +-
 proc/mm_npm.html | 2 +-
 proc/mm_pmn.html | 2 +-
 proc/mm_pnm.html | 2 +-
 proc/mv_3.html | 2 +-
 proc/mv_4.html | 2 +-
 proc/mv_5.html | 2 +-
 proc/mv_6.html | 2 +-
 proc/start_benchmark.html | 2 +-
 proc/stop_benchmark.html | 2 +-
 proc/write_benchmark.html | 2 +-
 proc/write_benchmark~2.html | 2 +-
 program/benchmark1.html | 42 +++++++--------
 program/benchmark2.html | 38 +++++++-------
 program/benchmark3.html | 40 +++++++--------
 program/benchmark4.html | 68 ++++++++++++-------------
 program/example1.html | 32 ++++++------
 program/example2.html | 32 ++++++------
 program/example3.html | 28 +++++-----
 program/example4.html | 28 +++++-----
 program/example5.html | 32 ++++++------
 program/example6.html | 32 ++++++------
 program/example7.html | 28 +++++-----
 program/example8.html | 28 +++++-----
 program/mat_mat.html | 32 ++++++------
 program/mat_vec.html | 28 +++++-----
 search.html | 2 +-
 sourcefile/benchmark1.f90.html | 2 +-
 sourcefile/benchmark2.f90.html | 2 +-
 sourcefile/benchmark3.f90.html | 4 +-
 sourcefile/benchmark4.f90.html | 2 +-
 sourcefile/example1.f90.html | 2 +-
 sourcefile/example2.f90.html | 2 +-
 sourcefile/example3.f90.html | 2 +-
 sourcefile/example4.f90.html | 2 +-
 sourcefile/example5.f90.html | 2 +-
 sourcefile/example6.f90.html | 2 +-
 sourcefile/example7.f90.html | 2 +-
 sourcefile/example8.f90.html | 2 +-
 sourcefile/formatmul.f90.html | 2 +-
 sourcefile/formatmul_benchmark.f90.html | 2 +-
 sourcefile/formatmul_opts.f90.html | 2 +-
 sourcefile/test1.f90.html | 2 +-
 sourcefile/test2.f90.html | 2 +-
 src/benchmark3.f90 | 2 +-
 tipuesearch/tipuesearch_content.js | 2 +-
 73 files changed, 316 insertions(+), 320 deletions(-)

diff --git a/index.html b/index.html index ab76578..be56259 100644 --- a/index.html +++ b/index.html @@ -76,7 +76,7 @@

ForMatmul

License Build

ForMatmul

-

ForMatmul: A Fortran library that overloads the matmul function to enable efficient matrix multiplication with coarray.

+

ForMatmul: A Fortran library that overloads the matmul function to enable efficient matrix multiplication with/without coarray.

Usage

use formatmul
 
@@ -153,42 +153,38 @@ 

How to run tests and examples

Benchmark:

To set the stack size to unlimited, use the following command: ulimit -s unlimited.

Intel Fortran Compiler (ifort)

-
fpm run --example benchmark3 --compiler ifort --flag "-Ofast -xHost -qopenmp -qmkl -coarray -coarray-num-images=4 -DUSE_COARRAY"
+
fpm run --example benchmark3 --compiler ifort --flag "-Ofast -mtune=native -xHost -qmkl -qopenmp -ipo -coarray -coarray-num-images=4 -DUSE_COARRAY"
 

Intel Fortran Compiler (ifx)

-
fpm run --example benchmark3 --compiler ifx --flag "-Ofast -xHost -qopenmp -qmkl -coarray -coarray-num-images=4 -DUSE_COARRAY"
+
fpm run --example --all --compiler ifx --flag "-Ofast -mtune=native -xHost -qmkl -qopenmp -coarray -coarray-num-images=4 -DUSE_COARRAY"
 

You can then use the provided Python script to generate visual plots for the benchmark3 data:

python benchmark/benchmark3_co.py
 
-

Results obtained on an Intel(R) Core(TM) i9-9980HK CPU @ 2.40GHz using ifort (IFORT) 2021.10.0 20230609 are as follows:

+

Results obtained on an Intel(R) Core(TM) i9-9980HK CPU @ 2.40GHz using ifort (IFORT) 2021.11.0 20231010 are as follows:

  • with -coarray-num-images=4, MKL_NUM_THREADS=1 and OMP_NUM_THREADS=1:
-

ForMatmul

+

ForMatmul +ForMatmul

  • with -coarray-num-images=4 and Multithread:
-

ForMatmul

+

ForMatmul +ForMatmul

  • with -coarray-num-images=5, MKL_NUM_THREADS=1 and OMP_NUM_THREADS=1:
-

ForMatmul

+

ForMatmul +ForMatmul

  • with -coarray-num-images=5 and Multithread:
-

ForMatmul

-
    -
  • with -coarray-num-images=6, MKL_NUM_THREADS=1 and OMP_NUM_THREADS=1:
  • -
-

ForMatmul

-
    -
  • with -coarray-num-images=6 and Multithread:
  • -
-

ForMatmul

+

ForMatmul +ForMatmul

API documentation

The most up-to-date API documentation for the master branch is available here. @@ -262,7 +258,7 @@

Procedures

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/interface/matmul.html b/interface/matmul.html index 0578680..2661a1c 100644 --- a/interface/matmul.html +++ b/interface/matmul.html @@ -951,7 +951,7 @@

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/interface/matmul_blas.html b/interface/matmul_blas.html index 157ccc1..423085a 100644 --- a/interface/matmul_blas.html +++ b/interface/matmul_blas.html @@ -690,7 +690,7 @@

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/interface/matmul_opts.html b/interface/matmul_opts.html index 281cbc6..e18cc8b 100644 --- a/interface/matmul_opts.html +++ b/interface/matmul_opts.html @@ -901,7 +901,7 @@

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/lists/files.html b/lists/files.html index 840132f..7e58883 100644 --- a/lists/files.html +++ b/lists/files.html @@ -426,7 +426,7 @@

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/lists/modules.html b/lists/modules.html index 760bd7d..7a7ae50 100644 --- a/lists/modules.html +++ b/lists/modules.html @@ -683,7 +683,7 @@

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/lists/procedures.html b/lists/procedures.html index ba4560a..625a0f1 100644 --- a/lists/procedures.html +++ b/lists/procedures.html @@ -1079,7 +1079,7 @@

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/lists/programs.html b/lists/programs.html index fa9f731..5229b9d 100644 --- a/lists/programs.html +++ b/lists/programs.html @@ -94,7 +94,7 @@

Programs

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/module/formatmul.html b/module/formatmul.html index f968802..e4858f4 100644 --- a/module/formatmul.html +++ b/module/formatmul.html @@ -1101,7 +1101,7 @@

Arguments

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/module/formatmul_benchmark.html b/module/formatmul_benchmark.html index 63db5d8..1db0783 100644 --- a/module/formatmul_benchmark.html +++ b/module/formatmul_benchmark.html @@ -1029,7 +1029,7 @@

Arguments

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/module/formatmul_opts.html b/module/formatmul_opts.html index 5a297ee..d9f26a8 100644 --- a/module/formatmul_opts.html +++ b/module/formatmul_opts.html @@ -2949,7 +2949,7 @@

Arguments

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/compute_block_ranges.html b/proc/compute_block_ranges.html index 6be49fe..830b1c0 100644 --- a/proc/compute_block_ranges.html +++ b/proc/compute_block_ranges.html @@ -440,7 +440,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/gemm_mat_mat_rel.html b/proc/gemm_mat_mat_rel.html index 199dfcb..561f425 100644 --- a/proc/gemm_mat_mat_rel.html +++ b/proc/gemm_mat_mat_rel.html @@ -551,7 +551,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/gemv_mat_vec_rel.html b/proc/gemv_mat_vec_rel.html index e89bd89..cca9074 100644 --- a/proc/gemv_mat_vec_rel.html +++ b/proc/gemv_mat_vec_rel.html @@ -548,7 +548,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/impure_mm_12.html b/proc/impure_mm_12.html index 752b2a0..d61f146 100644 --- a/proc/impure_mm_12.html +++ b/proc/impure_mm_12.html @@ -403,7 +403,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/impure_mm_13.html b/proc/impure_mm_13.html index 75c3729..3351000 100644 --- a/proc/impure_mm_13.html +++ b/proc/impure_mm_13.html @@ -403,7 +403,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mat_mat.html b/proc/mat_mat.html index 837211f..a908b1d 100644 --- a/proc/mat_mat.html +++ b/proc/mat_mat.html @@ -987,7 +987,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mat_vec.html b/proc/mat_vec.html index a4f4dc1..6d161af 100644 --- a/proc/mat_vec.html +++ b/proc/mat_vec.html @@ -932,7 +932,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/matmul_mat_mat_rel_opts.html b/proc/matmul_mat_mat_rel_opts.html index dd09e68..d928123 100644 --- a/proc/matmul_mat_mat_rel_opts.html +++ b/proc/matmul_mat_mat_rel_opts.html @@ -791,7 +791,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/matmul_mat_vec_rel_opts.html b/proc/matmul_mat_vec_rel_opts.html index 895cddb..1aa25f3 100644 --- a/proc/matmul_mat_vec_rel_opts.html +++ b/proc/matmul_mat_vec_rel_opts.html @@ -654,7 +654,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_10.html b/proc/mm_10.html index dd03c98..6ec3a26 100644 --- a/proc/mm_10.html +++ b/proc/mm_10.html @@ -601,7 +601,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_11.html b/proc/mm_11.html index 67471e2..f3b9ada 100644 --- a/proc/mm_11.html +++ b/proc/mm_11.html @@ -603,7 +603,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_12.html b/proc/mm_12.html index 8225fe9..9569a5f 100644 --- a/proc/mm_12.html +++ b/proc/mm_12.html @@ -488,7 +488,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_13.html b/proc/mm_13.html index 42bf1d5..cec8453 100644 --- a/proc/mm_13.html +++ b/proc/mm_13.html @@ -489,7 +489,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_9.html b/proc/mm_9.html index bf83ef3..7ab1ce9 100644 --- a/proc/mm_9.html +++ b/proc/mm_9.html @@ -484,7 +484,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_mnp.html b/proc/mm_mnp.html index 1785466..4a0c79c 100644 --- a/proc/mm_mnp.html +++ b/proc/mm_mnp.html @@ -486,7 +486,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_mpn.html b/proc/mm_mpn.html index 316f9ba..11ded57 100644 --- a/proc/mm_mpn.html +++ b/proc/mm_mpn.html @@ -486,7 +486,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_nmp.html b/proc/mm_nmp.html index 7281225..019c4d4 100644 --- a/proc/mm_nmp.html +++ b/proc/mm_nmp.html @@ -486,7 +486,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_npm.html b/proc/mm_npm.html index 30b4bd5..39b9f7a 100644 --- a/proc/mm_npm.html +++ b/proc/mm_npm.html @@ -486,7 +486,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_pmn.html b/proc/mm_pmn.html index 4c7ddda..fd78e3e 100644 --- a/proc/mm_pmn.html +++ b/proc/mm_pmn.html @@ -486,7 +486,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mm_pnm.html b/proc/mm_pnm.html index f58f3a8..a82085e 100644 --- a/proc/mm_pnm.html +++ b/proc/mm_pnm.html @@ -486,7 +486,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mv_3.html b/proc/mv_3.html index cc325cc..e1fe257 100644 --- a/proc/mv_3.html +++ b/proc/mv_3.html @@ -469,7 +469,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mv_4.html b/proc/mv_4.html index dbcd451..c98bd1a 100644 --- a/proc/mv_4.html +++ b/proc/mv_4.html @@ -469,7 +469,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mv_5.html b/proc/mv_5.html index 6b3997a..db1bbd3 100644 --- a/proc/mv_5.html +++ b/proc/mv_5.html @@ -467,7 +467,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/mv_6.html b/proc/mv_6.html index 63d30e2..47b88e3 100644 --- a/proc/mv_6.html +++ b/proc/mv_6.html @@ -467,7 +467,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/start_benchmark.html b/proc/start_benchmark.html index 9fce726..ac3087f 100644 --- a/proc/start_benchmark.html +++ b/proc/start_benchmark.html @@ -535,7 +535,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/stop_benchmark.html b/proc/stop_benchmark.html index 02c78c8..a1246dc 100644 --- a/proc/stop_benchmark.html +++ b/proc/stop_benchmark.html @@ -647,7 +647,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/write_benchmark.html b/proc/write_benchmark.html index befca1d..60c7785 100644 --- a/proc/write_benchmark.html +++ b/proc/write_benchmark.html @@ -477,7 +477,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/proc/write_benchmark~2.html b/proc/write_benchmark~2.html index 1c1e2fb..1974ab7 100644 --- a/proc/write_benchmark~2.html +++ b/proc/write_benchmark~2.html @@ -356,7 +356,7 @@

Source Code

Documentation generated by FORD - on 2023-12-14T18:07:25.398471

+ on 2023-12-14T18:50:46.650924


diff --git a/program/benchmark1.html b/program/benchmark1.html index e2ff58d..770592c 100644 --- a/program/benchmark1.html +++ b/program/benchmark1.html @@ -109,16 +109,16 @@

Variables

- A - B - C - t - m - n - o - i + A + B + C + t + m + n + o + i l - im + im
@@ -160,10 +160,10 @@

Uses