-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathcpe-22.08-sles15-sp3-ReleaseNotes.txt
1830 lines (1362 loc) · 64.4 KB
/
cpe-22.08-sles15-sp3-ReleaseNotes.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
HPE Cray Programming Environment 22.08 Release Notes
====================================================
Abnormal Termination Processing (ATP) 3.14.13:
================================================
Release Date:
--------------
July 2022
Product Description:
--------------------
Abnormal Termination Processing (ATP) is a system that
monitors user applications, and should an application
take a system trap, ATP preforms analysis on the dying
application. All of the stack backtraces of the
application processes are gathered into a merged
stack backtrace tree and written to disk as the file
"atpMergedBT.dot". The stack backtrace for the first process
to die is sent to stderr as is the number of the signal that
caused the death. If the core file size limit (RLIMIT_CORE)
is non-zero, a heuristically selected set of processes dump
their core.
atpMergedBT.dot can be viewed with 'statview', a component
of the STAT package (module load stat). The merged stack
backtrace tree provides a concise, yet comprehensive, view
of what the application was doing at the time of its death.
Changelog:
---------
=======================================================================
## [3.14.13] - 2022-07-15 (22.08)
=======================================================================
### Bug Fixes
* PE-41158: Add additional checks to signal handler / backend communication
### Features
* PE-41728: Implement node disk space and custom runtime checks
* Frontend will disable corefiles if failed to ship gcore
* Add Flux launch plugin and build script
* PE-40384: Set ZMQNet as default network, switch to MRNet with ATP_PROTO
* PE-41330: Use llvm-addr2line to resolve missing line numbers
* Added script to reload spank plugin
* Added support for rhel86 - PE-41140 #259
=======================================================================
## [3.14.12] - 2022-05-06 (22.06)
=======================================================================
### Features
* Remove minidumps generated by user trace API after successful parse
* Added support for sles15sp4 x86 - PE-409893 #256
* Added autogen changelog/release notes functionality - PE-40699 #253
=======================================================================
## [3.14.11] - 2022-04-14 (22.05)
=======================================================================
### Features
* Update release notes for PE 22.05
* Added support for rhel85 - PE-40102
=======================================================================
## [3.14.10] - 2022-03-21 (22.04)
=======================================================================
* Fix double free in HPCM PALS startup plugin
=======================================================================
## [3.14.9] - 2022-01-21 (22.02)
=======================================================================
* Regarding gen compilers cray-gcc-10.3.0
NOTE: If you encounter the following error:
ImportError: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.26' not found
Temporary workaround:
1) export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/gcc/10.3.0/snos/lib64
or
2) module load gcc/10.3.0
=======================================================================
## [3.14.8] - 2021-12-01 (21.12)
=======================================================================
* Updated gen compilers to cray-gcc-10.3.0
=======================================================================
## [3.14.7] - 2021-10-12 (21.11)
=======================================================================
* Fixes a potential hang or assertion failure when
analyzing a job using the Nvidia runtime with CUDA-GDB
=======================================================================
## [3.14.6] - 2021-09-14 (21.10)
=======================================================================
* Initial support for GPU kernel debugging on Nvidia GPUs using CUDA-GDB.
Product and OS Dependencies:
-----------------------------
The ATP 3.14.13 release is supported on the following Cray systems:
- HPE/Cray XC systems with CLE release 7.0 or later
- HPE/Cray CS systems with CentOS/RH 8.4 or later
- HPE/Cray Shasta systems with SLES 15 service pack 2 or later
- HPE Apollo systems with Slurm or PBS Pro, and RH 8.4 or later
- Baymax systems with RH 8.4 or later
Product Dependencies:
- Cray cray-cti rpm installed
- Cray cray-cdst-support rpm installed
Documentation:
---------------
Type `man atp` with the atp module loaded to read the atp(1) man page.
Installation instructions:
--------------------------
Installation instructions for .sles15sp3.x86_64:
=============================================
rpm -ivh atp-3.14.13-20220715201533_9394be8a-2.sles15sp3.x86_64.rpm
To make ATP 3.14.13 the default version of ATP, execute:
/opt/cray/pe/admin-pe/set_default_files/set_default_atp_3.14.13
Certain components, files or programs contained within this package or product are
Copyright 2010-2022 Hewlett Packard Enterprise Development LP.
CCDB 4.12.13:
==================
Release Date:
--------------
July 2022
Product Description:
--------------------
CCDB, Cray's next generation debugging tool extends the comparative
debugging capabilities of gdb4hpc with a Graphical User Interface (GUI)
enabling programmers to compare corresponding data structures
between two executing applications. Comparative debugging assists users with
locating sections of code containing data deviations introduced by algorithm
changes, compiler differences, and porting to new architectures/libraries.
Some features of ccdb include:
* Side-by-side debugging session execution of two parallel applications
* Automatic creation of comparison statements for all local variables in
scope
* Type templates for structured data types to selectively compare members
* Warning/error epsilon tolerance values for floating point comparison
* PBS PRO, MOAB/TORQUE, and SLURM workload manager support
Changelog:
---------
=======================================================================
## [4.12.12] - 2022-05-19 (22.06)
=======================================================================
### Features
* Added support for rhel86 - PE-41140 #139
* Added support for sles15sp4 x86 - PE-40983 #137
* Added autogen changelog/release notes functionality - PE-39302
=======================================================================
## [4.12.11] - 2022-04-14 (22.05)
=======================================================================
### Features
* Added support for rhel85 - PE-40102
=======================================================================
## [4.12.10] - 2022-03-21 (22.04)
=======================================================================
### Bug Fixes
* Set sourc-on-step off at launch
* Indented continuations
* Fixed comment
* Moved conditional
* Removed atoll
* Segfault caused by missing argument
* Fixed indentation
* Updated to speed up install testing
* Added req exclude of libstdc++
* Fixed comment
* Fixed border width
* Added borders around buttons and entries
### Features
* Ccdb keyboard updates
* Added dynamic copyright date
=======================================================================
## [4.12.9] - 2022-02-10 (22.03)
=======================================================================
* Fixed ccdb segfault on startup
* Fixed epsilon values not changing correctly in the Assert and Compare windows
* Made borders of buttons and entries black for more contrast
* Added decomposition dropdown menus for vectors in Compare window
=======================================================================
## [4.12.8] - 2022-01-20 (22.02)
=======================================================================
* Added missing dependancies to runBuildPrep.sh
* Updated popup menu so "Run Both Apps" is available only if the sources are the same or linked
* Updated ccdb to reference files by full paths
* Fixed issue with setting and clearing breakpoints
* Added visual cues to convey color changes where needed
* Added keyboard access to popup menu
* Regarding gen compilers cray-gcc-10.3.0
If you encounter the following error:
ImportError: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.26' not found
Temporary workaround:
1) export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/gcc/10.3.0/snos/lib64
or
2) module load gcc/10.3.0
=======================================================================
## [4.12.7] - 2021-12-01 (21.12)
=======================================================================
* PE-38153 CCDB not working properly from a VNC.
* Fix focus eyeball when running from an alternate directory.
* Updated build compilers to cray-gcc-10.3.0
=======================================================================
## [4.12.6] - 2021-09-17 (21.11)
=======================================================================
* Added CRAY_CCDB_PREFIX and CRAY_CCDB_VERSION environment variables to modulefiles.
Product and OS Dependencies:
-----------------------------
The CCDB is supported on the following Cray systems.
- HPE/Cray XC systems with CLE 7.0 or later
- HPE/Cray CS systems with RH 8.4 or later
- HPE/Cray Shasta systems with SLES 15 service pack 2 or later
- HPE Apollo systems with Slurm and RH 8.4 or later
- Baymax systems with RH 8.4 or later
Product Dependencies:
- gdb4hpc/4.14.x or later
- Cray cray-cti rpm installed
- Cray cray-cdst-support rpm installed
Documentation:
---------------
Type `man ccdb` with the cray-ccdb module loaded to read the
ccdb(1) man page.
Documentation is available through CCDB from the 'Help'
menu.
Installation instructions:
--------------------------
Installation instructions for .sles15sp3.x86_64:
=============================================
rpm -ivh --oldpackage cray-ccdb-4.12.13-20220715191607_8acd349-1.sles15sp3.x86_64.rpm
To make cray-ccdb 4.12.13 the default version of cray-ccdb, execute:
/opt/cray/pe/admin-pe/set_default_files/set_default_cray-ccdb_4.12.13
Certain components, files or programs contained within this package or product are
Copyright 2007-2022 Hewlett Packard Enterprise Development LP.
HPE Cray Compiling Environment (CCE) 14.0.2
==============================================
Release Date:
-------------
August 2022
Purpose:
--------
CCE 14.0.2 provides Fortran, C, and C++ compilers for Cray XC and CS systems,
HPE Cray supercomputer systems, HPE Apollo 2000 Gen10Plus systems, and HPE Apollo 80 systems.
Key Changes and Support with CCE 14.0.2:
-------------------------------------------
CCE 14.0.2:
- Bug Fixes
- AMD ROCm 5.2 support
- Support for generating native GPU floating-point atomic add instructions for "omp atomic"
constructs
- Support for the "-m[no-]unsafe-fp-atomics" compiler flag (C, C++, and Fortran) for
controlling native floating-point atomic generation on AMD MI250X GPUs
------------
CCE 14.0.1:
- Bug Fixes
------------
CCE 14.0.0:
- LLVM 14 base
- C++17 support
- Note: libstdc++ (and associated C++ library headers) updated from GCC 8.1.0
to 10.3.0. Recommend C++ objects be recompiled to avoid ABI incompatibilities.
- SVE support (beta)
- AMD ROCm 5.0 and 5.1 support
- Tuned Fortran AMDGPU loop unrolling heuristic
- Optimized OpenMP "device scope" atomics/fences
- Complete support for AddressSanitizer, ThreadSanitizer (incl. Archer) for
C/C++ programs (CPU only), enabled using the command line option
-fsanitize=address|thread
- Preliminary support for AddressSanitizer, ThreadSanitizer (incl. Archer)
for Fortran programs (CPU only), enabled using the command line option
-fsanitize=address|thread
Note that:
- The option -g is required for AddressSanitizer
- COMMON variables are not supported by AddressSanitizer currently
- Optimized OpenMP offload kernel launch performance
- CCE supports partial OpenMP 5.0. The following OpenMP 5.0 features
have been added in this release for Fortran, C, and C++, unless noted otherwise.
- task reductions (Fortran)
- simd if (Fortran)
- "requires unified_shared_memory" for AMD MI250X (Fortran)
- CCE supports partial OpenACC 2.6 and 3.0. The following OpenACC features
have been added in this release for Fortran.
- "if_present" clause on "update"
- "acc_attach" and "acc_attach_async" APIs
- "async" and "wait" on "acc data"
- "if" clause on "acc wait"
User and Application Impact:
----------------------------
C++ applications built using CCE 13 or earlier may need to be recompiled due to the significant
changes that were necessary to implement C++17. This is expected to be a one-time requirement.
Some non-standard Cray Fortran extensions supporting shorthand notation for logical operations
will be removed in a future release. CCE 14 will issue warning messages when these are
encountered, providing time to adapt the application to use standard Fortran.
Issues or Bugs Resolved:
------------------------
CCE 14.0.2:
-----------
CAST-30368 cce 14.0.1 Fortran crashes with current working directory above a certain length
CAST-30181 A compiler regression in Cray Fortran BIND(C) has been found in cce/14
CAST-30038 OLCFDEV-935 - CCE 14 Fortran OpenMP Metadirective Bogus Error Message Statements Between Directive and DO
CAST-29999 (OLCFDEV - 838) CCE Fortran OpenMP Metadirective with Collapse clause Issue
CAST-29974 Cray compiler support for scalar variables in target region
CAST-29916 Error in `/opt/cray/pe/cce/12.0.3/cce/x86_64/bin/ftnfe': munmap_chunk(): invalid pointer: 0x00007f6cf8240010
CAST-29889 Navy narwhal: Users fortran code causes CCE to segfault - Elevation of case 5363548947
CAST-29706 CCE - ENDASSOCIATE not accepted in FREE FORMAT.
CAST-29705 CCE - ENDENUM (without space) is not accepted in FREE format. This is legal
CAST-26211 ICE "Invalid expression type" pdgcs/v_acc_util.c, line 3240
CAST-24430 CCE Fortran MAXLOC intrinsic is slow compared to Intel
CAST-24233 crayftn internal compiler error on allocate statement with invalid syntax
CCE 14.0.1:
-----------
CAST-29778 FTN Compiler seems not to support a valid Fortran construct in an OpenMP offload context ..
CAST-28223 OLCFDEV-378: CCE Fortran Bogus Error Message with Stop statement in OpenMP Declare Target
CAST-22748 Cray Fortran allows RECL specifier on OPEN with ACCESS=STREAM
CAST-29303 (OLCFDEV-643) CCE+Fortran+OpenMP Compiler Error
CAST-26853 (OLCFDEV-209) Linking OpenMP offload with global const variable gives "symbol multiply defined" with Cray compiler
CAST-27129 (OLCFDEV-276) CCE Fortran Link Error with OpenMP Offload (gtsim)
CAST-28065 CRAY_ACC_ERROR - cuStreamSynchronize returned CUDA_ERROR_ILLEGAL_ADDRESS for COSMO code
CAST-28156 Different behavior of Fortran RESHAPE with cce vs GNU or Intel
CAST-25161 CCE Fortran generates 2 stores for conditional update of array in vector loop (rome)
CAST-22727 Cray Fortran does not return error from RESHAPE when SOURCE is short
CCE 14.0.0:
-----------
CAST-28909 Excessively long Fortran compile times
CAST-28805 Navy-narwhal: CCE Fortran ICE (inline issue) when trying to compile WRFDM
CAST-29354 Cray Fortran compiler is_contiguous function returns incorrect result
CAST-29249 Cray Fortran compiler hangs with O2 optimization
CAST-29004 CCE v13.x fails to build ncurses via Spack
CAST-28920 Excessively long Fortran compile times
CAST-29124 Problem with inlining routine with an OpenACC kernel
CAST-25673 Curious behavior with negative stop-code using CCE Fortran 10.0.3
CAST-25678 RFE: Add compiler option -fopenmp-simd
CAST-27262 CCE does not support std::reduce
CAST-25896 Using upc_memput on a single node is very slow
CAST-27066 Variable is used before it is defined warning on loop index variables
CAST-27636 CCE Fortran OpenMP Offload Array Reduction Gives Linking Error
CAST-27692 CCE Fortran OpenMP offload Error Declare Target Private Var
CAST-27188 CCE Compiler ICE with OpenMP offloading array reduction
CAST-26788 FTN OPTCG goes into an infinite loop trying to compile a file
CAST-28854 cce compile error for Kokkos app
CAST-28621 CCE Fortran OpenMP Wrong Results with Metadirective User Selector in Default Clause
CAST-29502 Segfault and CRAY_ACC_ERROR in UMT OMP Target "map to" clause
CAST-29105 CCE Fortran: Answer wrong when compiler optimization enabled
CAST-25161 CCE Fortran generates 2 stores for a conditional update of the array in vector loop (rome)
CAST-28065 CRAY_ACC_ERROR - cuStreamSynchronize returned CUDA_ERROR_ILLEGAL_ADDRESS for COSMO code
CAST-23376 Would like a vector version of certain Fortran intrinsics, particularly for ARM SVE systems
CAST-22727 Cray Fortran does not return an error from RESHAPE when SOURCE is short
Product and OS Dependencies:
----------------------------
This CCE release is supported on
- Cray XC systems running CLE 7.0 UP03
- Cray CS systems running RedHat 8
- HPE Cray supercomputer systems running CSM with COS 2.1.X (SLES15 SP2) or COS 2.2.X (SLES15 SP3)
- HPE Cray supercomputer systems running HPCM with SLES15 SP2 or SLES15 SP3
- HPE Cray supercomputer systems running HPCM with RHEL 8.4
- HPE Apollo 2000 Gen10Plus systems running RedHat 8
- HPE Apollo 80 systems running RedHat 8
AMD GPU offloading support requires at least ROCm 5.0
Notes and Limitations:
----------------------
None
Documentation:
--------------
- S-5212 Cray Compiling Environment Release Overview (13.0)
- S-3901 Cray Fortran Reference Manual
- S-2179 Cray C and C++ Quick Reference
- Basic man pages: crayftn(1), craycc(1), crayCC(1), intro_openmp()
- Please see https://clang.llvm.org/docs/UsersManual.html or use the -help command line
option for more information on using Clang
Modulefile:
-----------
The following will load the modules necessary to use CCE:
module load PrgEnv-cray
The following will switch to x.y.z version of CCE:
module swap cce cce/x.y.z
Installation instructions:
--------------------------
rpm -ivh cce-14.0.2-202207182110.333f76287fada-5.sles15_1.x86_64.rpm.x86_64.rpm
The following script will set CCE version 14.0.2 default:
/opt/cray/pe/admin-pe/set_default_files/set_default_cce_14.0.2
License:
--------
Except for the third party components and software licensed by HPE through proprietary
agreements, components, files or programs contained within this package or product are
Copyright -2022 Hewlett Packard Enterprise Development LP.
Attribution notices for open source licensed software for this package aredetailed
in the file:
/opt/cray/pe/cce/14.0.1/ATTRIBUTIONS_14.0.txt
CrayPE 2.7.17
==============
Release Date:
--------------
April 2022
Purpose:
--------
Add support for mixed Programming Environments PrgEnv-cray-amd and
PrgEnv-gnu-amd, to support using the AMD ROCm C/C++ compiler with
the Cray CCE and GNU Fortran compilers, respectively.
Bugs fixed in this release:
---------------------------
- CAST-29420 - Cray Wrappers optimized for Zen2 and not Zen3
Dependencies:
---------------------------------------------
The CrayPE 2.7.17 release is dependent on .pc files in the following
software products:
ATP 1.6.3 or later
FFTW 3.3.0.4 or later
FFTW 2.1.5.6 or later
Global Arrays 5.1.0.2 or later
HDF5 1.8.11 or later
iobuf 2.0.5 or later
LibSci 12.1.01 or later
MPT 6.0.2 or later
NetCDF 4.3.0 or later
Parallel-NetCDF 1.3.1.1 or later
PMI 4.0.1 or later
PETSc 3.4.2.0 or later
Trilinos 11.4.1.0 or later
TPSL 1.3.04 or later
TotalView 8.12-totalview-support-1.1.5 or later
Documentation:
---------------
See manpages for cc, CC, ftn, intro_craype-api, intro_hugepages and pkg-config
See section 2.6 Using Targeting Modules of the Cray Programming Environment
User's Guide (S-2529-116)
See http://www.freedesktop.org/wiki/Software/pkg-config for a pkg-config
introduction.
Installation instructions:
--------------------------
rpm -ivh craype-2.7.17-202207011251.f5fcf606abe58-1.sles15sp3.x86_64.rpm
To make this the default version, execute:
/opt/cray/pe/admin-pe/set_default_files/set_default_craype_2.7.17
Product description:
--------------------
CrayPE contains drivers, cc, CC, and ftn to compile for the CCE, PGI, GNU,
and Intel Programming Environments.
The craype-* targeting modules are also provided in this product.
Certain components, files or programs contained within this package or product
are Copyright 2013-2022 Hewlett Packard Enterprise Development LP.
Cray PE DL Plugin 22.08.1:
Release Date:
-------------
August 2022
Purpose:
--------
The following changes have been made since Cray PE DL Plugin 22.06.1.2:
o PyTorch Apex Amp Optimization Support
Supported Configurations:
-------------------------
CS: OpenMPI 4.0.2 and cray-mvapich 2.3.2 or newer with CPU and GPU support
XC: CLE 7 CPU and GPU support
EX: CPU and GPU support
Apollo: CPU-only support
The following Deep Learning frameworks and versions are supported:
* TensorFlow v2.6
* PyTorch v1.10
Documentation:
--------------
For more information see the intro_dl_plugin man pages.
Known Issues:
-------------
* Using Tensorflow 1.14 binaries distributed by Google and Intel require portions of the DL
Plugin to be compiled with gcc 4.8. If using the included source distribution of the
DL Plugin to install the Python packagesinto a given Python installation, gcc 4.8
will need to be used to install said packages, assuming Google and Intel distributed
Tensorflow 1.14 binaries are used. For TensorFlow 1.15 and greater, use of gcc 7.x
is required instead.
Sample Installation instructions:
rpm -ivh craype-dl-plugin-py3-22.08.1*.x86_64.rpm
The "*" in the install command represents date and hash specific information.
Modulefile:
-----------
XC/EX:
module load craype-dl-plugin-py3/22.08.1
CS:
module load craype-dl-plugin-py3/openmpi/21.04.1
module load craype-dl-plugin-py3/mvapich/21.04.1
Certain components, files or programs contained within this package or
product are Copyright 2017-2021 Hewlett Packard Enterprise Development LP.
Cray Common Tools Interface (CTI) 2.15.13:
======================================
Release Date:
--------------
July 2022
Product Description:
--------------------
Cray Common Tools Interface (CTI) is a library that abstracts debugger
tool support to a common interface regardless of the underlying work
load manager. It facilitates the staging of files and the launch of
tool daemons on the compute nodes associated with a parallel job.
Options and interfaces can be found in the cti(1) and cti(3) manpages.
Changelog:
---------
=======================================================================
## [2.15.13] - 2022-07-07 (22.08)
=======================================================================
### Bug Fixes
* Capture srun stderr output during launch
* Default to HSN interface for HPCM PALS and HPCM Slurm
* Copy environment to back end in generic/ssh implementation
### Features
* Add Slurm multi-cluster / allocation detection
* Switch to HPCM PALS highspeed network
* Added support for rhel86 x86 - PE-41140
=======================================================================
## [2.15.12] - 2022-05-06 (22.06)
=======================================================================
### Bug Fixes
* Resolve ordering issues with HPCM PALS backend
* Ending main loop from signal handler in daemon will also end in-progress MPIR launch
### Features
* Added support for sles15sp4 x86 - PE-39146 #423
* Added autogen changelog/release notes functionality - PE-40699 #421
=======================================================================
## [2.15.11] - 2022-04-14 (22.05)
=======================================================================
### Bug Fixes
* PALS implementation's getApid function - PE-40533
* Re-add CTI manpages to RPM
### Features
* Update release notes
* Added sles15sp1 aarch64 jenkinsfile - PE-36379 #418
* Added support for rhel85 x86 - PE-40102
=======================================================================
## [2.15.10] - 2022-03-21 (22.04)
=======================================================================
* Update Shasta PALS implementation to use PALS utilities and deprecate
the previous Shasta PALS API
* Update the HPCM PALS implementations to use PALS utilities
This release fixes the following bugs:
* Fix the usage of the `aprun` launcher with HPCM PALS instead of
the default `mpiexec` launcher
* Fix HPCM PALS backend startup when node hostnames are in xname format
=======================================================================
## [2.15.9] - 2022-01-21 (22.02)
=======================================================================
* Default Shasta PALS node count in job launches to 1
* Include `--overlap` in Slurm tool daemon launches for Slurm versions
20.11 and above
=======================================================================
## [2.15.8] - 2021-12-01 (21.12)
=======================================================================
* Added user documentation manpage cti(1), and developer
documentation manpage cti(3).
* Preliminary support for the Flux workload manager.
Note that passwordless SSH access to compute nodes must be
configured for Flux support. This limitation will be
removed in a future release.
* Flux, Shasta PALS, and ALPS support can be disabled at
compile-time if development headers are not available.
* Updated gen compilers to cray-gcc-10.3.0
=======================================================================
## [2.15.7] - 2021-10-21 (21.11)
=======================================================================
* Fix potential deadlock during job launch / attach for HPCM PALS systems,
as well as the generic SSH interface
* Fix Slurm job attach when running on compute node where hostname does
not match the Slurm node name
* Add environment variable CTI_SLURM_DAEMON_GRES to set the --gres
for Slurm daemon launches. For Slurm versions 21.08 and newer, this
should be set to an empty string (see SchedMD bug)
https://bugs.schedmd.com/show_bug.cgi?id=12642
Product and OS Dependencies:
-----------------------------
The Cray CTI 2.15.13 release is supported on the following Cray systems:
- HPE/Cray XC systems with CLE 7.0 or later
- HPE/Cray CS systems with CentOS/RH 8.4 or later
- HPE/Cray Shasta systems with SLES 15 service pack 2 or later
- HPE Apollo systems with Slurm and RH 8.4 or later
- Baymax systems with RH 8.4 or later
Product Dependencies:
- Cray cray-cdst-support rpm installed
Documentation:
---------------
Currently the only documentation available is found in
common_tools_fe.h, common_tools_be.h, and the tests in the 'tests'
directory.
Installation instructions:
--------------------------
Installation instructions for .sles15sp3.x86_64:
=============================================
rpm -ivh cray-cti-2.15.13-2.sles15sp3.x86_64.rpm
To make cti 2.15.13 the default version of cray-cti, execute:
/opt/cray/admin-pe/set_default_files/set_default_cray-cti-2.15.13
Certain components, files or programs contained within this package or
product are Copyright 2010-2022 Hewlett Packard Enterprise Development LP.
cray-dyninst 12.1.1
=============
Release Date:
-------------
May 2022
Product Description:
--------------------
cray-dyninst provides tools for binary instrumentation, analysis, and modification.
Changelog:
---------
=======================================================================
## [12.1.1] - 2022-05-10 (22.06)
=======================================================================
### Features
* Added support for rhel86 - PE-41140 #18
* Added support for sles15sp4 x86 - PE-40983 #16
* Added autogen changelog/release notes functionality - PE-40699
=======================================================================
## [12.1.0] - 2022-04-14 (22.05)
=======================================================================
### Bug Fixes
* Updated dyninst version to 12.1.0 #11 - PE-40714
### Features
* Added support for rhel85 - PE-40102
=======================================================================
## [12.0.0] - 2022-01-21 (22.02)
=======================================================================
* Updated dyninst to v12.0.0
=======================================================================
## [10.1.0] - 2021-12-01 (21.12)
=======================================================================
* Initial release of cray-dyninst v10.0
Product and OS Dependencies:
----------------------------
The cray-dyninst 12.1.1 release is supported on the following Cray systems:
- HPE/Cray XC systems with CLE 7.0 or later
- HPE/Cray CS systems with RH 8.4 or later
- HPE/Cray Shasta systems with SLES 15 service pack 2 or later
- HPE Apollo systems with Slurm and RH 8.4 or later
- Baymax systems with RH 8.4 or later
Product Dependencies:
- Cray cray-cdst-support rpm installed
Documentation:
--------------
https://github.com/dyninst/dyninst
Installation Instructions:
-------------
Installation Instructions for .sles15sp3.x86_64:
------------------------------------------
rpm -ivh cray-dyninst-12.1.1-20220523132008_41131bd-2.sles15sp3.x86_64.rpm
To make this the default version of cray-dyninst, execute:
/opt/cray/pe/admin-pe/set_default_files/set_default_cray-dyninst_12.1.1
Certain components, files or programs contained within this package or product are
Copyright 2007-2022 Hewlett Packard Enterprise Development LP.
gdb4hpc 4.14.2:
======================================
Release Date:
--------------
July 2022
Product Description:
--------------------
gdb4hpc is a GDB-based parallel debugger used to debug applications compiled with
CCE, AOCC, GNU, and Intel Fortran, C and C++ compilers. It allows programmers to
either launch an application or attach to an already running application that
was launched on the system. Additionally, it provides comparative debugging
technology that enables programmers to compare data structures between two
executing applications. Comparative debugging should be used in conjunction
with the CCDB GUI tool accessed by loading the cray-ccdb module.
Some features of gdb include:
* Command line parallel debugger allows for launching/attaching applications.
* Utilizes process sets to operate on a subset of application ranks.
* GDB-like feel, also implements a gdbmode to enable a true parallel gdb.
* GPU debugging is supported for OpenMP 4.0+ GPU directives; OpenMP 5.0 on AMD
and Nvidia GPU's.
* Workload manager support via Common Tools Interface (cray-cti).
Changelog:
---------
=======================================================================
## [4.14.2] - 2022-07-14 (22.08)
=======================================================================
### Bug Fixes
* Handle printing enum values
* Handle breakpoints with multiple locations
* Allow program stopped at exit
* Print vector.size
* Remove extra blank line from ptype command
* Fix uncaught exception after SIGINT during launch
* Update testing Makefile to work around PE-41446
* Use correct program arguments when re-launching an app with the run command
* Handle printing fortran structs
* Hang if gdb initialization fails
* Removed internal mrnet build - building with external cray-mrnet - PE-41651
* Correct indices in the difference vector when comparing decomposed fortran arrays
* Handle array
* Fixed hang when working with very large arrays
* Improve parsing of templated classes
* Make functional test cleanup smarter in the case of a test time out
* Printing of arrays of fortran complex values
* Handle printing a corrupted vector
* Printing of unique_ptr
### Features
* Handle SIGINT during launch.
* Print types of pointers; make pointer references more readable
* Support info cuda, set cuda, show cuda and help cuda
* Extend the run interface and add documention
* Frame command only shows program counter with -v option
* Show feedback during a job launch while using the zmqnet network protocol.
* Add info args
* Add local shell command
* Add getting started guide and tutorial
=======================================================================
## [4.14.1] - 2022-05-19 (22.06)
=======================================================================
### Bug Fixes
* Assertion script was continuing in halt mode
* Fix mrnet failure message on exit on PALS
* Fix launch on PALS with default gdb4hpc internal gdb
* Error trying to finish outermost frame
* Launch on some cray-pals systems using --gdb=gdb work-around
* Implement --non-mpi launch with zmqnet
* Fix --non-mpi launch on some PALS systems
* Fix inconsistent results with comparisons over one million elements
* Fix print for typedef values, null char* and function pointers
* Fix array comparison for arrays over 1 million elements
### Features
* Added support for rhel86 - PE-41140 #476
* Add ability to re-launch applications with new "run" command, preserving breakpoints between launches
* Added support for sles15sp4 x86 - PE-40983
* Added autogen changelog/release notes functionality - PE-40699 #457
=======================================================================
## [4.14.0] - 2022-04-14 (22.05)
=======================================================================
### Bug Fixes
* Support "array
* Add reference to man cti PE-40399
* Set up structure for defining gpu tests PE-40398
* Gdb4hpc is crashing after an interrupted launch
* Crash during list command
* Fixed certain functional tests reporting failure despite actually passing on certain machines.
* Print out char* as string values
* Print <optimized out> when values are not available.
### Features
* Added optional full suite to functional tests allowing tests to be run against every protocol at once.
* Overhaul Comparative Debugging and Assertion Scripts
* Added support for rhel85 - PE-40102
* Add --non-mpi option for launch. Allows gdb4hpc to find the initial breakpoint without WLM support.
=======================================================================
## [4.13.10] - 2022-03-21 (22.04)
=======================================================================
* Fix the identification of nodes with AMD GPU's when launching or
attaching with the --gpu option
* Improved error detection and reporting for assertion scripts
* Removed extraneous line number reporting when used within ccdb
* Decompositions now work with C/C++ arrays. The array length
is inferred from the decomposition.
=======================================================================
## [4.13.9] - 2022-01-21 (22.02)
=======================================================================
* Regarding gen compilers cray-gcc-10.3.0
NOTE: If you encounter the following error:
ImportError: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.26' not found
Temporary workaround:
1) export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/gcc/10.3.0/snos/lib64
or
2) module load gcc/10.3.0
=======================================================================
## [4.13.8] - 2021-12-06 (21.12)
=======================================================================
* Updated gen compilers to cray-gcc-10.3.0
Product and OS Dependencies:
-----------------------------
The gdb4hpc 4.14.2 is supported on the following Cray systems:
- HPE/Cray XC systems with CLE 7.0 or later
- HPE/Cray CS systems with CentOS/RH 8.4 or later
- HPE/Cray Shasta systems with SLES 15 service pack 2 or later
- HPE Apollo systems with Slurm and RH 8.4 or later
- Baymax systems with RH 8.4 or later
Product Dependencies:
- Cray cray-cti rpm installed
- Cray cray-cdst-support rpm installed
Documentation:
---------------
Type `man gdb4hpc` with the gdb4hpc module loaded to read the gdb4hpc(1) man page.
Simple usage examples are provided at the end.
Also, type `help` at the gdb4hpc command line for more information on commands.
Installation instructions: