You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I followed the procedure outlined on github page and successfully compiled libnvcd.
However, running nvcdrun produced a segmentation fault on our V100 system.
$ export NVCDINFO_DEVICE_ID=0
$ export NVCDINFO_GROUP_SIZE=5
$ LD_LIBRARY_PATH=$PWD/bin:$LD_LIBRARY_PATH make nvcdinfo_generate_csv
/scratch/optpar01/work/2023/17-libnvcd/libnvcd/bin/nvcdinfo -d 0 -n 5
GPU 0
gpu_name = Tesla V100-SXM2-32GB
gpu_uuid = GPU-8cfe493c-fe94-943f-4f3d-99abd2ba7fa3
=======multiplex=========
|INFO|Processing domain: domain_a
|INFO| Number of events available in this domain: 24
|INFO|Processing domain: domain_b
|INFO| Number of events available in this domain: 4
|INFO|Processing domain: domain_d
|INFO| Number of events available in this domain: 30
|INFO|Processing domain: domain_e
|INFO| Number of events available in this domain: 20
|INFO|Processing domain: domain_p
|INFO| Number of events available in this domain: 2
|INFO|Processing domain: domain_s
|INFO| Number of events available in this domain: 2
Could you kindly clarify what the meaning of 'a, b, d, e, p, s' are ? (Notably, 'c' is missing)
Running nvcdrun
$ export BENCH_EVENTS=$(head -n1 cupti_group_info/device_0/domain_a.csv)
$ LD_LIBRARY_PATH=$PWD/bin/:$LD_LIBRARY_PATH LD_PRELOAD=$LD_PRELOAD:$PWD/bin/libnvcdhook.so bin/nvcdrun
TEST MODE: MULTI-THREADED
CUDA RUNTIME: /scratch/optpar01/work/2023/17-libnvcd/libnvcd/nvcdrun/src/main.c:64:'cudaSetDevice(device)' failed. [Reason] cudaErrorInvalidDevice:invalid device ordinal
CUDA RUNTIME: /scratch/optpar01/work/2023/17-libnvcd/libnvcd/nvcdrun/src/main.c:64:'cudaSetDevice(device)' failed. [Reason] cudaErrorInvalidDevice:invalid device ordinal
CUDA RUNTIME: /scratch/optpar01/work/2023/17-libnvcd/libnvcd/nvcdrun/src/main.c:64:'cudaSetDevice(device)' failed. [Reason] cudaErrorInvalidDevice:invalid device ordinal
*** Error in `bin/nvcdrun': double free or corruption (fasttop): 0x0000000000659790 ***
Regards.
The text was updated successfully, but these errors were encountered:
Hi,
I followed the procedure outlined on github page and successfully compiled libnvcd.
However, running
nvcdrun
produced a segmentation fault on our V100 system.CentOS Linux release 7.9.2009
GPU: V100-SMX2 (Driver 510.47.03)
Modules: gcc/8.3.0, cuda/10.1
nvcdinfo
Could you kindly clarify what the meaning of 'a, b, d, e, p, s' are ? (Notably, 'c' is missing)
nvcdrun
Regards.
The text was updated successfully, but these errors were encountered: