-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update and sandardize implementation of packages, in sync with spack update #593
Conversation
This reverts commit 8fba6f2.
@daboehme It appears that recent changes in Caliper main branch fixed the issues we were seeing with cce compilers. |
@daboehme any idea what could be causing this ?
|
Hi @adrienbernede, where did you see this happening? Can't find it in any of the recent CI results. |
@daboehme I think you just missed it, it right after the test summary in the logs of the only failing job:
|
Hi @adrienbernede, thanks I found it. I tried building Caliper with the same compiler and libraries, but I can't reproduce these issues. All tests are running fine for me. It also doesn't seem like the CI is running this particular configuration lately. Can we simply retry running this config? Maybe it was a HW issue or something. |
Hello @daboehme I ran the job again and it failed the same. The easiest way to reproduce the issue is by using the in-log reproducer. working_dir="/usr/workspace/${USER}/Caliper/2222036-$(date +%s)"
mkdir -p ${working_dir} && cd ${working_dir}
git clone https://github.com/LLNL/Caliper.git --single-branch --depth=1
cd Caliper
git fetch origin --depth=1 c634187441c3ad88420de7d00ca642b78dd14da5
git checkout c634187441c3ad88420de7d00ca642b78dd14da5
git submodule update --init --recursive
# Required variables
export MODULE_LIST=""
export SPEC="+tests +rocm amdgpu_target=gfx90a %rocmcc@=6.2.0 ^hip@6.2.0 "
# Allow to set job script for debugging (only this differs from CI)
export DEBUG_MODE=true
flux watch $(flux batch -o output.stdout.type=kvs --nodes=1 --begin-time=+5s ./scripts/gitlab/build-and-test.sh) Please note that the failing job is new: we were previously testing with rocm@6.1.1 and this PR updates rocm to 6.2.0. |
Hi @adrienbernede, Thanks, that's very helpful. I was finally able to reproduce the failing tests, which did actually catch a real issue. This should now be fixed in the current Caliper master branch. Can you rebase your branch and try it again? Thanks! |
@daboehme glad it helped. |
Hi @adrienbernede, a merge is fine. Are we good to merge this in then? |
@daboehme yes we are ! |
Summary
Supersedes #588
This PR :
.uberenv-config.json
.