[BugFix] Fix high-AM CPU sn-K #98

Merged: 7 commits, Mar 14, 2024

wavefunction91
Owner

The rolled (non-unrolled) loops in the high-AM sn-K kernels on the CPU were incorrect. This PR manually unrolls those loops for $L_A+L_B < 10$.

Closes #97
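
For readers unfamiliar with the technique, here is a minimal sketch of what "manually unrolling" a contraction loop means, using a (p|p) shell pair as the example. This is not GauXC's kernel code: `n_cart`, `contract_rolled`, `contract_pp_unrolled`, and the row-major layout of `T` are all hypothetical names and assumptions made for illustration only.

```cpp
#include <cstddef>

// Number of Cartesian components in a shell of angular momentum l
// (s -> 1, p -> 3, d -> 6, f -> 10, g -> 15).
constexpr std::size_t n_cart(int l) {
  return static_cast<std::size_t>((l + 1) * (l + 2) / 2);
}

// Rolled reference loop (hypothetical): y[i] += sum_j T[i*nb + j] * x[j],
// where T is assumed to be stored row-major with n_cart(la) rows
// and n_cart(lb) columns.
inline void contract_rolled(int la, int lb, const double* T,
                            const double* x, double* y) {
  const std::size_t na = n_cart(la);
  const std::size_t nb = n_cart(lb);
  for (std::size_t i = 0; i < na; ++i) {
    for (std::size_t j = 0; j < nb; ++j) {
      y[i] += T[i * nb + j] * x[j];
    }
  }
}

// Manually unrolled (p|p) case (hypothetical): the same 3x3 contraction
// with every term written out, so each index can be checked by eye.
inline void contract_pp_unrolled(const double* T, const double* x, double* y) {
  y[0] += T[0] * x[0] + T[1] * x[1] + T[2] * x[2];
  y[1] += T[3] * x[0] + T[4] * x[1] + T[5] * x[2];
  y[2] += T[6] * x[0] + T[7] * x[1] + T[8] * x[2];
}
```

Calling both functions on the same inputs (for example `contract_rolled(1, 1, T, x, y1)` and `contract_pp_unrolled(T, x, y2)`) should give matching results up to floating-point rounding, which is the kind of cross-check a high-AM regression test can rely on.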

wavefunction91 merged commit 843fe01 into master on Mar 14, 2024
12 checks passed
wavefunction91 deleted the bugfix/cpu_os_higham branch on March 14, 2024 16:42
mikovtun pushed a commit to mikovtun/GauXC that referenced this pull request Mar 29, 2024
* Fix (d|f) and (f|f) sn-K contractions on the CPU

* MIN -> min in f sn-K CPU kernels

* Fix (p|g) ... (g|g) sn-K CPU kernels

* Missing MIN -> std::min

* Added BasisSet::max_l

* Added H2O2 def2-{T,Q}ZVP tests to check sn-K with high AM, disable L > 2 for device tests for now

* Add L=4 to CUDA collocation kernels
wavefunction91 added a commit that referenced this pull request Apr 3, 2024
* improve standalone_driver compatibility with X/Y/Z densities

* Change "spin" nomenclature

Co-authored-by: David Williams-Young <wavefunction91@gmail.com>

* [BugFix] Fix high-AM CPU sn-K (#98)

* Fix (d|f) and (f|f) sn-K contractions on the CPU

* MIN -> min in f sn-K CPU kernels

* Fix (p|g) ... (g|g) sn-K CPU kernels

* Missing MIN -> std::min

* Added BasisSet::max_l

* Added H2O2 def2-{T,Q}ZVP tests to check sn-K with high AM, disable L > 2 for device tests for now

* Add L=4 to CUDA collocation kernels

* Handle potential external gau2grid with higher-level target check (#95)

* Change ENABLE variables to HAS/USE variables

* Cleanup of old config header

* Fixed integrate_den warning and added reference VXC guards for non-VXC jobs

---------

Co-authored-by: mk <mikovtun@uw.edu>
Co-authored-by: David Williams-Young <wavefunction91@gmail.com>
Co-authored-by: David Williams-Young <dbwy@lbl.gov>
Successfully merging this pull request may close these issues.

SCF Convergence Issues in Psi4-GauXC Interface with High AM