Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Venado optimizations #297

Draft
wants to merge 66 commits into
base: master
Choose a base branch
from
Draft

Venado optimizations #297

wants to merge 66 commits into from

Conversation

mewall
Copy link
Collaborator

@mewall mewall commented Jan 14, 2025

Venado optimizations with many new functionalities added

Notable revisions:

o gpmdk example code for graph-based QMD using the kernel method
o updates to use the new bml_transpose fortran interface
o additional progress subroutines to support gpmdk functionalities
o bug fixes
o modifications to decrease memory allocations, speeding up the code


This change is Reviewable

mewall and others added 30 commits January 14, 2025 15:59
o Kernel before rank update was being used. Fix improves SCF error
o Fix the matmul equation by specifying index range on lhs
…code to fix issue with XYZ coordinates being printed across two lines instead of on a single line
o Pass subarray of ham, over to get_skblock
o Change get_hsmat nested loop to enable collapse(2)

In particular, the collapse() clause improves performance
o Modify get_skblock_inplace to assume zero elements on entry
o Modify get_hsmat to zero the ham, over matrices before calc
o Create get_dH_or_dS using new get_skblock_inplace method
  - Initialize matrices with zero
o Use new method in gpmdk
o Performance is now on par with cray vectorized method
o Rewrite CPU code to eliminate packs
o Offload the calc for nvidia build
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants