Stat db gpu improvments #125

rrhodgson · 2023-10-26T13:43:36Z

I've added the gpu comms memory and the total memory usage reported by cuda to the stat db.
Now e.gridTotalCurrent is the total memory allocated on thee gpu by grid (in use objects + cache + comms).

I have also added a new column in the view gridDeficitCurrentMB which is the total memory reported by cuda minus grids total usage.
I've been using this to try to debug cases where the deficit jumps suddenly.
It is also useful to see how much headroom we have to increase the --device-mem flag.

Added comms and cuda total memory usage to stat db

f7af1be

rrhodgson requested a review from aportelli as a code owner October 26, 2023 13:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stat db gpu improvments #125

Stat db gpu improvments #125

rrhodgson commented Oct 26, 2023

Stat db gpu improvments #125

Are you sure you want to change the base?

Stat db gpu improvments #125

Conversation

rrhodgson commented Oct 26, 2023