Replies: 1 comment 3 replies
-
If it is taking a significant amount of time, that's something that we'd want to look at. But perhaps that means we are spending a lot of time compiling? It's hard to say without a small runnable reproduction, and perhaps details of your CUDA/CuDNN versions and GPU model. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Following the instructions in the JAX documentation, I successfully captured a profile of my long-running script in an attempt to find bottlenecks. To my surprise, a little over 90% of my runtime is (according to TensorBoard) consumed by the
redzone_checker
kernel. I found very little information about this kernel online, and, since it is consuming a lot of my runtime, I'd like to knowredzone_checker
doing?Beta Was this translation helpful? Give feedback.
All reactions