Describe the Bug
In latency = do_bench( , there is a probability that the returned latency is 0.
To Reproduce
python -m pytest tests/ops/test_mha.py
Expected Behavior
The return value latency is expected to be greater than zero.
Environment
- TileLang version: 0.1.7.post1
- PyTorch version: 2.9.0
- OS: Ubuntu22.04
- CUDA version: 12.9