### Required prerequisites

- [x] I have searched the [Issue Tracker](https://github.com/tile-ai/tilelang/issues) and confirmed that this hasn't already been reported. (Comment there if it has.)

### Motivation

NPU-IR provides two operations: `transform_layout_async` and `transpose_async`.

### Solution

_No response_

### Alternatives

_No response_

### Additional context

_No response_