Skip to content

Commit

Permalink
comment
Browse files Browse the repository at this point in the history
  • Loading branch information
zasdfgbnm committed Jan 29, 2025
1 parent 8a7f03f commit 225b35b
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions csrc/device_lower/pass/index.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -2152,6 +2152,10 @@ void IndexLowering::handle(const LoadStoreOp* ldst) {
dataTypeSize(ldst->out()->dtype()) == 4,
"For now, we only support 32-bit types in tmem");
// TODO: hard code size 1 for now.
// According to the specification of tcgen05.{ld,st}, the register
// operand must be viewed as a vector of 32-bit elements.
// See:
// https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#tensor-memory-and-register-load-store-instructions
as_type = ArrayType{std::make_shared<DataType>(ldst->in()->dtype()), 1};
}
if (auto tv = dynamic_cast<TensorView*>(ldst->in());
Expand Down

0 comments on commit 225b35b

Please sign in to comment.