Pull requests: NVIDIA/cutlass
Pull requests list
fix Copy_Traits<SM80_CP_ASYNC_*_ZFILL> without definition of with(pred) function
#1887
opened Oct 20, 2024 by
CalebDu
Fix potential smem misaligned address issue for ws pingpong kernel.
#1870
opened Oct 14, 2024 by
Junkai-Wu
fix wrong A/BLayout in MMA_Traits for binary mma and append other MMA_Traits support
#1856
opened Oct 9, 2024 by
CalebDu
added mapping for bf16 to torch::kBFloat16
#1843
opened Sep 27, 2024 by
Bogumil-Sapinski-Mobica
Include of regular_tile_iterator.h fixed for NVRTC
inactive-30d
#1765
opened Sep 1, 2024 by
MaxAkaAltmer
Fix EVT for cutlass::gemm::kernel::DefaultGemmWithVisitor's behavior when constructing GemmUniversalAdapter
#1753
opened Aug 28, 2024 by
Xinyu302
Fixing std::numeric_limits<half_t>::digits to include the implicit one
inactive-30d
#1702
opened Aug 10, 2024 by
akamiru
Avoid LDGSTS routing by changing default copy to be universalcopy
inactive-30d
#1674
opened Aug 1, 2024 by
ZelboK
feat: allow print_latex(TiledMMA) to colorize sliced thread and add print_latex(ThrMMA)
inactive-30d
#1656
opened Jul 26, 2024 by
cloudhan
Add atomic_add and BlockStripedReduce to bfloat162
inactive-30d
#1653
opened Jul 24, 2024 by
yzh119
Add infinity to cutlass::platform::numeric_limits<half_t>
inactive-30d
#1650
opened Jul 22, 2024 by
eqy
Make mainloop schedule type available as GemmKernel::Schedule
inactive-30d
#1638
opened Jul 16, 2024 by
manishucsd
Make integer_subbyte Fully Compliant with is_integral
inactive-30d
#1632
opened Jul 15, 2024 by
osayamenja
Add Ampere GEMM example using Cute and CUTLASS 3.x
inactive-30d
#1604
opened Jun 27, 2024 by
aacostadiaz