Kamal Sharma kamalrss88

Popular repositories

FlashMLA (Public)

🚀 Accelerate attention mechanisms with FlashMLA, featuring optimized kernels for DeepSeek models, enhancing performance through sparse and dense attention.

C++

kamalrss88.github.io (Public)

⚡ Optimize attention in AI models with FlashMLA, featuring advanced sparse and dense kernels for enhanced performance in DeepSeek applications.