gpumode.com
https://www.gpumode.com/v2/leaderboard/537
Kernel Leaderboard
Description: Implement a 2D convolution kernel that matches the reference implementation.
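To make "matches the reference implementation" concrete, here is a minimal sketch of what such a reference and its comparison check might look like. The snippet above does not give the leaderboard's actual reference code, tensor shapes, padding, stride, or tolerances, so everything here is an assumption for illustration: the function names `conv2d_reference` and `matches_reference` are hypothetical, the input is single-channel, padding is "valid", stride is 1, and the operation is cross-correlation (no kernel flip), as is common in deep learning frameworks.

```python
def conv2d_reference(image, kernel):
    """Naive 2D cross-correlation on nested lists.

    Assumptions (not taken from the leaderboard): single channel,
    stride 1, no padding ("valid" output size).
    """
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    out_h, out_w = h - kh + 1, w - kw + 1
    if out_h <= 0 or out_w <= 0:
        raise ValueError("kernel must not be larger than the image")

    out = [[0.0] * out_w for _ in range(out_h)]
    for i in range(out_h):
        for j in range(out_w):
            acc = 0.0
            # Slide the kernel window over the image and accumulate products.
            for di in range(kh):
                for dj in range(kw):
                    acc += image[i + di][j + dj] * kernel[di][dj]
            out[i][j] = acc
    return out


def matches_reference(candidate, reference, rtol=1e-5, atol=1e-8):
    """Elementwise closeness check; the tolerances are illustrative guesses."""
    if len(candidate) != len(reference):
        return False
    for cand_row, ref_row in zip(candidate, reference):
        if len(cand_row) != len(ref_row):
            return False
        for c, r in zip(cand_row, ref_row):
            if abs(c - r) > atol + rtol * abs(r):
                return False
    return True


if __name__ == "__main__":
    image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    kernel = [[1, 0], [0, -1]]
    expected = conv2d_reference(image, kernel)
    print(expected)
    print(matches_reference(expected, expected))
```

An optimized GPU submission would replace `conv2d_reference` with its own kernel and be judged on whether its output passes a check like `matches_reference` against the reference output, under whatever tolerances the leaderboard actually specifies.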
gpumode.com
https://www.gpumode.com/v2/news
Kernel Leaderboard
2nd Place - Flash Hogs (GitHub): Flash-HOG is an optimized kernel for running higher-order gradient (HOG) methods for attention on NVIDIA Blackwell. Their kernel implements the backward pass of attention. Many research-level and current SoTA architectures depend on XLA to produce a kernel for this operation, so an efficient kernel opens this approach up to wider use. They built a fast ...