LLM Infrastructure

The Attention Kernel Of Dao

Archive

Expert

300pts0 solves

Dao et al. (2024) released a Hopper-specific rewrite of the fast attention kernel exploiting warp-specialization and FP8, shipping with PyTorch and cuDNN. Which name (with version)?

Show hint

The family + a major version bump.

Archive — no submissions accepted

This challenge is preserved for reference. Play live challenges at /challenges.