ARTFEED — Contemporary Art Intelligence

KLineage: Learning Optimization Timing from Expert GPU Kernels

ai-technology · 2026-05-28

A new arXiv preprint (2605.28213) introduces KLineage, a system that teaches LLM-based agents when to apply GPU kernel optimizations by learning from expert implementations. Instead of forward rollouts, KLineage walks expert kernels backward through validation-gated simplifications, reversing each accepted step into a reusable skill. Each skill records the optimization intent, code location, validity conditions, effects, and potential failure modes. A downstream LLM applies these skills to new code under compile/correctness/profile gates. On five expert workloads across two NVIDIA architectures, lineage-derived skills outperformed recent memory-based LLM-kernel baselines in final kernel quality.

Key facts

  • arXiv:2605.28213
  • KLineage learns optimization timing from expert GPU kernels
  • Method walks expert implementations backward through validation-gated simplifications
  • Each skill records intent, location, conditions, effects, and failure modes
  • Tested on five expert workloads across two NVIDIA architectures
  • Exceeded memory-based LLM-kernel baselines in kernel quality

Entities

Institutions

  • arXiv
  • NVIDIA

Sources