Tue 16 Jun 2026 14:40 - 15:00 at Meadows CD - KEYNOTE-2 and Tensorial Themes Chair(s): Sangeeta Chowdhary

LLM-based kernel generators automate optimized kernel synthesis for heterogeneous accelerators, but correctness is validated only by numerical testing on random inputs, a method that misses structural bugs below floating-point tolerance and cannot formally reason about semantic equivalence. The critical gap lies at the PyTorch and Triton IR level, where LLM introduced divergences first appear, yet no existing tool targets equivalence checking at this abstraction.

We present early work on a semantic equivalence checker that formally verifies whether an optimized kernel (PyTorch or Triton) computes the same function as its baseline for \emph{all} inputs. The equivalence checker lowers both programs through a five-phase pipeline: FX graph extraction, TTIR expansion, decomposition, scalarization, and Z3 verification, reducing them to scalar SMT formulas over symbolic inputs.

On KernelBench Level1, it formally proved 16 optimized kernels equivalent and confirmed 2 bugs in kernels that had passed numerical testing, including a float32 constant-truncation error two orders of magnitude below a typical tolerance.

We demonstrate that formal equivalence checking is practical for the straight-line, loop-free kernels that LLM generators commonly produce.

Submission Category: Extended Abstract

Tue 16 Jun

Displayed time zone: Mountain Time (US & Canada) change

13:40 - 15:20
KEYNOTE-2 and Tensorial ThemesARRAY at Meadows CD
Chair(s): Sangeeta Chowdhary AMD Research
13:40
50m
Keynote
The Shape of Things to Come
ARRAY
14:30
10m
Live Q&A
Q&A for Keynote-2
ARRAY

14:40
20m
Talk
Tensor Algebra Equivalence Checker
ARRAY
Jubi Taneja Gimlet Labs, Tom St. John Gimlet Labs, Natalie Serrino Gimlet Labs
15:00
20m
Talk
Rhyme: A Multi-Paradigm Declarative Query Language
ARRAY
Ran Guo Purdue University, Tiark Rompf Purdue University