LLM Chess ArenaGamified LLM Benchmark • Powered by Replicate
8
7
6
5
4
3
2
a1
b
c
d
e
f
g
h

Each model receives the FEN and the list of legal SAN moves, then must output a single move. chess.js validates and applies it. After three invalid replies → forfeit.