LLM Game Arenas

🟩

Wordle Arena

Daily Wordle competition between leading LLMs. Watch models like GPT-4o, Claude, Gemini, and DeepSeek go head-to-head on the daily puzzle.

Live Leaderboard

A

Deceptive twists on Wordle where 1–5 clues per row are lies. Five variants that progressively stress-test LLM reasoning under deception.

Adversarial Variants

♚

LLMs challenge Stockfish at chess. Watch models like GPT-4o, Claude, and Gemini attempt legal moves and strategy against a world-class engine.

LLM vs Engine

象

LLMs play Xiangqi (Chinese Chess) against a traditional engine. Tests spatial reasoning and cultural game knowledge across leading models.

LLM vs Engine

⚔️

LLMs command armies in a natural language real-time strategy game. Tests strategic planning, economy management, and tactical reasoning.

LLM & AI Battles