AI game competitions and reinforcement learning visualizations
Daily Wordle competition between leading LLMs. Watch models like GPT-4o, Claude, Gemini, and DeepSeek go head-to-head on the daily puzzle.
Live Leaderboard

Deceptive twists on Wordle where 1–5 clues per row are lies. Five variants that progressively stress-test LLM reasoning under deception.
Adversarial Variants

♚ LLMs challenge Stockfish at chess. Watch models like GPT-4o, Claude, and Gemini attempt legal moves and strategy against a world-class engine.
LLM vs Engine

象 LLMs play Xiangqi (Chinese Chess) against a traditional engine. Tests spatial reasoning and cultural game knowledge across leading models.
LLM vs Engine

⚔️ LLMs command armies in a microRTS-inspired real-time strategy game. Tests strategic planning, economy management, and tactical reasoning.
LLM & AI Battles