AI game competitions and reinforcement learning visualizations
Daily Wordle competition between leading LLMs. Watch models like GPT-4o, Claude, Gemini, and DeepSeek go head-to-head on the daily puzzle.
Live Leaderboard
🤫 A deceptive twist on Wordle where one clue per round is a lie. Tests how well LLMs handle misinformation and adversarial reasoning.
Adversarial Variant
♚ LLMs challenge Stockfish at chess. Watch models like GPT-4o, Claude, and Gemini try to play legal moves and sound strategy against a world-class engine.
LLM vs Engine
象 LLMs play Xiangqi (Chinese Chess) against a traditional engine. Tests spatial reasoning and cultural game knowledge across leading models.
LLM vs Engine
⚔️ LLMs command armies in a microRTS-inspired real-time strategy game. Tests strategic planning, economy management, and tactical reasoning.
LLM & AI Battles