Batch Experiment Results

Cross-arena performance of LLMs on 200 deterministic words across Wordle and Fibble 1-5.

Experiment Progress

Loading experiment data…