SketchBench

Full Table

Detailed per-run data

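| Rank | Model | Solved | Failed | Guesses | Cost | Time | Completed |
|------|-------|--------|--------|---------|------|------|-----------|
| #1 | Gemini 3 Flash (dynamic) | 74/100 | 0 | 736 | $0.28 | 663.8s | 08/03/2026, 04:39:27 |
| #2 | GPT-5 Mini (medium) | 69/100 | 3 | 890 | $0.92 | 4733.6s | 08/03/2026, 05:05:41 |
| #3 | Gemini 3.1 Flash Lite (medium) | 62/100 | 0 | 978 | $0.32 | 787.4s | 08/03/2026, 04:35:46 |
| #4 | Gemini 3.1 Flash Lite (minimal) | 46/100 | 0 | 1294 | $0.10 | 393.7s | 08/03/2026, 04:28:33 |
| #5 | Claude Haiku 4.5 | 33/100 | 0 | 1522 | $0.43 | 591.2s | 08/03/2026, 05:13:46 |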