Prompt Judge

Coding judge scored on prompts, not code.

Write a prompt, pick a small local model, and see whether your words coax it into a correct solution. Fewer input tokens and smaller models score higher.

Problems

Loading…