Rank the models within each row (same input):
1 = best. The leaderboard below
updates live — mean rank + win-rate across the rows you've scored. This is a sandbox: break it freely, the
real /review is untouched.Live leaderboard (mean rank · win-rate, from your ranks)