LMArena at the University of California, Berkeley, is making it easier, thanks to help from NVIDIA and Nebius. Its rankings, powered by the Prompt-to-Leaderboard (P2L) model, draw on human votes for which AI performs best in areas such as math, coding, or creative writing. “We capture user preferences across tasks and apply Bradley-Terry coefficients to identify which model performs best in each domain,” said Wei-Lin Chiang, co-founder of LMArena and a doctoral student at Berkeley.
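To give a feel for what Bradley-Terry coefficients are, here is a minimal Python sketch that fits them to a handful of pairwise votes. It is not LMArena's or P2L's code: the function names, the model names, and the vote data are all hypothetical, and the fitting routine is the textbook iterative (minorization-maximization) update rather than whatever the team actually uses. Under the Bradley-Terry model, each AI gets a positive strength score, and the chance that one model beats another is its strength divided by the sum of the two.

```python
from collections import defaultdict


def fit_bradley_terry(votes, iters=200):
    """Estimate Bradley-Terry strengths from (winner, loser) vote pairs.

    Returns a dict mapping each model name to a strength, normalized to sum to 1.
    Assumes every model has at least one win and one loss, and that the
    comparisons connect all models; otherwise the estimates are not well defined.
    """
    models = sorted({name for pair in votes for name in pair})
    wins = defaultdict(int)         # total wins per model
    pair_counts = defaultdict(int)  # number of votes per unordered pair
    for winner, loser in votes:
        wins[winner] += 1
        pair_counts[frozenset((winner, loser))] += 1

    strength = {m: 1.0 / len(models) for m in models}
    for _ in range(iters):
        updated = {}
        for i in models:
            denom = 0.0
            for j in models:
                if i == j:
                    continue
                n_ij = pair_counts.get(frozenset((i, j)), 0)
                if n_ij:
                    denom += n_ij / (strength[i] + strength[j])
            updated[i] = wins[i] / denom if denom else strength[i]
        total = sum(updated.values())
        strength = {m: s / total for m, s in updated.items()}
    return strength


def win_probability(strength, a, b):
    """Probability that model a is preferred over model b under Bradley-Terry."""
    return strength[a] / (strength[a] + strength[b])


# Hypothetical votes for one domain (say, coding): (winner, loser)
votes = (
    [("model_a", "model_b")] * 3 + [("model_b", "model_a")] * 1
    + [("model_b", "model_c")] * 2 + [("model_c", "model_b")] * 1
    + [("model_a", "model_c")] * 2 + [("model_c", "model_a")] * 1
)

strength = fit_bradley_terry(votes)
ranking = sorted(strength, key=strength.get, reverse=True)
print(ranking)
```

Running the same fit separately on votes from math, coding, or creative-writing prompts would give one ranking per domain, which is the general idea the quote describes.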