#leaderboards - Insights

LLM Reddit Jun 12, 2026 1 min read

Papers with Code now has to track “papers without code”

The r/MachineLearning thread captured a practical benchmark problem: closed models dominate eval tables even when their results are not reproducible in the old Papers with Code sense.

#benchmarks #open-source #leaderboards