LLM Reddit May 5, 2026 1 min read
DeepSeek V4 Pro tied with GPT-5.2 on FoodTruck Bench, a 30-day agentic benchmark using 34 tools, arriving roughly 10 weeks after GPT-5.2 was tested at approximately 17x lower cost.
DeepSeek V4 Pro tied with GPT-5.2 on FoodTruck Bench, a 30-day agentic benchmark using 34 tools, arriving roughly 10 weeks after GPT-5.2 was tested at approximately 17x lower cost.
r/MachineLearning paid attention because the benchmark did not just crown a winner. It argued that many teams are overpaying for document extraction, then backed that claim with repeated runs, cost-per-success numbers, and a leaderboard where several cheaper models outperformed pricey defaults.