Why it matters: public coding benchmarks are getting less useful at the frontier, so a fresh product-side score can move developer attention fast. Cursor says GPT-5.5 is now its top model on CursorBench at 72.8% and is discounting usage by 50% through May 2.
#cursor
Why it matters: AI coding leaders are now competing on compute access and strategic ownership, not only editor features. TechCrunch reported a $2B funding round, a $10B collaboration fee, and a path to a $60B Cursor acquisition.
Cursor is putting usage data behind the claim that better coding models change the shape of developer work. In a 500-team study, high-complexity tasks rose 68%, while documentation grew 62% and UI/styling only 15%.
Cursor used an April 3 X post to push developers toward its new Cursor 3 interface. The larger move is shifting from an IDE-side AI panel to a workspace for coordinating many agents across local, cloud, and remote environments.
On April 6, 2026, Cursor said on X that it rebuilt how MoE models generate tokens on NVIDIA Blackwell GPUs. In a companion engineering post, the company said its "warp decode" approach improves throughput by 1.84x while producing outputs 1.4x closer to an FP32 reference.
In an April 8, 2026 X post, Cursor said its code review agent can learn from pull-request activity in real time. The company also claimed that 78% of the issues the agent finds are resolved by the time the PR is merged.
Cursor has published a technical report for Composer 2, outlining a two-stage recipe of continued pretraining and large-scale reinforcement learning for agentic software engineering. The company says the model reaches 61.3 on CursorBench, 61.7 on Terminal-Bench, and 73.7 on SWE-bench Multilingual while keeping pricing at $0.50/M input and $2.50/M output tokens.
Cursor said on March 26, 2026, that real-time reinforcement learning lets it ship improved Composer 2 checkpoints every five hours. Cursor’s March 27 technical report says the model combines continued pretraining on Kimi K2.5 with large-scale RL in realistic Cursor sessions, scores 61.3 on CursorBench, and runs on an asynchronous multi-region RL stack with large sandbox fleets.
Cursor said on April 2, 2026, that Cursor 3 reframes the product as a unified workspace for building software with agents, rather than just an AI-augmented editor. The release centers on multi-workspace coordination, parallel local and cloud agents, faster handoff between environments, and tighter review-to-PR workflows.
Cursor 3 reframes AI coding as multi-agent orchestration, combining local and cloud agents, multi-repo context, and PR-oriented workflows in a single interface.
Cursor has published the Composer 2 technical report, outlining its code-focused continued pretraining, large-scale reinforcement learning pipeline, and CursorBench-led evaluation strategy. The report offers an unusually detailed first-party look at how a production coding agent is trained and measured.
A March 29 r/singularity thread amplified Cursor's claim that Composer checkpoints can now be trained from live user interactions and shipped every five hours, with reward-hacking fixes treated as part of the story rather than an afterthought.