#research

Sciences Mar 23, 2026 2 min read

Google upgrades Gemini 3 Deep Think for science, research, and engineering

On Feb. 12, 2026, Google announced a major Gemini 3 Deep Think upgrade for science, research, and engineering. The new version is available in the Gemini app for Google AI Ultra subscribers and, for the first time, via early API access for researchers, engineers, and enterprises.

#google #gemini #science

LLM Hacker News Mar 21, 2026 2 min read

Hacker News Tracks Moonshot AI’s Attention Residuals as a Drop-In Upgrade for Transformer Depth

The March 20, 2026 HN discussion around Attention Residuals focused on a simple claim with large implications: replace fixed residual addition with learned depth-wise attention and recover performance with modest overhead.

#llm #transformers #research
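
As a rough illustration of the claim in this summary, the sketch below contrasts a fixed residual sum over earlier layer outputs with a softmax-weighted mix over depth. It is a toy under stated assumptions, not the paper's formulation: the function names, the single learned `query` vector, and the dot-product scoring are all hypothetical choices made for the example.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def fixed_residual(layer_outputs):
    """Standard residual stream: every earlier output is added with weight 1."""
    dim = len(layer_outputs[0])
    return [sum(out[i] for out in layer_outputs) for i in range(dim)]

def attention_residual(layer_outputs, query):
    """Assumed variant: weight each earlier output by a softmax over depth.

    `query` stands in for a learned parameter (hypothetical); the score of
    each layer output is its dot product with that query.
    """
    scores = [sum(q * x for q, x in zip(query, out)) for out in layer_outputs]
    weights = softmax(scores)
    dim = len(layer_outputs[0])
    return [sum(w * out[i] for w, out in zip(weights, layer_outputs))
            for i in range(dim)]

outputs = [[1.0, 0.0], [0.0, 1.0]]
summed = fixed_residual(outputs)                   # plain sum over depth
mixed = attention_residual(outputs, [0.0, 0.0])    # zero query gives uniform weights
```

With a zero query the weights are uniform, so the mix reduces to an average of the layer outputs; a trained query would let some depths contribute more than others.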

AI Reddit Mar 20, 2026 2 min read

r/MachineLearning Watches Clip to Grok Claim 18x-to-66x Faster Generalization

A March 17, 2026 r/MachineLearning post about Clip to Grok reached 56 points and 20 comments at crawl time. The authors report that per-row L2 clipping after each optimizer step cut grokking delay by 18x to 66x on modular arithmetic benchmarks.

#grokking #optimization #transformers
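
For readers unfamiliar with the idea, here is a minimal sketch of per-row L2 clipping applied after an optimizer step. The summary only states that rows are clipped after each step; the plain SGD update, the `max_norm` threshold, and the function names are assumptions for illustration and may differ from the authors' setup.

```python
import math

def clip_rows_l2(weights, max_norm=1.0):
    """Rescale any row whose L2 norm exceeds max_norm; leave other rows as is."""
    clipped = []
    for row in weights:
        norm = math.sqrt(sum(w * w for w in row))
        scale = max_norm / norm if norm > max_norm else 1.0
        clipped.append([w * scale for w in row])
    return clipped

def sgd_step_with_clip(weights, grads, lr=0.1, max_norm=1.0):
    """One plain SGD update (assumed optimizer), followed by per-row clipping."""
    updated = [[w - lr * g for w, g in zip(w_row, g_row)]
               for w_row, g_row in zip(weights, grads)]
    return clip_rows_l2(updated, max_norm)

# The first row has norm 5 and is scaled down to norm 1; the second is untouched.
example = clip_rows_l2([[3.0, 4.0], [0.3, 0.4]], max_norm=1.0)
```

The clipping acts on the weights themselves rather than on the gradients, which is what distinguishes it from ordinary gradient clipping.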

LLM X Mar 20, 2026 2 min read

OpenAI launches Parameter Golf to push efficient pretraining under a 16 MB cap

OpenAI said on X that it is launching Parameter Golf, an open research challenge to build the most efficient pretrained model under a 16 MB artifact limit and a 10-minute training budget on 8×H100s. The challenge uses a fixed FineWeb dataset, a public baseline repo, and optional Runpod credits for participants.

#openai #parameter-golf #model-efficiency
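
To give a sense of scale for the 16 MB cap, the arithmetic below estimates how many weights would fit at a few bit widths. It assumes 16 MB means 16,000,000 bytes and that the whole artifact is spent on uncompressed weights; both are simplifications for illustration, and the challenge's own accounting may differ.

```python
def max_params(artifact_bytes, bits_per_param):
    """Upper bound on parameter count if the artifact held only raw weights."""
    return (artifact_bytes * 8) // bits_per_param

ARTIFACT_BYTES = 16_000_000  # assumption: decimal megabytes

for bits in (16, 8, 4):
    print(f"{bits}-bit weights: up to {max_params(ARTIFACT_BYTES, bits):,} parameters")
```

Under these assumptions the budget ranges from 8 million parameters at 16 bits to 32 million at 4 bits, before any space is set aside for code or metadata.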

AI Reddit Mar 20, 2026 2 min read

r/MachineLearning Debates Reported ICML Penalties for No-LLM Review Violations

A 184-point r/MachineLearning thread discussed reported ICML enforcement against no-LLM review violations, with commenters focusing on canary-based detection and coauthor risk.

#icml #peer-review #llm-policy

Sciences Mar 20, 2026 2 min read

Google says integrated AI contrail planning cut formation rates 62% on 2,400 flights

Google said on Mar 19, 2026 that it integrated AI contrail forecasts into American Airlines’ existing flight-planning software. Across a trial tied to 2,400 scheduled transatlantic flights, the company says flights that executed the avoidance plan saw a 62% lower contrail formation rate than the control group.

#google #aviation #climate

AI Mar 19, 2026 2 min read

Google DeepMind proposes a cognitive framework for measuring AGI progress

Google DeepMind said on March 17, 2026 that it has published a new cognitive-science framework for evaluating progress toward AGI and launched a Kaggle hackathon to turn that framework into practical benchmarks. The proposal defines 10 cognitive abilities, recommends comparison against human baselines, and puts $200,000 behind community-built evaluations.

#google-deepmind #agi #evaluation

AI Hacker News Mar 19, 2026 2 min read

Hacker News spotlights agent-sat, an autonomous AI system for improving MaxSAT solving

A Hacker News post on March 19, 2026 drew attention to agent-sat, an open-source project that lets AI agents iteratively improve weighted MaxSAT strategies. The repository says it has solved 220 of 229 instances from the 2024 MaxSAT Evaluation, beaten competition-best results on five instances, and produced one novel solve.

#agents #maxsat #optimization

LLM Reddit Mar 18, 2026 2 min read

r/MachineLearning highlights Attention Residuals as Kimi targets fixed-sum PreNorm bottlenecks

A Reddit thread surfaced Kimi's AttnRes paper, which argues that fixed residual accumulation in PreNorm LLMs dilutes deeper layers. The proposed attention-based residual path and its block variant aim to keep the gains without exploding memory cost.

#kimi #llm-architecture #attention

LLM Reddit Mar 13, 2026 2 min read

r/MachineLearning pushes back on an ICML submission that appears fully AI-written

A reviewer in r/MachineLearning says an ICML paper in a no-LLM track reads as if it were fully generated by AI, opening a blunt discussion about enforcement, review burden, and whether writing quality itself has become a policy signal.

#research #peer-review #llm-writing

AI Reddit Mar 13, 2026 2 min read

Researchers Warn That 'Shadow APIs' Are Undermining LLM Reproducibility

A new paper discussed in r/MachineLearning argues that unofficial model-access providers can quietly substitute models and distort both research and production results.

#reproducibility #apis #research

LLM Reddit Mar 13, 2026 2 min read

Reddit Research Notes: A 7-Layer Duplication Trick Climbs the Open LLM Leaderboard

A post in r/MachineLearning argues that duplicating a specific seven-layer block inside Qwen2-72B improved benchmark performance without changing any weights.

#transformers #benchmarks #open-models
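
The layer-duplication idea can be sketched on a plain list of layers. This is a hypothetical illustration: the summary does not say which seven layers were repeated or where the copy was placed, so the `start` index and the choice to insert the repeat directly after the original block are assumptions.

```python
def duplicate_block(layers, start, length=7):
    """Return a new layer list in which layers[start:start+length] runs twice.

    The repeated entries are the same objects as the originals, so no weights
    are copied or modified; only the forward-pass order changes.
    """
    end = start + length
    if start < 0 or end > len(layers):
        raise ValueError("block must lie inside the layer list")
    return layers[:end] + layers[start:end] + layers[end:]

# Toy stand-in for a model: ten named layers, with a three-layer block repeated.
toy_layers = [f"layer_{i}" for i in range(10)]
deeper = duplicate_block(toy_layers, start=2, length=3)
```

Because the repeated block shares its parameters with the original, the parameter count stays the same while the effective depth grows by the block length.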