#open-models

LLM Mar 16, 2026 1 min read

NVIDIA、Nemotron 3 Superを公開　agentic AI向けopen modelで5x higher throughputを提示

NVIDIAはMarch 11, 2026、Nemotron 3 Superを公開した。120-billion-parameter hybrid MoE、12 billion active parameters、1-million-token context、high-accuracy tool callingを組み合わせた open model と説明している。

#nvidia #nemotron #agentic-ai

LLM Hacker News Mar 16, 2026 1 min read

Hacker Newsが注目した最新LLM architectureの可視化リファレンス

Sebastian Raschka の LLM Architecture Gallery は、最近の open model 群を比較しやすい図にまとめ、dense、MoE、hybrid design の違いを一か所で追える点が HN で評価された。

#llm-architectures #transformers #moe

LLM Reddit Mar 16, 2026 1 min read

LocalLLaMAが追ったNVIDIA Nemotron license変更、derivative modelに何が変わるのか

2026年3月15日に高い反応を集めたLocalLLaMA threadは、NVIDIA Nemotron model familyのlicense変更に注目した。現在のNVIDIA Nemotron Model Licenseを以前のOpen Model Licenseと比べると、communityが反応した理由は明快だ。以前のguardrail termination clauseとTrustworthy AIへの参照が見当たらなくなり、代わりにNOTICEベースのattribution構造が前面に出ている。

#nvidia #nemotron #licensing

LLM Reddit Mar 15, 2026 1 min read

LocalLLaMAが注目したNemotronライセンス更新、派生利用の摩擦を下げる可能性

2026年3月15日のLocalLLaMA投稿は、Hugging Face model card commit と NVIDIA のライセンスページを根拠に、Nemotron Super 3 が従来の NVIDIA Open Model License から NVIDIA Nemotron Open Model License へ移ったことを指摘した。

#nemotron #nvidia #licensing

LLM Reddit Mar 13, 2026 1 min read

OmniCoder-9B、42.5万件のagentic trajectoryで学習した9Bコーディングモデル

r/LocalLLaMAでは、Qwen3.5-9BベースのOmniCoder-9Bがfrontier agent tracesを取り込んだ小型open coding modelとして注目されている。

#coding-agents #open-models #qwen

LLM Reddit Mar 13, 2026 1 min read

2枚のRTX 4090でOpen LLM Leaderboard上位に入った7-layer duplication実験

r/MachineLearningでは、重みを変えずに中間7層ブロックを複製するだけでbenchmarkを押し上げたという実験ノートが注目を集めている。

#transformers #benchmarks #open-models

LLM sources.twitter Mar 11, 2026 1 min read

NVIDIA、multi-agent AI向け Nemotron 3 Super を公開

NVIDIA AI Developerは2026年3月11日、12B active parametersを用いるオープン120B-parameter hybrid MoEモデル Nemotron 3 Super を発表した。NVIDIAはnative 1M-token contextと、前世代Nemotron Super比で最大5倍のthroughputを強調している。

#nvidia #nemotron #open-models

LLM sources.twitter Mar 11, 2026 1 min read

Microsoft Foundry、Fireworks AIでAzureのopen model inferenceを強化

Microsoftは、Fireworks AIがMicrosoft Foundryに加わり、Azureでhigh-performanceかつlow-latencyなopen model inferenceを提供すると発表した。day-zero access、custom model持ち込み、enterprise controlを一体で扱える点が中核だ。

#azure #microsoft-foundry #open-models

LLM Reddit Mar 9, 2026 1 min read

Sarvam、Indiaで学習した30B・105B reasoning modelをopen-source化

LocalLLaMAで大きく取り上げられたSarvam AIの発表は、Apache 2.0のreasoning modelであるSarvam 30BとSarvam 105Bを公開するものだ。会社は両モデルがIndiaでscratchから学習され、Mixture-of-Experts設計を土台にreasoning、coding、agentic workflow、Indian-language性能を狙ったと説明している。

#open-models #india #reasoning-models

LLM Mar 8, 2026 1 min read