At the India AI Summit on February 17, Cohere released Tiny Aya, a family of 3.35B-parameter open-weight multilingual models that support 70+ languages and run offline on standard laptops, targeting global language accessibility.
#open-source
zclaw is an open-source personal AI assistant that fits in under 888 KB and runs on an ESP32 microcontroller. Part of the emerging Claw ecosystem, it demonstrates how far edge AI has come.
A new open-source project called ntransformer enables running the ~140GB (FP16) Llama 3.1 70B model on a single consumer RTX 3090 by streaming weights directly from NVMe storage to the GPU, bypassing CPU RAM entirely.
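The sketch below is illustrative only, not ntransformer's actual code: it assumes the checkpoint has been split into hypothetical per-layer state-dict files (weights/layer_000.pt onward) and that a hypothetical `run_layer()` applies one transformer block. Unlike the project's claimed NVMe-to-GPU path, this version still goes through the OS page cache; it only shows the general layer-streaming idea of keeping one layer's weights in VRAM at a time.

```python
# Illustrative layer-streaming sketch (hypothetical file layout and run_layer).
import torch

DEVICE = "cuda"
LAYER_PATHS = [f"weights/layer_{i:03d}.pt" for i in range(80)]  # Llama 3.1 70B has 80 blocks

def stream_forward(hidden, run_layer):
    """Forward pass that holds at most one layer's weights on the GPU at a time."""
    for path in LAYER_PATHS:
        cpu_weights = torch.load(path, map_location="cpu", mmap=True)  # lazy, memory-mapped read
        gpu_weights = {name: t.to(DEVICE) for name, t in cpu_weights.items()}
        hidden = run_layer(hidden, gpu_weights)  # hypothetical per-block compute
        del gpu_weights
        torch.cuda.empty_cache()  # release VRAM before loading the next layer
    return hidden
```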
Andrej Karpathy coined a new term for OpenClaw-like AI agent systems: "Claws." Just as LLM agents were a new layer on top of LLMs, Claws provide orchestration, scheduling, persistent context, and tool calls on top of LLM agents.
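As a rough illustration of that layering (not OpenClaw's API; every name below is a stand-in), a "Claw" can be thought of as a thin loop that restores persistent context, hands a task to an underlying agent along with its tools, and persists the outcome for the next run:

```python
# Hypothetical sketch of the "Claw" layer: orchestration + persistent context
# + tool access wrapped around an existing LLM agent. run_agent and tools are stand-ins.
import json
from pathlib import Path

CONTEXT_FILE = Path("claw_context.json")  # persistent context that survives across invocations

def load_context():
    return json.loads(CONTEXT_FILE.read_text()) if CONTEXT_FILE.exists() else {"history": []}

def claw_step(task, tools, run_agent):
    """One orchestration step: restore context, run the agent, persist the result."""
    ctx = load_context()
    result = run_agent(task, context=ctx, tools=tools)  # hypothetical LLM-agent call
    ctx["history"].append({"task": task, "result": result})
    CONTEXT_FILE.write_text(json.dumps(ctx, indent=2))
    return result
```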
Alibaba launched Qwen 3.5 on February 16 under Apache 2.0, featuring 397B parameters with a sparse MoE architecture (17B active), 256K context, and native multimodal capabilities matching leading US proprietary models on key benchmarks.
A high-upvote LocalLLaMA thread highlighted KittenTTS v0.8, with community-shared details on the 80M/40M/14M model variants, Apache-2.0 licensing, a sub-25MB footprint for the smallest variant, and an edge-friendly focus on local CPU inference.
A high-scoring Hacker News thread highlighted announcement #19759 in ggml-org/llama.cpp: the ggml.ai founding team is joining Hugging Face, while maintainers state ggml/llama.cpp will remain open-source and community-driven.
A popular LocalLLaMA post highlights draft PR #19726, where a contributor proposes porting IQ*_K quantization work from ik_llama.cpp into mainline llama.cpp with initial CPU backend support and early KLD checks.
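For context on the "KLD checks": a common way to sanity-check a new quantization scheme is to compare the quantized model's next-token distributions against the full-precision model's via KL divergence. The sketch below uses placeholder logit arrays and is not llama.cpp's own tooling, which computes this metric internally.

```python
# Minimal KL-divergence quality check between reference and quantized model logits.
import numpy as np

def softmax(logits):
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def mean_kld(ref_logits, quant_logits, eps=1e-12):
    """Mean KL(P_ref || P_quant) over all token positions; lower means less quantization damage."""
    p = softmax(ref_logits)
    q = softmax(quant_logits)
    kld = np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=-1)
    return float(kld.mean())
```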
A high-signal Hacker News post highlighted StepFun's Step 3.5 Flash launch, describing a 196B-parameter MoE foundation model with about 11B active parameters, 256K context, and vendor-reported coding/agent benchmarks.
In a February 12, 2026 post, NVIDIA said major inference providers are reducing token costs by serving open-source frontier models on Blackwell GPUs. The article includes partner-reported gains across healthcare, gaming, and enterprise support workloads.
A high-signal r/gamedev post from February 18, 2026 points to reporting that Godot maintainers are being overwhelmed by low-quality AI-generated code submissions, highlighting a growing governance challenge for open-source game engines.