#local-ai

LLM Reddit Mar 17, 2026 2 min read

r/LocalLLaMA Questions OpenCode’s Local Story After Finding `serve` Proxies the UI to app.opencode.ai

On March 16, 2026, a r/LocalLLaMA post questioning OpenCode’s local behavior reached 389 points and 154 comments. The post argued that the `opencode serve` web UI path proxies to app.opencode.ai and backed that claim with a linked code path plus related GitHub issues and PRs.

#opencode #local-ai #self-hosting

LLM Hacker News Mar 17, 2026 2 min read

Hacker News Resurfaces a Fully Local Home Assistant Voice Stack Built Around llama.cpp

A March 16, 2026 Hacker News thread resurfaced a detailed Home Assistant community write-up that logged 310 points and 92 comments, showing how a local-first voice assistant stack can combine llama.cpp, Parakeet V2 STT, Kokoro TTS, and prompt tuning into a usable system.

#home-assistant #voice-assistant #llama.cpp

108

LLM Reddit Mar 15, 2026 2 min read

LocalLLaMA Highlights a New Linux Path for Running LLMs on AMD Ryzen AI NPUs

Community discussion in LocalLLaMA pointed to a March 11, 2026 FastFlowLM and Lemonade update that brings Linux support to AMD XDNA 2 NPUs, including setup guidance for Ubuntu and Arch systems.

#amd #npu #linux

115

LLM Hacker News Mar 13, 2026 2 min read

Hacker News spots CanIRun.ai, a browser-side local AI compatibility checker

CanIRun.ai runs entirely in the browser, detects GPU, CPU, and RAM through WebGL, WebGPU, and navigator APIs, and estimates which quantized models fit your machine. HN readers liked the idea but immediately pushed on missing hardware entries, calibration, and reverse-lookup features.

#local-ai #llm-inference #hardware

AI Mar 13, 2026 2 min read

Perplexity unveils Personal Computer, an always-on local AI proxy built around a Mac mini

Perplexity has introduced Personal Computer, an always-on local agent system that runs through a continuously operating Mac mini and exposes files, apps, and sessions to Perplexity Computer and the Comet Assistant. The company is pitching it as a persistent AI operating system with human approval, logging, and a kill switch for sensitive actions.

#perplexity #personal-computer #agents

119

LLM Hacker News Mar 11, 2026 2 min read

Hacker News Highlights RunAnywhere's Local Voice AI Stack for Apple Silicon

A Launch HN thread pushed RunAnywhere's RCLI into view as an Apple Silicon-first macOS voice AI stack that combines STT, LLM, TTS, local RAG, and 38 system actions without relying on cloud APIs.

#apple-silicon #local-ai #voice-ai

LLM Hacker News Mar 2, 2026 1 min read

llmfit: Auto-Select the Right LLM Model for Your Hardware

llmfit is an open-source CLI tool that automatically detects your system's RAM, CPU, and GPU specs to recommend the optimal LLM model and quantization level, dramatically lowering the barrier to running local AI.

#llm #open-source #hardware-optimization

AI Reddit Mar 1, 2026 1 min read

Bare-Metal AI: Running LLM Inference Directly in UEFI, No OS or Kernel Required

A developer has implemented a UEFI application that runs LLM inference directly from boot without any operating system or kernel, using zero-dependency C code for the entire stack from tokenizer to inference engine.

#bare-metal #llm-inference #uefi

117

LLM Feb 23, 2026 1 min read

Ollama 0.17 Arrives with New Inference Engine: Up to 40% Faster Local AI

Ollama 0.17, released February 22, introduces a new native inference engine replacing llama.cpp server mode, delivering up to 40% faster prompt processing and 18% faster token generation on NVIDIA GPUs, plus improved multi-GPU tensor parallelism and AMD RDNA 4 support.

#open-source #ollama #local-ai

108

LLM Reddit Feb 22, 2026 2 min read

ggml.ai Team Announces Move to Hugging Face, Reaffirms Full-Time llama.cpp Maintenance

A high-signal LocalLLaMA thread points to llama.cpp Discussion #19759, where maintainers say the ggml team is joining Hugging Face while continuing full-time support for ggml and llama.cpp.

#ggml #llama-cpp #hugging-face

LLM Hacker News Feb 21, 2026 2 min read

HN Tracks ggml.ai Team Joining Hugging Face While Keeping llama.cpp Community Governance

A high-scoring Hacker News thread highlighted announcement #19759 in ggml-org/llama.cpp: the ggml.ai founding team is joining Hugging Face, while maintainers state ggml/llama.cpp will remain open-source and community-driven.

#llama-cpp #ggml #hugging-face