#rag

RSS Feed

LLM 4d ago 1 min read

Google Agentic RAG、足りない根拠を探し直す企業検索へ

Google Researchは、企業向けRAGを一度きりの検索から反復型agentワークフローへ寄せた。十分な文脈を検査する仕組みにより、factualityデータセットで最大34%の精度改善を示している。

#google #rag #agents

AI Hacker News May 10, 2026 1 min read

Gemini APIファイル検索がマルチモーダルRAGに対応

GoogleがGemini APIのファイル検索ツールをアップデートし、画像・音声・動画を含むマルチモーダルコンテンツ対応のRAGシステム構築が可能になった。

#google #gemini #rag

AI Hacker News Apr 26, 2026 1 min read

オープンソースのagent memory「Stash」　HNがすぐ突っ込んだ論点

Hacker Newsはmodel-agnostic memoryという約束に反応したが、スレッドの熱量はすぐに「それでcontext pollutionをどう防ぐのか」という問いへ移った。memoryは増えるほど厄介になるのでは、という懐疑が早かった。

#agent-memory #mcp #pgvector

LLM Apr 17, 2026 1 min read

Cloudflare AI Search、agentごとにBM25とvector indexを持つ

CloudflareはAutoRAGをAI Searchへ再構成し、agentがWorkersから作成・検索できるretrieval primitiveにした。Open betaではBM25とvectorのhybrid search、built-in storage、metadata boosting、cross-instance searchを具体的な無料枠で試せる。

#cloudflare #rag #agents

AI X/Twitter Apr 14, 2026 1 min read

MongoDB、HeidiのAI scribeがAtlas上で81M clinical consultationsまで拡大したと強調

MongoDBは2026年3月20日、HeidiのAI scribeが18か月で190か国超・81 million clinical consultationsまで拡大したとXで述べた。リンク先の公式事例は、AtlasとAtlas Vector Searchがheterogeneousな医療データ統合、RAG、そしてhealthcare環境での無停止スケールを支えたと説明している。

#mongodb #healthcare-ai #vector-search

LLM Hacker News Apr 4, 2026 1 min read

Mintlify、docs assistantのRAGを virtual filesystemに置き換え

Mintlify は docs assistant が複数 page にまたがる答えや exact syntax を拾いにくいという chunked RAG の限界から、Chroma 上の virtual filesystem `ChromaFs` を構築したと説明した。session creation を約 46 秒から約 100ms に縮め、HN では filesystem-first retrieval が agent に合うという反応が目立った。

#rag #agents #docs

LLM Hacker News Mar 27, 2026 1 min read

Hacker Newsが見直したproduction RAGの現実、local modelで451GBを扱う方法

Andros Fenollosaの振り返りがHacker Newsで反応を集めたのは、production RAGをprompt demoではなくdataと運用の問題として描いたからだ。

#rag #llamaindex #chromadb

LLM Hacker News Mar 23, 2026 1 min read

DSPyの採用は遅いのに、なぜ同じLLMパターンを作り直すのか

Hacker Newsで話題になったSkylar Payneの投稿は、AIシステムが成熟すると多くのチームがDSPyのパターンを再発明すると論じる。一方でHNでは、Python中心の実装、prompt optimizationの位置づけ、evals設計の重さが採用の壁として議論された。

#dspy #llm-engineering #hacker-news

LLM Mar 21, 2026 1 min read

IBM Granite、Mellea 0.4.0とGranite Librariesを公開

IBM Graniteは2026-03-20、Mellea 0.4.0とGranite 4.0 Micro向けのGranite Libraries 3種を公開した。prompt-only orchestrationではなく、構造化され safety-aware な workflow を求めるチームに向けた release だ。

#ibm #granite #rag

LLM Mar 16, 2026 1 min read

Google、Gemini Embedding 2をpublic preview公開初のnatively multimodal embedding model

Googleは2026年3月10日、Gemini Embedding 2をpublic previewで公開した。会社はこのmodelがtext、image、PDFのようなmixed multimodal documentを1つのembedding spaceで扱い、benchmark scoreを68.32と53.3まで高めつつ価格とvector dimensionsは維持すると説明している。

#google #embeddings #multimodal

AI Hacker News Mar 14, 2026 1 min read

Hacker News、McKinsey Lilliを突いたAI時代のSQL injection事例に注目

Hacker Newsでは、CodeWallが2026年3月9日に公開したMcKinsey Lilli侵害報告が広く共有された。報告はautonomous agentがunauthenticated endpoint、SQL injection、prompt-layer accessを連鎖させてproduction DBへ到達したと主張している。

#ai-security #sql-injection #rag

LLM Hacker News Mar 13, 2026 1 min read

RAG文書汚染は、なぜoutput filterよりingestion controlが重要なのかを示した

Hacker Newsで注目されたAmine Rajiのlocal ChromaDB実験は、RAGの安全性がpromptそのものよりもsource corpusの汚染とingestion検証に左右される可能性を示している。

#rag #security #retrieval