Google Cloud、BigQuery の autonomous embedding generation を preview 提供

Google Cloudが強調したこと

2026年4月10日、Google Cloud TechはXでBigQueryのautonomous embedding generation previewを紹介した。考え方はシンプルだが実務的だ。別個のpipelineでvectorを生成し、更新し続ける代わりに、BigQuery自身がsource text columnとembedding columnを自動で同期する。

ドキュメントが示すのは generated column に近いパターンだ。source column にデータが追加・更新されると、BigQuery が background で embedding column を生成または更新する。これにより、application data は変わったのに embedding job が走らず vector が stale になるという retrieval system の典型的な運用問題を減らせる。

どう動くのか

ガイドでは CREATE TABLE や ALTER TABLE の中で AI.EMBED 関数を使う例が示されている。BigQuery は生成された embedding を asynchronous に保持し、その後の search や indexing workflow に利用できる。

table には source の STRING column と、自動生成される embedding column を定義できる。
BigQuery は text-embedding-005 のような Vertex AI embedding model を呼んで column を維持できる。
embedding 作成後は vector index を生成し、AI.SEARCH で検索できる。

previewの意味

興味深いのは、external model path と built-in model path の両方を提示している点だ。ドキュメントによれば、BigQuery は Vertex AI hosted embedding model を利用できる一方、built-in の embeddinggemma-300m に対する preview も用意している。この built-in path を使う場合、Google はデータが BigQuery 内にとどまり、Vertex AI の追加料金も発生しないと説明している。これは embedding freshness を別個の ML ops task ではなく、data platform 機能に近づける設計だ。

同時に governance 面も明示されている。Vertex AI endpoint を呼ぶには適切な BigQuery permission、connection resource、そしてその connection service account に必要な Vertex AI user role が要る。つまり、この機能は glue code を減らすが、運用 discipline まで消してくれるわけではない。

なぜ高シグナルなのか

Vector search は production AI system の中でも特に面倒になりやすい部分だった。多くのチームが「source data の変更検知、embedding 再生成、保存、re-index」というpipelineを別途管理してきたからだ。Autonomous embedding generation はこの反復パターンを database layer に圧縮する。すでに BigQuery を中心に運用している enterprise チームにとっては、RAG、semantic search、similarity workflow をより新鮮に保ち、pipeline drift で壊れる可能性を下げる動きになりうる。

出典: Google Cloud Tech X投稿 · BigQuery documentation

Google Cloud、BigQuery の autonomous embedding generation を preview 提供

Google Cloudが強調したこと

どう動くのか

previewの意味

なぜ高シグナルなのか

Related Articles

Google DeepMind、Gemini Embedding 2をpreview公開しmultimodal retrievalを前進

TensorFlowからJAXへ6倍速く、Google Cloudの複数エージェント移行実験

13年前のXeonでGemma 4 26B、GPUなし5 tok/s