MosaicLeaks shows research agents can leak secrets through search
Original: MosaicLeaks: Can your research agent keep a secret? View original →
The privacy risk in deep research agents is not limited to the final answer. MosaicLeaks, published by ServiceNow researchers on Hugging Face, focuses on a quieter channel: the web queries an agent writes while moving between private local documents and public sources.
The failure mode is easy to miss. A research agent may read a private internal metric, date, or vendor name, then use that fact to form the next web search. No single query has to contain the whole secret. But an observer watching outbound traffic can piece together the fragments and infer information that was supposed to stay inside the local corpus.
MosaicLeaks turns that risk into a benchmark. The task chains interleave local and web hops, so the answer to one private sub-question becomes the bridge entity for a later public search. The final split includes 559 training chains, 98 validation chains, and 344 held-out-company test chains. The agent uses a simplified harness with planning, retrieval, document selection, reading, and resolution steps.
The uncomfortable result is that making an agent better at the task can make it less private. For Qwen3-4B, training only for higher chain success raised strict chain success from 48.7% to 59.3%, but answer/full-information leakage rose from 34.0% to 51.7%. The model learned to pack more useful context into web queries, which helped retrieval and hurt confidentiality.
The proposed fix, Privacy-Aware Deep Research, changes the objective. PA-DR raised strict chain success from 48.7% to 58.7% while reducing answer/full-information leakage from 34.0% to 9.9%. For enterprise agents, this is the next evaluation frontier: not just whether the answer is correct, but whether the path to that answer exposes private context.
Related Articles
The acquisition points Codex toward enterprise agents that keep working after a laptop closes. OpenAI says Codex now has more than 5 million weekly users, up 400% from earlier this year, while Ona brings cloud environments used by 2 million developers.
Google has put Deep Research on Gemini 3.1 Pro, added MCP connections, and created a Max mode that searches more sources for harder research jobs. The April 21 preview targets finance and life sciences teams that need web evidence, uploaded files and licensed data in one workflow.
xAI is pushing Grok deeper into enterprise AI infrastructure by joining Databricks Agent Bricks. The move puts Grok beside OpenAI, Anthropic, Gemini, Qwen, and Kimi inside a governed agent-building platform.