PaperQA3 pushes Edison's science agent across 150M papers and patents
Original: ICYMI 🔬 @edisonsci is redefining scientific discovery. PaperQA3 can now reason across 150M+ research papers & patents and achieved industry-leading accuracy on the LABBench2 benchmark. See how AI accelerates deep research for science: https://edisonscientific.com/articles/edison-literature-agent View original →
What NVIDIA AI Dev posted on X
On March 27, 2026, NVIDIA AI Dev spotlighted Edison Scientific's PaperQA3 and said it can reason across 150M+ research papers and patents while achieving industry-leading accuracy on the LABBench2 benchmark. Even in a crowded field of research assistants, that is a meaningful claim because the hardest scientific workflows rarely depend on plain text alone.
Researchers need systems that can find, compare, and interpret figures, tables, methods, and claims across huge literatures. A paper-reading agent that cannot see those artifacts is limited in exactly the places many scientific questions become difficult.
What Edison's article adds
Edison's own write-up introduces PaperQA3 as a frontier multimodal deep research agent for science. The company says Edison Literature and Kosmos can now read figures and tables from more than 150M research papers and patents, and can inspect hundreds of visual elements before responding. Edison presents this as a major step beyond PaperQA2, which was limited to text pulled from search results.
The article is also careful about performance framing. Edison says the upgraded system is among the strongest deep-research agents on relevant LABBench2 subsets and on two Humanity's Last Exam variants, beating current frontier deep-research agents in those evaluations. The company also says the PaperQA3-backed versions are already available on its platform and API, which turns the story from a lab preview into a deployable product update.
Why this matters
This is high-signal for science and AI tooling because multimodal reading changes what a research agent can actually do. Many critical scientific details live in charts, microscopy images, benchmark tables, ablation plots, or supplementary visuals. A system that can reason over those artifacts alongside text is much closer to real literature review work.
The larger implication is that deep-research products are shifting from broad web synthesis toward domain-specialized reasoning systems. If Edison's benchmarks and scale claims hold up in practice, PaperQA3 points to a new baseline for scientific assistants: not only finding relevant papers, but extracting evidence from the visual structure of the literature itself.
Sources: NVIDIA AI Dev X post · Edison Scientific article
Related Articles
Google DeepMind unveiled Gemini for Science at I/O 2026, a suite of experimental AI tools designed to help scientists explore hypotheses, validate work at scale, and analyze scientific literature.
Microsoft Discovery became generally available on June 2 for organizations building governed R&D workflows. The platform connects specialized agents, scientific knowledge, simulation tools, validation data, and a new local preview app for researchers.
Life-science AI is moving from literature help toward executable research workflows. OpenAI says GPT-Rosalind reached 27.5% on MedChemBench, 21.6% on GeneBench, and 63.2% on LabWorkBench.