Meta introduces TRIBE v2, a tri-modal foundation model for in-silico neuroscience
Original: Meta introduces TRIBE v2, a tri-modal foundation model for in-silico neuroscience View original →
On March 26, 2026, AI at Meta introduced TRIBE v2 on X as a foundation model for predicting how the human brain responds to sight, sound, and language. The linked paper page and interactive demo make clear that Meta is framing the project not as a narrow benchmark result, but as a reusable computational layer for in-silico neuroscience.
The scale claims are notable. In the X post, Meta said TRIBE v2 draws on 500+ hours of fMRI recordings from 700+ people. The paper abstract describes the training base as over 1,000 hours of fMRI across 720 subjects. The phrasing differs across the materials, but the direction is consistent: TRIBE v2 is meant to replace many specialized, task-specific brain-response models with a broader tri-modal foundation model that generalizes across stimuli, tasks, and individuals.
The demo describes a three-stage pipeline. First, pretrained audio, video, and text embeddings represent the incoming stimulus. Second, a transformer learns universal representations shared across modalities, tasks, and subjects. Third, a subject layer maps those shared representations onto individual fMRI voxels. Meta says the result scales to whole-brain prediction across 70,000 voxels, far beyond the 1,000 cortical predictions of TRIBE v1, and achieves a 2-3x improvement over standard methods for zero-shot prediction on new subjects and stimuli.
The larger significance is what Meta says the model can do once trained. The paper argues that TRIBE v2 can reproduce established visual and neuro-linguistic findings in silico, while the demo says it can turn months of lab preparation into seconds of computation. In other words, Meta is presenting TRIBE v2 as a tool for planning experiments, testing hypotheses, and exploring multisensory brain organization before or alongside expensive real-world studies.
Meta is also releasing the surrounding research assets rather than only the headline. The thread points researchers to the model weights, code, paper, and demo. That open release strategy matters because it turns TRIBE v2 from a one-off research announcement into something outside groups can inspect, reproduce, and build on. As a result, this is one of the clearer recent examples of foundation-model ideas moving beyond text and image generation into scientific measurement itself.
Related Articles
Meta said on March 26, 2026 that TRIBE v2 can predict high-resolution fMRI brain activity with zero-shot generalization across new subjects, languages, and tasks. The company is also releasing the model, code, paper, and demo for researchers.
UCLA researchers have identified DDL-920, the first drug to fully reproduce the effects of physical stroke rehabilitation in model mice. The findings, published in Nature Communications, could transform stroke recovery into a pharmacological option.
Google DeepMind unveiled an AI Co-Mathematician system — a multi-agent Gemini-based framework scoring 48% on FrontierMath Tier 4, the highest ever for any AI. AlphaEvolve improved lower bounds on five Ramsey numbers, including R(3,13) whose previous record had stood for 11 years.