TRIBE v2 (Trimodal Brain Encoder): A foundation model trained to predict how the human brain responds to almost any sight or sound.
Quote from the paper:
"The present results strengthen the possibility of a paradigm shift in neuroscience... moving from the fragmented mapping of isolated cognitive tasks toward the use of unified, predictive foundation models of brain and cognitive functions. By aligning the representations of AI systems to those of the human brain, we demonstrate that a single architecture can integrate a vast range of fMRI responses across hundreds of individuals, extending the framework that led the 2025 Algonauts competition. The observed log-linear scaling of encoding accuracy mirroring power laws in both artificial intelligence and neuroscience suggests that the ceiling for predicting human brain activity is yet to be reached."
Paper, model, and code: https://aidemos.atmeta.com/tribev2/