Scaling Vision Transformers for Functional MRI with Flat Maps
NeurIPS workshop, 2025
We project volumetric fMRI into videos of 2D flat-map activity and pretrain spatiotemporal MAE Vision Transformers on 2.3K hours of Human Connectome Project data, revealing strict power-law scaling and strong downstream decoding of both shared brain states and subject-specific traits.