Timezone: »
Several deep neural networks have recently been shown to generate activations similar to those of the brain in response to the same input. These algorithms, however, remain largely implausible: they require (1) extraordinarily large amounts of data, (2) unobtainable supervised labels, (3) textual rather than raw sensory input, and / or (4) implausibly large memory (e.g. thousands of contextual words). These elements highlight the need to identify algorithms that, under these limitations, would suffice to account for both behavioral and brain responses. Focusing on speech processing, we here hypothesize that self-supervised algorithms trained on the raw waveform constitute a promising candidate. Specifically, we compare a recent self-supervised model, wav2vec 2.0, to the brain activity of 412 English, French, and Mandarin individuals recorded with functional Magnetic Resonance Imaging (fMRI), while they listened to approximately one hour of audio books. First, we show that this algorithm learns brain-like representations with as little as 600 hours of unlabelled speech -- a quantity comparable to what infants can be exposed to during language acquisition. Second, its functional hierarchy aligns with the cortical hierarchy of speech processing. Third, different training regimes reveal a functional specialization akin to the cortex: wav2vec 2.0 learns sound-generic, speech-specific and language-specific representations similar to those of the prefrontal and temporal cortices. Fourth, we confirm the similarity of this specialization with the behavior of 386 additional participants. These elements, resulting from the largest neuroimaging benchmark to date, show how self-supervised learning can account for a rich organization of speech processing in the brain, and thus delineate a path to identify the laws of language acquisition which shape the human brain.
Author Information
Juliette MILLET (Université Paris Cite)
Charlotte Caucheteux (Meta AI / Inria)
pierre orhan (Ecole Normale Supérieure de Paris)
Yves Boubenec (École Normale Supérieure)
Alexandre Gramfort (Meta)
Ewan Dunbar (University of Toronto)
Christophe Pallier
Jean-Remi King (CNRS)
More from the Same Authors
-
2021 : Electromagnetic neural source imaging under sparsity constraints with SURE-based hyperparameter tuning »
Pierre-Antoine Bannier · Quentin Bertrand · Joseph Salmon · Alexandre Gramfort -
2022 : Validation Diagnostics for SBI algorithms based on Normalizing Flows »
Julia Linhart · Alexandre Gramfort · Pedro Rodrigues -
2022 Poster: Benchopt: Reproducible, efficient and collaborative optimization benchmarks »
Thomas Moreau · Mathurin Massias · Alexandre Gramfort · Pierre Ablin · Pierre-Antoine Bannier · Benjamin Charlier · Mathieu Dagréou · Tom Dupre la Tour · Ghislain DURIF · Cassio F. Dantas · Quentin Klopfenstein · Johan Larsson · En Lai · Tanguy Lefort · Benoît Malézieux · Badr MOUFAD · Binh T. Nguyen · Alain Rakotomamonjy · Zaccharie Ramzi · Joseph Salmon · Samuel Vaiter -
2022 Poster: Deep invariant networks with differentiable augmentation layers »
Cédric ROMMEL · Thomas Moreau · Alexandre Gramfort -
2021 Poster: HNPE: Leveraging Global Parameters for Neural Posterior Estimation »
Pedro Rodrigues · Thomas Moreau · Gilles Louppe · Alexandre Gramfort -
2021 : The NeurIPS 2021 BEETL Competition: Benchmarks for EEG Transfer Learning + Q&A »
Xiaoxi Wei · Vinay Jayaram · Sylvain Chevallier · Giulia Luise · Camille Jeunet · Moritz Grosse-Wentrup · Alexandre Gramfort · Aldo A Faisal -
2021 : Enhanced Zero-Resource Speech Challenge 2021: Language Modelling from Speech and Images + Q&A »
Ewan Dunbar · Alejandrina Cristia · Okko Räsänen · Bertrand Higy · Marvin Lavechin · Grzegorz Chrupała · Afra Alishahi · Chen Yu · Maureen De Seyssel · Tu Anh Nguyen · Mathieu Bernard · Nicolas Hamilakis · Emmanuel Dupoux -
2021 Poster: Shared Independent Component Analysis for Multi-Subject Neuroimaging »
Hugo Richard · Pierre Ablin · Bertrand Thirion · Alexandre Gramfort · Aapo Hyvarinen -
2020 Poster: Modeling Shared responses in Neuroimaging Studies through MultiView ICA »
Hugo Richard · Luigi Gresele · Aapo Hyvarinen · Bertrand Thirion · Alexandre Gramfort · Pierre Ablin -
2020 Spotlight: Modeling Shared responses in Neuroimaging Studies through MultiView ICA »
Hugo Richard · Luigi Gresele · Aapo Hyvarinen · Bertrand Thirion · Alexandre Gramfort · Pierre Ablin -
2020 Poster: Statistical control for spatio-temporal MEG/EEG source imaging with desparsified mutli-task Lasso »
Jerome-Alexis Chevalier · Joseph Salmon · Alexandre Gramfort · Bertrand Thirion -
2019 Poster: Handling correlated and repeated measurements with the smoothed multivariate square-root Lasso »
Quentin Bertrand · Mathurin Massias · Alexandre Gramfort · Joseph Salmon -
2019 Poster: Learning step sizes for unfolded sparse coding »
Pierre Ablin · Thomas Moreau · Mathurin Massias · Alexandre Gramfort -
2019 Poster: Manifold-regression to predict from MEG/EEG brain signals without source modeling »
David Sabbagh · Pierre Ablin · Gael Varoquaux · Alexandre Gramfort · Denis A. Engemann -
2018 Poster: Multivariate Convolutional Sparse Coding for Electromagnetic Brain Signals »
Tom Dupré la Tour · Thomas Moreau · Mainak Jas · Alexandre Gramfort -
2017 Poster: Learning the Morphology of Brain Signals Using Alpha-Stable Convolutional Sparse Coding »
Mainak Jas · Tom Dupré la Tour · Umut Simsekli · Alexandre Gramfort -
2016 Poster: GAP Safe Screening Rules for Sparse-Group Lasso »
Eugene Ndiaye · Olivier Fercoq · Alexandre Gramfort · Joseph Salmon -
2015 Poster: GAP Safe screening rules for sparse multi-task and multi-class models »
Eugene Ndiaye · Olivier Fercoq · Alexandre Gramfort · Joseph Salmon -
2010 Poster: Brain covariance selection: better individual functional connectivity models using population prior »
Gaël Varoquaux · Alexandre Gramfort · Jean-Baptiste Poline · Bertrand Thirion