UniCoRN: Unified Cognitive Signal ReconstructioN bridging cognitive signals and human language (2307.05355v1)
Abstract: Decoding text stimuli from cognitive signals (e.g. fMRI) enhances our understanding of the human language system, paving the way for building versatile Brain-Computer Interface. However, existing studies largely focus on decoding individual word-level fMRI volumes from a restricted vocabulary, which is far too idealized for real-world application. In this paper, we propose fMRI2text, the first openvocabulary task aiming to bridge fMRI time series and human language. Furthermore, to explore the potential of this new task, we present a baseline solution, UniCoRN: the Unified Cognitive Signal ReconstructioN for Brain Decoding. By reconstructing both individual time points and time series, UniCoRN establishes a robust encoder for cognitive signals (fMRI & EEG). Leveraging a pre-trained LLM as decoder, UniCoRN proves its efficacy in decoding coherent text from fMRI series across various split settings. Our model achieves a 34.77% BLEU score on fMRI2text, and a 37.04% BLEU when generalized to EEGto-text decoding, thereby surpassing the former baseline. Experimental results indicate the feasibility of decoding consecutive fMRI volumes, and the effectiveness of decoding different cognitive signals using a unified structure.
- Brain2word: decoding brain activity for language generation. arXiv preprint arXiv:2009.04765.
- Towards reconstructing intelligible speech from the human auditory cortex. Scientific reports, 9(1):874.
- Localising memory retrieval and syntactic composition: an fmri study of naturalistic language comprehension. Language, Cognition and Neuroscience, 34(4):491–510.
- Where is the semantic system? a critical review and meta-analysis of 120 functional neuroimaging studies. Cerebral cortex, 19(12):2767–2796.
- Seeing beyond the brain: Conditional diffusion model with sparse masked modeling for vision decoding. arXiv preprint arXiv:2211.06956.
- Decoding speech from non-invasive brain recordings. arXiv preprint arXiv:2208.12266.
- fmri brain decoding and its applications in brain–computer interface: A survey. Brain Sciences, 12(2):228.
- Liberty S Hamilton and Alexander G Huth. 2020. The revolution will not be controlled: natural stimuli in speech neuroscience. Language, cognition and neuroscience, 35(5):573–582.
- Zuco, a simultaneous eeg and eye-tracking resource for natural sentence reading. Scientific data, 5(1):1–13.
- Natural speech reveals the semantic maps that tile human cerebral cortex. Nature, 532(7600):453–458.
- Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461.
- Very deep transformers for neural machine translation. arXiv preprint arXiv:2008.07772.
- Pre-training and fine-tuning transformers for fmri prediction tasks. arXiv preprint arXiv:2112.05761.
- Advanced bioelectrical signal processing methods: Past, present and future approach—part ii: Brain signals. Sensors, 21(19):6343.
- Generalizable spelling using a speech neuroprosthesis in an individual with severe limb and vocal paralysis. Nature Communications, 13(1):1–15.
- Predicting human brain activity associated with the meanings of nouns. science, 320(5880):1191–1195.
- Brain computer interface advancement in neurosciences: Applications and issues. Interdisciplinary Neurosurgery, 20:100694.
- Keep it real: rethinking the primacy of experimental control in cognitive neuroscience. NeuroImage, 222:117254.
- The “narratives” fmri dataset for evaluating models of naturalistic language comprehension. Scientific data, 8(1):1–22.
- Uta Noppeney and Catherine J Price. 2004. An fmri study of syntactic adaptation. Journal of Cognitive Neuroscience, 16(4):702–713.
- Mark Pagel. 2017. Q&a: What is human language, when did it evolve and why should we care? BMC biology, 15(1):1–6.
- Zero-shot learning with semantic output codes. Advances in neural information processing systems, 22.
- Jerrin Thomas Panachakel and Angarai Ganesan Ramakrishnan. 2021. Decoding covert speech from eeg-a comprehensive review. Frontiers in Neuroscience, page 392.
- Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32.
- Toward a universal decoder of linguistic meaning from brain activation. Nature communications, 9(1):1–13.
- Kevin M Pitt and Aimee Dietz. 2022. Applying implementation science to support active collaboration in noninvasive brain–computer interface development and translation for augmentative and alternative communication. American Journal of Speech-Language Pathology, 31(1):515–526.
- Bpe-dropout: Simple and effective subword regularization. arXiv preprint arXiv:1910.13267.
- Adrien B Rapeaux and Timothy G Constandinou. 2021. Implantable brain machine interfaces: first-in-human studies, technology challenges and trends. Current opinion in biotechnology, 72:102–111.
- Real-time fmri brain computer interfaces: self-regulation of single brain regions to networks. Biological psychology, 95:4–20.
- Progress in brain computer interface: Challenges and opportunities. Frontiers in Systems Neuroscience, 15:578875.
- Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the 2013 conference on empirical methods in natural language processing, pages 1631–1642.
- Towards sentence-level brain decoding with distributed representations. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 7047–7054.
- Attention is all you need. Advances in neural information processing systems, 30.
- Detection of sounds in the auditory stream: event-related fmri evidence for differential activation to speech and nonspeech. Journal of Cognitive Neuroscience, 13(7):994–1005.
- Fine-grained neural decoding with distributed word representations. Information Sciences, 507:256–272.
- Zhenhailong Wang and Heng Ji. 2022. Open vocabulary electroencephalography-to-text decoding and zero-shot sentiment classification. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 5350–5358.
- Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, pages 38–45.
- Jonathan R Wolpaw. 2007. Brain-computer interfaces (bcis) for communication and control. In Proceedings of the 9th international ACM SIGACCESS conference on Computers and accessibility, pages 1–2.
- Towards brain-to-text generation: Neural decoding with pre-trained encoder-decoder models. In NeurIPS 2021 AI for Science Workshop.