Speech synthesis from invasive brain signals: From offline analysis to closed-loop speech decoding
File | Description | Size | Format |
---|---|---|---|
dissertation_miguel_angrick.pdf | Dissertation by Miguel Angrick | 54.83 MB | Adobe PDF |
Authors: | Angrick, Miguel |
Supervisor: | Schultz, Tanja |
1. Expert: | Schultz, Tanja |
Experts: | Krusienski, Dean |
Abstract: | Several neurological diseases and disorders can impair speech communication and lead to the complete loss of the ability to speak. Brain-Computer Interfaces (BCIs), systems that directly receive neural signals as input to control a computing device, raise hope for speech neuroprosthetics that provide an alternative communication channel and thus restore speech communication for speech-impaired people. This cumulative dissertation examines methods for synthesizing acoustic speech from brain activity data. Invasive neuroimaging techniques for measuring electrophysiological brain activity have shown suitable characteristics for capturing the complex dynamics of spoken language in both spatial and temporal resolution. Speech processes can be decoded from neural data using appropriate decoding approaches, and strategies from speech synthesis contribute to generating waveforms that are subsequently played back through a loudspeaker. For this purpose, the dissertation first investigates methods to reconstruct spoken speech from experimental recordings in offline analysis, in order to develop suitable algorithms that can transform brain activity data into high-quality acoustic speech. From here, the focus shifts towards closed-loop speech decoding and synthesis, and the dissertation presents techniques to convert neural speech processes into audible speech in real time, which can be output as continuous feedback over a loudspeaker. The cumulative dissertation concludes with a discussion of the presented approaches and their limitations, and presents a related modality in which natural speech could be assisted and possibly restored by electrical stimulation of orofacial muscles. |
Keywords: | speech synthesis; brain-computer interfaces; speech neuroprosthetics |
Issue Date: | 29-Oct-2021 |
Type: | Dissertation |
Secondary publication: | no |
DOI: | 10.26092/elib/1179 |
URN: | urn:nbn:de:gbv:46-elib54431 |
Institution: | Universität Bremen |
Faculty: | Fachbereich 03: Mathematik/Informatik (FB 03) |
Appears in Collections: | Dissertationen |
Items in Media are protected by copyright, with all rights reserved, unless otherwise indicated.