Automatic Speech Recognition on Vibrocervigraphic and Electromyographic Signals

Automatic Speech Recognition on Vibrocervigraphic and Electromyographic Signals
Author :
Publisher :
Total Pages : 166
Release :
ISBN-10 : OCLC:653197538
ISBN-13 :
Rating : 4/5 (38 Downloads)

Synopsis Automatic Speech Recognition on Vibrocervigraphic and Electromyographic Signals by : Szu-Chen Stan Jou

Abstract: "Automatic speech recognition (ASR) is a computerized speech-to-text process, in which speech is usually recorded with acoustical microphones by capturing air pressure changes. This kind of air-transmitted speech signal is prone to two kinds of problems related to noise robustness and applicability. The former means the mixing of speech signal and ambient noise usually deteriorates ASR performance. The latter means speech could be overheard easily on the air-transmission channel, and this often results in privacy loss or annoyance to other people. This thesis research solves these two problems by using channels that contact the human body without air transmission, i.e., by vibrocervigraphic and electromyographic methods. The vibrocervigraphic (VCG) method measures the throat vibration with a ceramic piezoelectric transducer contact to the skin on the neck, and the electromyographic (EMG) method measures the muscular electric potential with a set of electrodes attached to the skin where the articulatory muscles underlie. The VCG and EMG methods are inherently more robust to ambient noise, and they make it possible to recognize whispered and silent speech to improve applicability. The major contribution of this dissertation includes feature design and adaptation for optimizing features, acoustic model adaptation for adapting traditional acoustic models onto different feature spaces, and articulatory feature classification for incorporating articulatory information to improve recognition. For VCG ASR, the combination of feature transformation methods and maximum a posteriori adaptation improves the recognition accuracy even with a very small data set. On top of that, additive performance gain is achieved by applying maximum likelihood linear regression and feature space adaptation with different data granularities in order to adapt to channel variations as well as to speaker variations. For EMG ASR, we propose the Concise EMG feature that extracts representative EMG characteristics. It improves the recognition accuracy and advances the EMG ASR research from isolated word recognition to phone-based continuous speech recognition. Articulatory features are studied in both VCG and EMG ASR to analyze the systems and improve recognition accuracy. These techniques are demonstrated to be effective on both experimental evaluations and prototype applications."

Advancing Electromyographic Continuous Speech Recognition: Signal Preprocessing and Modeling

Advancing Electromyographic Continuous Speech Recognition: Signal Preprocessing and Modeling
Author :
Publisher : KIT Scientific Publishing
Total Pages : 256
Release :
ISBN-10 : 9783731502111
ISBN-13 : 3731502119
Rating : 4/5 (11 Downloads)

Synopsis Advancing Electromyographic Continuous Speech Recognition: Signal Preprocessing and Modeling by : Wand, Michael

Speech is the natural medium of human communication, but audible speech can be overheard by bystanders and excludes speech-disabled people. This work presents a speech recognizer based on surface electromyography, where electric potentials of the facial muscles are captured by surface electrodes, allowing speech to be processed nonacoustically. A system which was state-of-the-art at the beginning of this book is substantially improved in terms of accuracy, flexibility, and robustness.

Automatic Speech and Speaker Recognition

Automatic Speech and Speaker Recognition
Author :
Publisher : Springer Science & Business Media
Total Pages : 524
Release :
ISBN-10 : 9781461313670
ISBN-13 : 1461313678
Rating : 4/5 (70 Downloads)

Synopsis Automatic Speech and Speaker Recognition by : Chin-Hui Lee

Research in the field of automatic speech and speaker recognition has made a number of significant advances in the last two decades, influenced by advances in signal processing, algorithms, architectures, and hardware. These advances include: the adoption of a statistical pattern recognition paradigm; the use of the hidden Markov modeling framework to characterize both the spectral and the temporal variations in the speech signal; the use of a large set of speech utterance examples from a large population of speakers to train the hidden Markov models of some fundamental speech units; the organization of speech and language knowledge sources into a structural finite state network; and the use of dynamic, programming based heuristic search methods to find the best word sequence in the lexical network corresponding to the spoken utterance. Automatic Speech and Speaker Recognition: Advanced Topics groups together in a single volume a number of important topics on speech and speaker recognition, topics which are of fundamental importance, but not yet covered in detail in existing textbooks. Although no explicit partition is given, the book is divided into five parts: Chapters 1-2 are devoted to technology overviews; Chapters 3-12 discuss acoustic modeling of fundamental speech units and lexical modeling of words and pronunciations; Chapters 13-15 address the issues related to flexibility and robustness; Chapter 16-18 concern the theoretical and practical issues of search; Chapters 19-20 give two examples of algorithm and implementational aspects for recognition system realization. Audience: A reference book for speech researchers and graduate students interested in pursuing potential research on the topic. May also be used as a text for advanced courses on the subject.

Surface Electromyography Based Speech Recognition System and Development Toolkit

Surface Electromyography Based Speech Recognition System and Development Toolkit
Author :
Publisher :
Total Pages :
Release :
ISBN-10 : OCLC:774919537
ISBN-13 :
Rating : 4/5 (37 Downloads)

Synopsis Surface Electromyography Based Speech Recognition System and Development Toolkit by : Daniel Chang

This thesis describes the implementation of an automatic speech recognition system based on surface electromyography signals. Data collection was done using a bipolar electrode configuration with a sampling rate of 5.77 kHz. Four feature sets, the short-time Fourier transform (STFT), the dual-tree complex wavelet transform (DTCWT), a non-causal time-domain based (E4-NC), and a causal version of E4-NC (E4-C) were implemented. Classification was performed using a hidden Markov model (HMM). The system implemented was able to achieve an accuracy rate of 74.24% with E4-NC and 61.25% with E4-C. These results are comparable to previously reported results for offline, single session, isolated word recognition. Additional testing was performed on five subjects using E4-C and yielded accuracy rates ranging from 51.8% to 81.88% with an average accuracy rate of 64.9% during offline, single session, isolated word recognition. The E4-C was chosen since it offered the best performance among the causal feature sets and non-causal feature sets cannot be used with real-time online classification. Online classification capabilities were implemented and simulations using the confidence interval (CI) and minimum noise likelihood (MNL) decision rubrics yielded accuracy rates of 77.5% and 72.5%, respectively, during online, single session, isolated word recognition.

Robustness in Automatic Speech Recognition

Robustness in Automatic Speech Recognition
Author :
Publisher : Springer Science & Business Media
Total Pages : 457
Release :
ISBN-10 : 9781461312970
ISBN-13 : 1461312973
Rating : 4/5 (70 Downloads)

Synopsis Robustness in Automatic Speech Recognition by : Jean-Claude Junqua

Foreword Looking back the past 30 years. we have seen steady progress made in the area of speech science and technology. I still remember the excitement in the late seventies when Texas Instruments came up with a toy named "Speak-and-Spell" which was based on a VLSI chip containing the state-of-the-art linear prediction synthesizer. This caused a speech technology fever among the electronics industry. Particularly. applications of automatic speech recognition were rigorously attempt ed by many companies. some of which were start-ups founded just for this purpose. Unfortunately. it did not take long before they realized that automatic speech rec ognition technology was not mature enough to satisfy the need of customers. The fever gradually faded away. In the meantime. constant efforts have been made by many researchers and engi neers to improve the automatic speech recognition technology. Hardware capabilities have advanced impressively since that time. In the past few years. we have been witnessing and experiencing the advent of the "Information Revolution." What might be called the second surge of interest to com mercialize speech technology as a natural interface for man-machine communication began in much better shape than the first one. With computers much more powerful and faster. many applications look realistic this time. However. there are still tremendous practical issues to be overcome in order for speech to be truly the most natural interface between humans and machines.

Acoustical and Environmental Robustness in Automatic Speech Recognition

Acoustical and Environmental Robustness in Automatic Speech Recognition
Author :
Publisher : Springer Science & Business Media
Total Pages : 197
Release :
ISBN-10 : 9781461531227
ISBN-13 : 1461531225
Rating : 4/5 (27 Downloads)

Synopsis Acoustical and Environmental Robustness in Automatic Speech Recognition by : A. Acero

The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received only a small fraction of the attention devoted to speaker independence, even speech recognition systems that are designed to be speaker independent frequently perform very poorly when they are tested using a different type of microphone or acoustical environment from the one with which they were trained. The use of microphones other than a "close talking" headset also tends to severely degrade speech recognition -performance. Even in relatively quiet office environments, speech is degraded by additive noise from fans, slamming doors, and other conversations, as well as by the effects of unknown linear filtering arising reverberation from surface reflections in a room, or spectral shaping by microphones or the vocal tracts of individual speakers. Speech-recognition systems designed for long-distance telephone lines, or applications deployed in more adverse acoustical environments such as motor vehicles, factory floors, oroutdoors demand far greaterdegrees ofenvironmental robustness. There are several different ways of building acoustical robustness into speech recognition systems. Arrays of microphones can be used to develop a directionally-sensitive system that resists intelference from competing talkers and other noise sources that are spatially separated from the source of the desired speech signal.

Speech Recognition

Speech Recognition
Author :
Publisher : BoD – Books on Demand
Total Pages : 580
Release :
ISBN-10 : 9789537619299
ISBN-13 : 953761929X
Rating : 4/5 (99 Downloads)

Synopsis Speech Recognition by : France Mihelič

Chapters in the first part of the book cover all the essential speech processing techniques for building robust, automatic speech recognition systems: the representation for speech signals and the methods for speech-features extraction, acoustic and language modeling, efficient algorithms for searching the hypothesis space, and multimodal approaches to speech recognition. The last part of the book is devoted to other speech processing applications that can use the information from automatic speech recognition for speaker identification and tracking, for prosody modeling in emotion-detection systems and in other speech processing applications that are able to operate in real-world environments, like mobile communication services and smart homes.

Automatic Speech & Speaker Recognition

Automatic Speech & Speaker Recognition
Author :
Publisher : Institute of Electrical & Electronics Engineers(IEEE)
Total Pages : 448
Release :
ISBN-10 : UOM:39015002081563
ISBN-13 :
Rating : 4/5 (63 Downloads)

Synopsis Automatic Speech & Speaker Recognition by : N. Rex Dixon

Advancing Electromyographic Continuous Speech Recognition

Advancing Electromyographic Continuous Speech Recognition
Author :
Publisher :
Total Pages : 252
Release :
ISBN-10 : 1013282574
ISBN-13 : 9781013282577
Rating : 4/5 (74 Downloads)

Synopsis Advancing Electromyographic Continuous Speech Recognition by : Michael Wand

Speech is the natural medium of human communication, but audible speech can be overheard by bystanders and excludes speech-disabled people. This work presents a speech recognizer based on surface electromyography, where electric potentials of the facial muscles are captured by surface electrodes, allowing speech to be processed nonacoustically. A system which was state-of-the-art at the beginning of this book is substantially improved in terms of accuracy, flexibility, and robustness. This work was published by Saint Philip Street Press pursuant to a Creative Commons license permitting commercial use. All rights not granted by the work's license are retained by the author or authors.