Theory and Applications of Spherical Microphone Array Processing

Theory and Applications of Spherical Microphone Array Processing
Author :
Publisher : Springer
Total Pages : 201
Release :
ISBN-10 : 9783319422114
ISBN-13 : 3319422111
Rating : 4/5 (14 Downloads)

Synopsis Theory and Applications of Spherical Microphone Array Processing by : Daniel P. Jarrett

This book presents the signal processing algorithms that have been developed to process the signals acquired by a spherical microphone array. Spherical microphone arrays can be used to capture the sound field in three dimensions and have received significant interest from researchers and audio engineers. Algorithms for spherical array processing are different to corresponding algorithms already known in the literature of linear and planar arrays because the spherical geometry can be exploited to great beneficial effect. The authors aim to advance the field of spherical array processing by helping those new to the field to study it efficiently and from a single source, as well as by offering a way for more experienced researchers and engineers to consolidate their understanding, adding either or both of breadth and depth. The level of the presentation corresponds to graduate studies at MSc and PhD level. This book begins with a presentation of some of the essential mathematical and physical theory relevant to spherical microphone arrays, and of an acoustic impulse response simulation method, which can be used to comprehensively evaluate spherical array processing algorithms in reverberant environments. The chapter on acoustic parameter estimation describes the way in which useful descriptions of acoustic scenes can be parameterized, and the signal processing algorithms that can be used to estimate the parameter values using spherical microphone arrays. Subsequent chapters exploit these parameters including in particular measures of direction-of-arrival and of diffuseness of a sound field. The array processing algorithms are then classified into two main classes, each described in a separate chapter. These are signal-dependent and signal-independent beamforming algorithms. Although signal-dependent beamforming algorithms are in theory able to provide better performance compared to the signal-independent algorithms, they are currently rarely used in practice. The main reason for this is that the statistical information required by these algorithms is difficult to estimate. In a subsequent chapter it is shown how the estimated acoustic parameters can be used in the design of signal-dependent beamforming algorithms. This final step closes, at least in part, the gap between theory and practice.

Fundamentals of Spherical Array Processing

Fundamentals of Spherical Array Processing
Author :
Publisher : Springer
Total Pages : 201
Release :
ISBN-10 : 9783319995618
ISBN-13 : 3319995618
Rating : 4/5 (18 Downloads)

Synopsis Fundamentals of Spherical Array Processing by : Boaz Rafaely

This book provides a comprehensive introduction to the theory and practice of spherical microphone arrays, and was written for graduate students, researchers and engineers who work with spherical microphone arrays in a wide range of applications. The new edition includes additions and modifications, and references supplementary Matlab code to provide the reader with a straightforward start for own implementations. The book is also accompanied by a Matlab manual, which explains how to implement the examples and simulations presented in the book. The first two chapters provide the reader with the necessary mathematical and physical background, including an introduction to the spherical Fourier transform and the formulation of plane-wave sound fields in the spherical harmonic domain. In turn, the third chapter covers the theory of spatial sampling, employed when selecting the positions of microphones to sample sound pressure functions in space. Subsequent chapters highlight various spherical array configurations, including the popular rigid-sphere-based configuration. Beamforming (spatial filtering) in the spherical harmonics domain, including axis-symmetric beamforming, and the performance measures of directivity index and white noise gain are introduced, and a range of optimal beamformers for spherical arrays, including those that achieve maximum directivity and maximum robustness are developed, along with the Dolph–Chebyshev beamformer. The final chapter discusses more advanced beamformers, such as MVDR (minimum variance distortionless response) and LCMV (linearly constrained minimum variance) types, which are tailored to the measured sound field. Mathworks kindly distributes the Matlab sources for this book on https://www.mathworks.com/matlabcentral/fileexchange/68655-fundamentals-of-spherical-array-processing.

Audio Signal Processing for Next-Generation Multimedia Communication Systems

Audio Signal Processing for Next-Generation Multimedia Communication Systems
Author :
Publisher : Springer Science & Business Media
Total Pages : 375
Release :
ISBN-10 : 9781402077685
ISBN-13 : 1402077688
Rating : 4/5 (85 Downloads)

Synopsis Audio Signal Processing for Next-Generation Multimedia Communication Systems by : Yiteng (Arden) Huang

Audio Signal Processing for Next-Generation Multimedia Communication Systems presents cutting-edge digital signal processing theory and implementation techniques for problems including speech acquisition and enhancement using microphone arrays, new adaptive filtering algorithms, multichannel acoustic echo cancellation, sound source tracking and separation, audio coding, and realistic sound stage reproduction. This book's focus is almost exclusively on the processing, transmission, and presentation of audio and acoustic signals in multimedia communications for telecollaboration where immersive acoustics will play a great role in the near future.

Speech Processing in Modern Communication

Speech Processing in Modern Communication
Author :
Publisher : Springer Science & Business Media
Total Pages : 342
Release :
ISBN-10 : 9783642111303
ISBN-13 : 3642111300
Rating : 4/5 (03 Downloads)

Synopsis Speech Processing in Modern Communication by : Israel Cohen

Modern communication devices, such as mobile phones, teleconferencing systems, VoIP, etc., are often used in noisy and reverberant environments. Therefore, signals picked up by the microphones from telecommunication devices contain not only the desired near-end speech signal, but also interferences such as the background noise, far-end echoes produced by the loudspeaker, and reverberations of the desired source. These interferences degrade the fidelity and intelligibility of the near-end speech in human-to-human telecommunications and decrease the performance of human-to-machine interfaces (i.e., automatic speech recognition systems). The proposed book deals with the fundamental challenges of speech processing in modern communication, including speech enhancement, interference suppression, acoustic echo cancellation, relative transfer function identification, source localization, dereverberation, and beamforming in reverberant environments. Enhancement of speech signals is necessary whenever the source signal is corrupted by noise. In highly non-stationary noise environments, noise transients, and interferences may be extremely annoying. Acoustic echo cancellation is used to eliminate the acoustic coupling between the loudspeaker and the microphone of a communication device. Identification of the relative transfer function between sensors in response to a desired speech signal enables to derive a reference noise signal for suppressing directional or coherent noise sources. Source localization, dereverberation, and beamforming in reverberant environments further enable to increase the intelligibility of the near-end speech signal.

Parametric Time-Frequency Domain Spatial Audio

Parametric Time-Frequency Domain Spatial Audio
Author :
Publisher : John Wiley & Sons
Total Pages : 412
Release :
ISBN-10 : 9781119252580
ISBN-13 : 111925258X
Rating : 4/5 (80 Downloads)

Synopsis Parametric Time-Frequency Domain Spatial Audio by : Ville Pulkki

A comprehensive guide that addresses the theory and practice of spatial audio This book provides readers with the principles and best practices in spatial audio signal processing. It describes how sound fields and their perceptual attributes are captured and analyzed within the time-frequency domain, how essential representation parameters are coded, and how such signals are efficiently reproduced for practical applications. The book is split into four parts starting with an overview of the fundamentals. It then goes on to explain the reproduction of spatial sound before offering an examination of signal-dependent spatial filtering. The book finishes with coverage of both current and future applications and the direction that spatial audio research is heading in. Parametric Time-frequency Domain Spatial Audio focuses on applications in entertainment audio, including music, home cinema, and gaming—covering the capturing and reproduction of spatial sound as well as its generation, transduction, representation, transmission, and perception. This book will teach readers the tools needed for such processing, and provides an overview to existing research. It also shows recent up-to-date projects and commercial applications built on top of the systems. Provides an in-depth presentation of the principles, past developments, state-of-the-art methods, and future research directions of spatial audio technologies Includes contributions from leading researchers in the field Offers MATLAB codes with selected chapters An advanced book aimed at readers who are capable of digesting mathematical expressions about digital signal processing and sound field analysis, Parametric Time-frequency Domain Spatial Audio is best suited for researchers in academia and in the audio industry.

Microphone Array Signal Processing

Microphone Array Signal Processing
Author :
Publisher : Springer Science & Business Media
Total Pages : 245
Release :
ISBN-10 : 9783540786122
ISBN-13 : 3540786120
Rating : 4/5 (22 Downloads)

Synopsis Microphone Array Signal Processing by : Jacob Benesty

In the past few years we have written and edited several books in the area of acousticandspeechsignalprocessing. Thereasonbehindthisendeavoristhat there were almost no books available in the literature when we ?rst started while there was (and still is) a real need to publish manuscripts summarizing the most useful ideas, concepts, results, and state-of-the-art algorithms in this important area of research. According to all the feedback we have received so far, we can say that we were right in doing this. Recently, several other researchers have followed us in this journey and have published interesting books with their own visions and perspectives. The idea of writing a book on Microphone Array Signal Processing comes from discussions we have had with many colleagues and friends. As a c- sequence of these discussions, we came up with the conclusion that, again, there is an urgent need for a monograph that carefully explains the theory and implementation of microphone arrays. While there are many manuscripts on antenna arrays from a narrowband perspective (narrowband signals and narrowband processing), the literature is quite scarce when it comes to s- sor arrays explained from a truly broadband perspective. Many algorithms for speech applications were simply borrowed from narrowband antenna - rays. However, a direct application of narrowband ideas to broadband speech processing may not be necessarily appropriate and can lead to many m- understandings.

Ambisonics

Ambisonics
Author :
Publisher : Springer
Total Pages : 223
Release :
ISBN-10 : 9783030172077
ISBN-13 : 3030172074
Rating : 4/5 (77 Downloads)

Synopsis Ambisonics by : Franz Zotter

This open access book provides a concise explanation of the fundamentals and background of the surround sound recording and playback technology Ambisonics. It equips readers with the psychoacoustical, signal processing, acoustical, and mathematical knowledge needed to understand the inner workings of modern processing utilities, special equipment for recording, manipulation, and reproduction in the higher-order Ambisonic format. The book comes with various practical examples based on free software tools and open scientific data for reproducible research. The book’s introductory section offers a perspective on Ambisonics spanning from the origins of coincident recordings in the 1930s to the Ambisonic concepts of the 1970s, as well as classical ways of applying Ambisonics in first-order coincident sound scene recording and reproduction that have been practiced since the 1980s. As, from time to time, the underlying mathematics become quite involved, but should be comprehensive without sacrificing readability, the book includes an extensive mathematical appendix. The book offers readers a deeper understanding of Ambisonic technologies, and will especially benefit scientists, audio-system and audio-recording engineers. In the advanced sections of the book, fundamentals and modern techniques as higher-order Ambisonic decoding, 3D audio effects, and higher-order recording are explained. Those techniques are shown to be suitable to supply audience areas ranging from studio-sized to hundreds of listeners, or headphone-based playback, regardless whether it is live, interactive, or studio-produced 3D audio material.

Study and Design of Differential Microphone Arrays

Study and Design of Differential Microphone Arrays
Author :
Publisher : Springer Science & Business Media
Total Pages : 184
Release :
ISBN-10 : 9783642337536
ISBN-13 : 3642337538
Rating : 4/5 (36 Downloads)

Synopsis Study and Design of Differential Microphone Arrays by : Jacob Benesty

Microphone arrays have attracted a lot of interest over the last few decades since they have the potential to solve many important problems such as noise reduction/speech enhancement, source separation, dereverberation, spatial sound recording, and source localization/tracking, to name a few. However, the design and implementation of microphone arrays with beamforming algorithms is not a trivial task when it comes to processing broadband signals such as speech. Indeed, in most sensor arrangements, the beamformer output tends to have a frequency-dependent response. One exception, perhaps, is the family of differential microphone arrays (DMAs) who have the promise to form frequency-independent responses. Moreover, they have the potential to attain high directional gains with small and compact apertures. As a result, this type of microphone arrays has drawn much research and development attention recently. This book is intended to provide a systematic study of DMAs from a signal processing perspective. The primary objective is to develop a rigorous but yet simple theory for the design, implementation, and performance analysis of DMAs. The theory includes some signal processing techniques for the design of commonly used first-order, second-order, third-order, and also the general Nth-order DMAs. For each order, particular examples are given on how to form standard directional patterns such as the dipole, cardioid, supercardioid, hypercardioid, subcardioid, and quadrupole. The study demonstrates the performance of the different order DMAs in terms of beampattern, directivity factor, white noise gain, and gain for point sources. The inherent relationship between differential processing and adaptive beamforming is discussed, which provides a better understanding of DMAs and why they can achieve high directional gain. Finally, we show how to design DMAs that can be robust against white noise amplification.

Audio Source Separation and Speech Enhancement

Audio Source Separation and Speech Enhancement
Author :
Publisher : John Wiley & Sons
Total Pages : 517
Release :
ISBN-10 : 9781119279891
ISBN-13 : 1119279895
Rating : 4/5 (91 Downloads)

Synopsis Audio Source Separation and Speech Enhancement by : Emmanuel Vincent

Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and bear a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software. Research on this topic has followed three convergent paths, starting with sensor array processing, computational auditory scene analysis, and machine learning based approaches such as independent component analysis, respectively. This book is the first one to provide a comprehensive overview by presenting the common foundations and the differences between these techniques in a unified setting. Key features: Consolidated perspective on audio source separation and speech enhancement. Both historical perspective and latest advances in the field, e.g. deep neural networks. Diverse disciplines: array processing, machine learning, and statistical signal processing. Covers the most important techniques for both single-channel and multichannel processing. This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) willing to exploit audio source separation or speech enhancement as pre-processing tools for their own needs.