Cantitate/Preț
Produs

Algorithms and Software for Predictive and Perceptual Modeling of Speech: Synthesis Lectures on Algorithms and Software in Engineering

Autor Venkatraman Atti
en Limba Engleză Paperback – 28 mar 2011
From the early pulse code modulation-based coders to some of the recent multi-rate wideband speech coding standards, the area of speech coding made several significant strides with an objective to attain high quality of speech at the lowest possible bit rate. This book presents some of the recent advances in linear prediction (LP)-based speech analysis that employ perceptual models for narrow- and wide-band speech coding. The LP analysis-synthesis framework has been successful for speech coding because it fits well the source-system paradigm for speech synthesis. Limitations associated with the conventional LP have been studied extensively, and several extensions to LP-based analysis-synthesis have been proposed, e.g., the discrete all-pole modeling, the perceptual LP, the warped LP, the LP with modified filter structures, the IIR-based pure LP, all-pole modeling using the weighted-sum of LSP polynomials, the LP for low frequency emphasis, and the cascade-form LP. These extensions canbe classified as algorithms that either attempt to improve the LP spectral envelope fitting performance or embed perceptual models in the LP. The first half of the book reviews some of the recent developments in predictive modeling of speech with the help of Matlab™ Simulation examples. Advantages of integrating perceptual models in low bit rate speech coding depend on the accuracy of these models to mimic the human performance and, more importantly, on the achievable "coding gains" and "computational overhead" associated with these physiological models. Methods that exploit the masking properties of the human ear in speech coding standards, even today, are largely based on concepts introduced by Schroeder and Atal in 1979. For example, a simple approach employed in speech coding standards is to use a perceptual weighting filter to shape the quantization noise according to the masking properties of the human ear. The second half of the book reviews some of the recent developments in perceptual modeling of speech (e.g., masking threshold, psychoacoustic models, auditory excitation pattern, and loudness) with the help of Matlab™ simulations. Supplementary material including Matlab™ programs and simulation examples presented in this book can also be accessed here. Table of Contents: Introduction / Predictive Modeling of Speech / Perceptual Modeling of Speech
Citește tot Restrânge

Din seria Synthesis Lectures on Algorithms and Software in Engineering

Preț: 25437 lei

Nou

Puncte Express: 382

Preț estimativ în valută:
4868 5121$ 4062£

Carte tipărită la comandă

Livrare economică 09-23 ianuarie 25

Preluare comenzi: 021 569.72.76

Specificații

ISBN-13: 9783031003882
ISBN-10: 3031003888
Ilustrații: IX, 113 p.
Dimensiuni: 191 x 235 mm
Greutate: 0.23 kg
Editura: Springer International Publishing
Colecția Springer
Seria Synthesis Lectures on Algorithms and Software in Engineering

Locul publicării:Cham, Switzerland

Cuprins

Introduction.- Predictive Modeling of Speech.- Perceptual Modeling of Speech.

Notă biografică

Atti Venkatraman, PhD, is a Staff Engineer at Verance Corporation. His work focuses on the development of robust and advanced watermarking algorithms for digital cinema. Prior to Verance Corporation, he was with Acoustic Technologies, where he focused on the research and development of perceptually-based algorithms for acoustic echo cancellation, noise reduction, and audio enhancement for both cellular handset and telematics solutions. He has worked extensively on integrating perceptual signal processing methods in linear prediction. He has also contributed heavily to advanced distance learning technologies involving Java.