Cantitate/Preț
Produs

Deep Learning Based Speech Quality Prediction: T-Labs Series in Telecommunication Services

Autor Gabriel Mittag
en Limba Engleză Paperback – 26 feb 2023
This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment through the prediction of the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.
Citește tot Restrânge

Toate formatele și edițiile

Toate formatele și edițiile Preț Express
Paperback (1) 62184 lei  6-8 săpt.
  Springer International Publishing – 26 feb 2023 62184 lei  6-8 săpt.
Hardback (1) 62793 lei  6-8 săpt.
  Springer International Publishing – 25 feb 2022 62793 lei  6-8 săpt.

Din seria T-Labs Series in Telecommunication Services

Preț: 62184 lei

Preț vechi: 73158 lei
-15% Nou

Puncte Express: 933

Preț estimativ în valută:
11899 12566$ 9902£

Carte tipărită la comandă

Livrare economică 13-27 ianuarie 25

Preluare comenzi: 021 569.72.76

Specificații

ISBN-13: 9783030914813
ISBN-10: 303091481X
Pagini: 165
Ilustrații: XIV, 165 p. 58 illus., 54 illus. in color.
Dimensiuni: 155 x 235 mm
Greutate: 0.26 kg
Ediția:1st ed. 2022
Editura: Springer International Publishing
Colecția Springer
Seria T-Labs Series in Telecommunication Services

Locul publicării:Cham, Switzerland

Cuprins

1. Introduction.- 2. Quality Assessment of Transmitted Speech.- 3. Neural Network Architectures for Speech Quality Prediction.- 4. Double-Ended Speech Quality Prediction Using Siamese Networks.- 5. Prediction of Speech Quality Dimensions With Multi-Task Learning.- 6. Bias-Aware Loss for Training From Multiple Datasets.- 7. NISQA – A Single-Ended Speech Quality Model.- 8. Conclusions.- A. Dataset Condition Tables.- B. Train and Validation Dataset Dimension Histograms.- References.

Notă biografică

Gabriel Mittag received his B.Sc. and M.Sc. degree in electrical and electronic engineering at the Technische Universität Berlin. During his master's degree he spent two semesters at the RMIT University in Melbourne, Australia and focused primarily on biomedical and speech signal processing. From 2016 he was employed as research assistant at the Quality and Usability Lab at the TU Berlin, where he finished his PhD on the machine learning based prediction of speech quality. In May 2021, Gabriel Mittag started as Machine Learning Scientist at Microsoft in Redmond, WA, USA.

Textul de pe ultima copertă

This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment through the prediction of the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.

Caracteristici

Presents how to apply deep learning methods for the task of speech quality prediction Includes a model that outperforms traditional speech quality models Presents an in-depth analysis and comparison of different deep learning