Man-Machine Speech Communication: 17th National Conference, NCMMSC 2022, Hefei, China, December 15–18, 2022, Proceedings: Communications in Computer and Information Science, Volume 1765
Edited by Ling Zhenhua, Gao Jianqing, Yu Kai, Jia Jia. Language: English. Paperback – 11 May 2023
The 21 full papers and 7 short papers included in this book were carefully reviewed and selected from 108 submissions. The contributions include, among others: MCPN: A Multiple Cross-Perception Network for Real-Time Emotion Recognition in Conversation; Baby Cry Recognition Based on Acoustic Segment Model; and MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.
Price: 534.08 lei
Old price: 667.60 lei
-20% New
Express points: 801
Estimated price in foreign currency:
102.21€ • 106.10$ • 85.22£
Print-on-demand book
Economy delivery: 22 March to 5 April
Orders by phone: 021 569.72.76
Specifications
ISBN-13: 9789819924004
ISBN-10: 9819924006
Illustrations: XI, 332 p., 91 illus., 86 illus. in color.
Dimensions: 155 x 235 mm
Weight: 0.49 kg
Edition: 1st ed. 2023
Publisher: Springer Nature Singapore
Collection: Springer
Series: Communications in Computer and Information Science
Place of publication: Singapore, Singapore
Contents
MCPN: A Multiple Cross-Perception Network for Real-Time Emotion Recognition in Conversation.- Baby Cry Recognition Based on Acoustic Segment Model.- A Multi-feature Sets Fusion Strategy with Similar Samples Removal for Snore Sound Classification.- Multi-Hypergraph Neural Networks for Emotion Recognition in Multi-Party Conversations.- Using Emoji as an Emotion Modality in Text-Based Depression Detection.- Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis.- Semantic enhancement framework for robust speech recognition.- Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model.- Predictive AutoEncoders are Context-Aware Unsupervised Anomalous Sound Detectors.- A pipelined framework with serialized output training for overlapping speech recognition.- Adversarial Training Based on Meta-Learning in Unseen Domains for Speaker Verification.- Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement.- Multiple Confidence Gates for Joint Training of SE and ASR.- Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Linguistic Information Fusion.- Pre-training Techniques For Improving Text-to-Speech Synthesis By Automatic Speech Recognition Based Data Enhancement.- A Time-Frequency Attention Mechanism with Subsidiary Information for Effective Speech Emotion Recognition.- Interplay between prosody and syntax-semantics: Evidence from the prosodic features of Mandarin tag questions.- Improving Fine-grained Emotion Control and Transfer with Gated Emotion Representations in Speech Synthesis.- Violence Detection through Fusing Visual Information to Auditory Scene.- Mongolian Text-to-Speech Challenge under Low-Resource Scenario for NCMMSC2022.- VC-AUG: Voice Conversion based Data Augmentation for Text-Dependent Speaker Verification.- Transformer-based potential emotional relation mining network for emotion recognition in conversation.- FastFoley: Non-Autoregressive Foley Sound Generation Based On Visual Semantics.- Structured Hierarchical Dialogue Policy with Graph Neural Networks.- Deep Reinforcement Learning for On-line Dialogue State Tracking.- Dual Learning for Dialogue State Tracking.- Automatic Stress Annotation and Prediction For Expressive Mandarin TTS.- MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.