Text, Speech, and Dialogue: 27th International Conference, TSD 2024, Brno, Czech Republic, September 9–13, 2024, Proceedings, Part II: Lecture Notes in Computer Science, volume 15049
Edited by Elmar Nöth, Aleš Horák, Petr Sojka. In English. Paperback – 26 Sep 2024
The 50 revised full papers presented in these proceedings were carefully reviewed and selected from 103 submissions.
The papers are organized in the following topical sections:
Part I: Text
Part II: Speech, Dialogue
Price: 429.74 lei
Old price: 505.59 lei
-15% New
Express Points: 645
Estimated price in foreign currency:
82.24€ • 85.43$ • 68.32£
Specifications
ISBN-13: 9783031705656
ISBN-10: 3031705653
Illustrations: XX, 312 p.
Dimensions: 155 x 235 mm
Edition: 2024
Publisher: Springer Nature Switzerland
Collection: Springer
Series: Lecture Notes in Computer Science, Lecture Notes in Artificial Intelligence
Place of publication: Cham, Switzerland
Contents
.- Speech.
.- Retrieval Augmented Spoken Language Generation for Transport Domain.
.- Adapting Audiovisual Speech Synthesis to Estonian.
.- Dysphonia Diagnosis Using Self-Supervised Speech Models in Mono- and Cross-Lingual Settings.
.- Sentences vs Phrases in Neural Speech Synthesis.
.- Zero-Shot vs. Few-Shot Multi-Speaker TTS Using Pre-trained Czech SpeechT5 Model.
.- Deep Speaker Embeddings for Speaker Verification of Children.
.- Improved Alignment for Score Combination of RNN-T and CTC Decoder for Online Decoding.
.- Attention to Phonetics: A Visually Informed Explanation of Speech Transformers.
.- Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis.
.- Stream-Based Active Learning for Speech Emotion Recognition via Hybrid Data Selection and Continuous Learning.
.- Data Alignment and Duration Modelling in VITS.
.- Multiword Expressions Resources for Italian: Presenting a Manually Annotated Spoken Corpus.
.- Generating High-Quality F0 Embeddings Using the Vector-Quantized Variational Autoencoder.
.- Anonymizing Dysarthric Speech: Investigating the Effects of Voice Conversion on Pathological Information Preservation.
.- X-vector-based Speaker Diarization Using Bi-LSTM and Interim Voting-driven Post-processing.
.- A Paradigm for Interpreting Metrics and Measuring Error Severity in Automatic Speech Recognition.
.- Enhancing Speech Emotion Recognition Using Transfer Learning From Speaker Embeddings.
.- Dialogue.
.- Investigating Low-Cost LLM Annotation for Spoken Dialogue Understanding Datasets.
.- PiCo-VITS: Leveraging Pitch Contours for Fine-grained Emotional Speech Synthesis.
.- Improving and Understanding Clarifying Question Generation in Conversational Search.
.- Explainable Multimodal Fusion for Dementia Detection From Text and Speech.
.- Robust Classification of Parkinson’s Speech: an Approximation to a Scenario With Non-controlled Acoustic Conditions.
.- Leveraging Conceptual Similarities to Enhance Modeling of Factors Affecting Adolescents’ Well-Being.
.- Joint-Average Mean and Variance Feature Matching (JAMVFM) Semi-supervised GAN with Additional-Objective Training Function for Intent Detection.
.- Capturing Task-Related Information for Text-Based Grasp Classification Using Fine-Tuned Embeddings.
.- StepDP: A Step Towards Expressive and Pervasive Dialogue Platforms.
.- Automatic Classification of Parkinson’s Disease Using Wav2vec Embeddings at Phoneme, Syllable, and Word Levels.