PIT 2008: Kloster Irsee, Germany
Elisabeth André, Laila Dybkjær, Wolfgang Minker, Heiko Neumann, Roberto Pieraccini, Michael Weber (Eds.): Perception in Multimodal Dialogue Systems, 4th IEEE Tutorial and Research Workshop on Perception and Interactive Technologies for Speech-Based Systems, PIT 2008, Kloster Irsee, Germany, June 16-18, 2008, Proceedings. Springer 2008 Lecture Notes in Computer Science ISBN 978-3-540-69368-0
Invited Keynote
Anton Batliner: Whence and Whither: The Automatic Recognition of Emotions in Speech (Invited Keynote). 1
Multimodal and Spoken Dialogue Systems
Andrea Corradini, Christer Samuelsson: A Generic Spoken Dialogue Manager Applied to an Interactive 2D Game. 2-13
Milan Gnjatovic, Dietmar Rösner: Adaptive Dialogue Management in the NIMITEK Prototype System. 14-25
Philipp Fischer, André Berton, Andreas Nürnberger: Adaptive Search Results Personalized by a Fuzzy Recommendation Approach. 26-36
Anja Naumann, Ina Wechsung, Sebastian Möller: Factors Influencing Modality Choice in Multimodal Applications. 37-43
Martin Raab, Rainer Gruhn, Elmar Nöth: Codebook Design for Speech Guided Car Infotainment Systems. 44-51
Martin Pfeil, Dirk Bühler, Rainer Gruhn, Wolfgang Minker: Evaluating Text Normalization for Speech-Based Media Selection. 52-59
Classification of Spoken Utterances and Sound
David Griol, Lluís F. Hurtado, Encarna Segarra, Emilio Sanchis: A Two Phases Statistical Approach for Dialog Management. 60-71
Alexander Schmitt, Carolin Hank, Jackson Liscombe: Detecting Problematic Dialogs with Automated Agents. 72-80
David Suendermann, Phillip Hunter, Roberto Pieraccini: Call Classification with Hundreds of Classes and Hundred Thousands of Training Utterances ... ... and No Target Domain Data. 81-87
Amparo Albalate, David Suendermann: Hard vs. Fuzzy Clustering for Speech Utterance Categorization. 88-98
Björn Schuller, Florian Eyben, Gerhard Rigoll: Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech. 99-110
Recognition of Eye Gaze, Head Pose, Mimics and Lip Movements
Nikolaus Bee, Elisabeth André: Writing with Your Eye: A Dwell Time Free Writing System Adapted to the Nature of Human Eye Gaze. 111-122
Ulrich Weidenbacher, Heiko Neumann: Unsupervised Learning of Head Pose through Spike-Timing Dependent Plasticity. 123-131
Takahiro Yoshida, Erika Yamazaki, Seiichiro Hangai: Spoken Word Recognition from Side of Face Using Infrared Lip Movement Sensor. 132-140
Andreas Wendemuth, Jochen Braun, Bernd Michaelis, Frank W. Ohl, Dietmar Rösner, Henning Scheich, Ralf Warnemünde: Neurobiologically Inspired, Multimodal Intention Recognition for Technical Communication Systems (NIMITEK). 141-144
Speech Recognition
Dmitry Zaykovskiy, Alexander Schmitt: Deploying DSR Technology on Today's Mobile Phones: A Feasibility Study. 145-155
Yongjoo Jung: Improving Robustness in Jacobian Adaptation for Noisy Speech Recognition. 168-175
Daniel Vásquez, Rainer Gruhn, Raymond Brueckner, Wolfgang Minker: Comparing Linear Feature Space Transformations for Correlated Features. 176-187
Vocal Emotion Recognition and Annotation
Thurid Vogt, Elisabeth André, Nikolaus Bee: EmoVoice - A Framework for Online Recognition of Emotions from Voice. 188-199
Stefan Scherer, Mohamed Oubbati, Friedhelm Schwenker, Günther Palm: Real-Time Emotion Recognition Using Echo State Networks. 200-204
Taner Danisman, Adil Alpkocak: Emotion Classification of Audio Signals Using Ensemble of Support Vector Machines. 205-216
Bogdan Vlasenko, Björn Schuller, Andreas Wendemuth, Gerhard Rigoll: On the Influence of Phonetic Content Variation for Acoustic Emotion Recognition. 217-220
Zoraida Callejas, Ramón López-Cózar: On the Use of Kappa Coefficients to Measure the Reliability of the Annotation of Non-acted Emotions. 221-232
Federica Cavicchio, Massimo Poesio: Annotation of Emotion in Dialogue: The Emotion in Cooperation Project. 233-239
Human-Like Social Dialogue
Joakim Gustafson, Mattias Heldner, Jens Edlund: Potential Benefits of Human-Like Dialogue Behaviour in the Call Routing Domain. 240-251
Anna Hjalmarsson, Jens Edlund: Human-Likeness in Utterance Generation: Effects of Variability. 252-255
Koen van Turnhout, Jacques M. B. Terken, Berry Eggen: Designing Socially Aware Conversational Agents. 256-267
Marcel Dausend, Ute Ehrlich: A Prototype for Future Spoken Dialog Systems Using an Embodied Conversational Agent. 268-271
Jonas Beskow, Jens Edlund, Björn Granström, Joakim Gustafson, Gabriel Skantze: Innovative Interfaces in MonAMI: The Reminder. 272-275
Evaluation Methods
Ina Wechsung, Anja Naumann: Evaluation Methods for Multimodal Systems: A Comparison of Standardized Usability Questionnaires. 276-284
Benjamin Weiss, Ina Wechsung, Anja Naumann, Sebastian Möller: Subjective Evaluation Method for Speech-Based Uni- and Multimodal Applications. 285-288
Klaus-Peter Engelbrecht, Christine Kühnel, Sebastian Möller: Weighting the Coefficients in PARADISE Models to Increase Their Generalizability. 289-292
Joakim Gustafson, Jens Edlund: EXPROS: A Toolkit for Exploratory Experimentation with Prosody in Customized Diphone Voices. 293-296
Hugo Wesseling, Matthias Bezold, Nicole Beringer: Automatic Evaluation Tool for Multimodal Dialogue Systems. 297-305
Sebastian Möller, Klaus-Peter Engelbrecht: Towards a Perception-Based Evaluation Model for Spoken Dialogue Systems. 306-309



