清華電機 BIIC lab 李祈均教授的論文 An Engineering View on Emotions and Speech: From Analysis and Predictive Models to Responsible Human-Centered Applications
發表在頂尖國際期刊 Proceedings of the IEEE → Link here
Proceedings of the IEEE 創刊於二十世紀初年,傳統悠久,時至今日成為全球最頂尖、最權威、最具影響力的期刊之一。這篇論文從工程技術的觀點,大規模整理了數十年來「情緒與語音」領域的發展,涵括分析、模型,和近年為人注目的可信賴 / 人本應用。
感謝其餘諸位作者協力合作,包含 Texas A&M University (德州A&M大學)副教授Theodora Chaspari 和 University of Michigan(密西根大學)副教授 Emily Mower Provost,他們都是李祈均老師在 USC SAIL(南加州大學)攻讀博士,以及第四位作者 Shri 正是他們當年的指導教授。
Abstract
Abstract: The substantial growth of Internet-of-Things technology and the ubiquity of smartphone devices has increased the public and industry focus on speech emotion recognition (SER) technologies. Yet, conceptual, technical, and societal challenges restrict the wide adoption of these technologies in various domains, including, healthcare, and education. These challenges are amplified when automated emotion recognition systems are called to function “in-the-wild” due to the inherent complexity and subjectivity of human emotion, the difficulty of obtaining reliable labels at high temporal resolution, and the diverse contextual and environmental factors that confound the expression of emotion in real life. In addition, societal and ethical challenges hamper the wide acceptance and adoption of these technologies, with the public raising questions about user privacy, fairness, and explainability. This article briefly reviews the history of affective speech processing, provides an overview of current state-of-the-art approaches to SER, and discusses algorithmic approaches to render these technologies accessible to all, maximizing their benefits and leading to responsible human-centered computing applications. Published in: Proceedings of the IEEE ( Early Access )
Page(s): 1 - 17
Date of Publication: 13 June 2023
DOI: 10.1109/JPROC.2023.3276209
Publisher: IEEE