检索结果-内蒙古大学图书馆

Music, Computing, and Health: A Roadmap for the Current and Future Roles of Music Technology for Health Care and Well-Being

引用

Music and Science 2021年 4卷

作者： Agres, Kat R. Schaefer, Rebecca S. Volk, Anja van Hooren, Susan Holzapfel, Andre Dalla Bella, Simone Müller, Meinard de Witte, Martina Herremans, Dorien Ramirez Melendez, Rafael Neerincx, Mark Ruiz, Sebastian Meredith, David Dimitriadis, Theo Magee, Wendy L. Yong Siew Toh Conservatory of Music National University of Singapore Singapore Social and Cognitive Computing Department Institute of High Performance Computing A*STAR Singapore Institute for Psychology Health Medical Neuropsychology Unit Leiden University Leiden Netherlands Leiden Institute for Brain and Cognition Leiden University Leiden Netherlands Academy of Creative and Performing Arts Leiden University Leiden Netherlands Department of Information and Computing Sciences Utrecht University Utrecht Netherlands Faculty of Health care Department of Arts Therapies Zuyd University of Applied Sciences Heerlen Netherlands KenVaK Research Centre for the Arts Therapies and Psychomotricity Heerlen Netherlands Faculty of Psychology Open University of The Netherlands Heerlen Netherlands Division of Media Technology and Interaction Design KTH Royal Institute of Technology Stockholm Sweden International Laboratory for Brain Music and Sound Research (BRAMS) Outremont QC Canada Department of Psychology University of Montreal Montreal QC Canada Centre for Research on Brain Language and Music (CRBLM) Montreal QC Canada University of Economics and Human Sciences in Warsaw Warsaw Poland International Audio Laboratories Erlangen Friedrich-Alexander Universität Erlangen-Nürnberg Erlangen Germany HAN University of Applied Sciences Department of Arts Therapies and Psychological Studies Nijmegen Netherlands University of Amsterdam Research Institute of Child Development and Education Amsterdam Netherlands Treatment Centre for People with Mild Intellectual Disabilities and Psychiatric and Behavioral Disorders Gennep Netherlands Information Systems Technology and Design Singapore University of Technology and Design Singapore Music and Machine Learning Lab Music Technology Group Universitat Pompeu Fabra Barcelona Spain Faculty of EEMCS Interactive Intelligence Group Delft University of Technology Delft Netherlands Centre for Digital Music School of Electronic Engineerin

The fields of music, health, and technology have seen significant interactions in recent years in developing music technology for health care and well-being. In an effort to strengthen the collaboration between the involved disciplines, the workshop “Music, Computing, and Health” was held to discuss best practices and state-of-the-art at the intersection of these areas with researchers from music psychology and neuroscience, music therapy, music information retrieval, music technology, medical technology (medtech), and robotics. Following the discussions at the workshop, this article provides an overview of the different methods of the involved disciplines and their potential contributions to developing music technology for health and well-being. Furthermore, the article summarizes the state of the art in music technology that can be applied in various health scenarios and provides a perspective on challenges and opportunities for developing music technology that (1) supports person-centered care and evidence-based treatments, and (2) contributes to developing standardized, large-scale research on music-based interventions in an interdisciplinary manner. The article provides a resource for those seeking to engage in interdisciplinary research using music-based computational methods to develop technology for health care, and aims to inspire future research directions by evaluating the state of the art with respect to the challenges facing each field. © The Author(s) 2021.

关键词： health care interdisciplinarity MedTech music information retrieval (MIR) music neuroscience Music psychology music technology music therapy well-being

来源：评论

学校读者我要写书评

暂无评论

human preferences for robot-human hand-over configurations

Human preferences for robot-human hand-over configurations

引用

2011 IEEE/RSJ International Conference on Intelligent Robots and Systems

作者： Maya Cakmak Siddhartha S. Srinivasa Min Kyung Lee Jodi Forlizzi Sara Kiesler School of Interactive Computing Georgia Institute of Technology USA Intel Laboratories Pittsburgh USA Human-Computer Interaction Institute Carnegie Mellon University Qatar

Handing over objects to humans is an essential capability for assistive robots. While there are infinite ways to hand an object, robots should be able to choose the one that is best for the human. In this paper we focus on choosing the robot and object configuration at which the transfer of the object occurs, i.e. the hand-over configuration. We advocate the incorporation of user preferences in choosing hand-over configurations. We present a user study in which we collect data on human preferences and a human-robot interaction experiment in which we compare hand-over configurations learned from human examples against configurations planned using a kinematic model of the human. We find that the learned configurations are preferred in terms of several criteria, however planned configurations provide better reachability. Additionally, we find that humans prefer hand-overs with default orientations of objects and we identify several latent variables about the robot's arm that capture significant human preferences. These findings point towards planners that can generate not only optimal but also preferable hand-over configurations for novel objects.

关键词： humans Robot sensing systems Kinematics Joints Planning Receivers

来源：评论

学校读者我要写书评

暂无评论

Context-adaptive phone boundary refining for a TTS database

Context-adaptive phone boundary refining for a TTS database

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： Ki-Seung Lee JeongSu Kim Department of Electronic Eng Konkuk University Seoul South Korea Human-Computer Interactive Laboratories Samsung Advanced Institute of Technology Gyeonggi South Korea

ISBN: (纸本)0780376633

A method for the automatic segmentation of speech signals is described. The method is dedicated to the construction of a large database for a Text-To-Speech (TTS) synthesis system. The main issue of the work involves the refinement of an initial estimation of phone boundaries which are provided by an alignment, based on a Hidden Markov Model (HMM). Multi-layer perceptron (MLP) was used as a phone boundary detector. To increase the performance of segmentation, a technique which individually trains an MLP according to phonetic transition is proposed. The optimum partitioning of the entire phonetic transition space is constructed from the standpoint of minimizing the overall deviation from hand labelling positions. With single speaker stimuli, the experimental results showed that more than 95% of all phone boundaries have a boundary deviation from the reference position smaller than 20 ms, and the refinement of the boundaries reduces the root mean square error by about 25%.

关键词： Speech synthesis Databases Hidden Markov models Labeling Automatic speech recognition Signal synthesis Multilayer perceptrons Detectors Root mean square Linear predictive coding

来源：评论

学校读者我要写书评

暂无评论

Multimodal Error Correction for Speech User Interfaces

引用

ACM Transactions on computer-human Interaction 2001年第1期8卷 60-98页

作者： Suhm, Bernhard Myers, Brad Waibel, Alex Speech and Language Processing BBN Technologies 70 Fawcett Street Cambridge MA 02138 United States Human Computer Interaction Institute School of Computer Science Carnegie Mellon University Pittsburgh PA 15213-3891 United States Interactive Systems Laboratories School of Computer Science Carnegie Mellon University and Karlsruhe University (Germany) Pittsburgh PA 15221 United States

Although commercial dictation systems and speech-enabled telephone voice user interfaces have become readily available, speech recognition errors remain a serious problem in the design and implementation of speech user interfaces. Previous work hypothesized that switching modality could speed up interactive correction of recognition errors. This article presents multimodal error correction methods that allow the user to correct recognition errors efficiently without keyboard input. Correction accuracy is maximized by novel recognition algorithms that use context information for recognizing correction input. Multimodal error correction is evaluated in the context of a prototype multimodal dictation system. The study shows that unimodal repair is less accurate than multimodal error correction. On a dictation task, multimodal correction is faster than unimodal correction by respeaking. The study also provides empirical evidence that system-initiated error correction (based on confidence measures) may not expedite error correction. Furthermore, the study suggests that recognition accuracy determines user choice between modalities: while users initially prefer speech, they learn to avoid ineffective correction modalities with experience. To extrapolate results from this user study, the article introduces a performance model of (recognition-based) multimodal interaction that predicts input speed including time needed for error correction. Applied to interactive error correction, the model predicts the impact of improvements in recognition technology on correction speeds, and the influence of recognition accuracy and correction method on the productivity of dictation systems. This model is a first step toward formalizingmultimodal interaction. © 2001, ACM. All rights reserved.

关键词： Design dictation systems Experimentation human Factors interactive error correction Measurement Multimodal interfaces pen input performance model speech input speech user interfaces

来源：评论

学校读者我要写书评

暂无评论

Model-based and empirical evaluation of multimodal interactive error correction 99

Model-based and empirical evaluation of multimodal interacti...

引用

SIGCHI Conference on human Factors in Computing Systems, CHI 1999

作者： Suhm, Bernhard Myers, Brad Waibel, Alex Interactive Systems Laboratories Carnegie Mellon University United States Human Computer Interaction Institute Carnegie Mellon University United States

ISBN: (纸本)0201485591

Our research addresses the problem of error correction in speech user interfaces. Previous work hypothesized that switching modality could speed up interactive correction of recognition errors (so-called multimodal error correction). We present a user study that compares, on a dictation task, multimodal error correction with conventional interactive correction, such as speaking again, choosing Tom a list, and keyboard input. Results show that multimodal correction is faster than conventional correction without keyboard input, but slower than correction by typing for users with good typing skills. Furthermore, while users initially prefer speech, they learn to avoid ineffective correction modalities with experience. To extrapolate results from this user study we developed a performance model of multimodal interaction that predicts input speed including time needed for error correction. We apply the model to estimate the impact of recognition technology improvements on correction speeds and the influence of recognition accuracy and correction method on the productivity of dictation systems. Our model is a first step towards formalizing multimodal (recognition-based) interaction. Copyright © 2012 ACM, Inc.

关键词： Error correction

来源：评论

学校读者我要写书评

暂无评论

interactive recovery from speech recognition errors in speech user interfaces

Interactive recovery from speech recognition errors in speec...

引用

International Conference on Spoken Language, ICSLP

作者： B. Suhm B. Myers A. Waibel Interactive Systems Laboratories Carnegie Mellon University USA Human Computer Interaction Institute Carnegie Mellon University USA

The authors present a multimodal approach to interactive recovery from speech recognition errors for the design of speech user interfaces. They propose a framework to compare various error recovery methods, arguing that a rational user will prefer interaction methods which provide an optimal trade off between accuracy, speed and naturalness. They describe a prototypical implementation of multimodal interactive error recovery and present results from a preliminary evaluation in form filling and speech to speech translation tasks.

关键词： Speech recognition User interfaces computer errors humans Natural languages laboratories Prototypes Speech analysis Vocabulary interactive systems

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：