检索结果-内蒙古大学图书馆

arXiv 2019年

作者： Deng, Didan Chen, Zhaokang Zhou, Yuqian Shi, Bertram Neuromorphic Interactive System Laboratory Department of Electronic and Computer Engineering Hong Kong University of Science and Technology Kowloon Hong Kong Image Formation and Processing Group University of Illinois at Urbana-Champaign ChampaignIL United States

Spatial-temporal feature learning is of vital importance for video emotion recognition. Previous deep network structures often focused on macro-motion which extends over long time scales, e.g., on the order of seconds. We believe integrating structures capturing information about both micro- and macro-motion will benefit emotion prediction, because human perceive both micro- and macro-expressions. In this paper, we propose to combine micro- and macro-motion features to improve video emotion recognition with a two-stream recurrent network, named MIMAMO (Micro-Macro-Motion) Net. Specifically, smaller and shorter micro-motions are analyzed by a two-stream network, while larger and more sustained macro-motions can be well captured by a subsequent recurrent network. Assigning specific interpretations to the roles of different parts of the network enables us to make choice of parameters based on prior knowledge: choices that turn out to be optimal. One of the important innovations in our model is the use of interframe phase differences rather than optical flow as input to the temporal stream. Compared with the optical flow, phase differences require less computation and are more robust to illumination changes. Our proposed network achieves state of the art performance on two video emotion datasets, the OMG emotion dataset and the Aff-Wild dataset. The most significant gains are for arousal prediction, for which motion information is intuitively more informative. Source code is available at https://***/wtomin/MIMAMO-Net. Copyright © 2019, The Authors. All rights reserved.

关键词： Speech recognition

来源：评论

学校读者我要写书评

暂无评论

Editorial: Knowledge engineering, semantics, and signal processing in audio - visual information retrieval

引用

IEEE Transactions on Circuits and Systems for Video Technology 2007年第3期17卷 257-258页

作者： Izquierdo, Ebroul Zhang, Jian Sikora, Thomas Huang, Thomas S. Department of Electrical Engineering Queen Mary University of London London El 4NS United Kingdom Sydney Australia School of Computer Science and Engineering University of New South Wales Australia Communication Systems Group Technical University Berlin Berlin Germany ITG University of Illinois at Urbana-Champaign United States Department of Electrical and Computer Engineering Coordinated Science Laboratory United States Image Formation and Processing Group Beckman Institute for Advanced Science and Technology United States Institute's Major Research Theme Human Computer Intelligent Interaction National Academy of Engineering Chinese Academies of Engineering and Sciences China International Association of Pattern Recognition Optical Society of American United States

No abstract available

关键词： Special issues and sections Knowledge engineering Information retrieval Audio-visual systems

来源：评论

学校读者我要写书评

暂无评论

Explanation-based facial motion tracking using a piecewise Bezier volume deformation model

Explanation-based facial motion tracking using a piecewise B...

引用

Conference on Computer Vision and Pattern Recognition (CVPR)

作者： Hai Tao T.S. Huang Image Processing and Formation Laboratory Beckman Institute University of Illinois Urbana-Champaign Urbana IL USA

Capturing real motions from video sequences is a powerful method for automatic building of facial articulation models. In this paper, we propose an explanation-based facial motion tracking algorithm based on a piecewise Bezier volume deformation model (PBVD). The PBVD is a suitable model both for the synthesis and the analysis of facial images. It is linear and independent of the facial mesh structure. With this model, basic facial movements, or action units, are interactively defined. By changing the magnitudes of these action units, animated facial images are generated. The magnitudes of these action units can also be computed from real video sequences using a model-based tracking algorithm. However, in order to customize the articulation model for a particular face, the predefined PBVD action units need to be adaptively modified. In this paper, we first briefly introduce the PBVD model and its application in facial animation. Then a multiresolution PBVD-based motion tracking algorithm is presented. Finally, we describe an explanation-based tracking algorithm that takes the predefined action units as the initial articulation model and adaptively improves them during the tracking process to obtain a more realistic articulation model. Experimental results on PBVD-based animation, model-based tracking, and explanation-based tracking are shown in this paper.

关键词： Tracking Deformable models Facial animation image analysis Video sequences Rendering (computer graphics) Solid modeling image communication Motion analysis image processing

来源：评论

学校读者我要写书评

暂无评论

A Region-Based Representation of images in MARS

引用

Journal of VLSI Signal processing Systems for Signal, image, and Video Technology 1998年第1-2期20卷 137-150页

作者： Servetto, Sergio D. Rui, Yong Ramchandran, Kannan Huang, Thomas S. Beckman Inst. Adv. Sci. and Technol. Univ. Illinois at Urbana-Champaign Urbana IL 61801 United States Universidad Nacional de La Plata Argentina Univ. Illinois at Urbana-Champaign United States Comp. Res. Adv. Applications Group IBM Argentina Argentina Image Formation and Processing Group Beckman Institute UIUC United States Department of Computer Science UNLP Argentina Dept. of Elec. and Comp. Engineering UIUC United States Multimedia Commun. Res. Department Bell Laboratories Murray Hill NJ United States Info. Sciences Research Department AT and T Labs. Florham Park NJ United States Department of Computer Science UIUC United States Southeast University China Tsinghua University China University of Illinois Urbana-Champaign IL United States Image Formation and Processing Group Beckman Inst. Advance Sci. Technol. UIUC United States Vis. Technol. Grp. of Microsoft Res. Redmond WA United States City College of New York United States Columbia University United States AT and T Bell Labs. United States Ctr. for Telecommunications Research Columbia University United States Elec. and Comp. Eng. Department United States Beckman Institute Coordinated Science Laboratory IL United States IEEE Signal Processing Society United States IEEE IMDSP Technical Committee United States IEEE Transactions on Image Proc. United States National Taiwan University Taipei Taiwan Massachusetts Inst. of Technology Cambridge MA United States Department of Electrical Engineering MIT United States School of Electrical Engineering United States Lab. for Info. and Signal Processing Purdue University United States Dept. of Elec. and Comp. Engineering United States Coordinated Science Laboratory United States Image Formation and Processing Group Beckman Inst. Adv. Sci. and Technol. United States MIT Lincoln Laboratory IBM Thomas J. Watson Research Center Rheinishes Landes Museum Bonn Germany Swiss Institutes of Technology Zurich Switzerland Swiss Institutes of Technology Lausanne S

We study the problem of representing images within a multimedia Database Management System (DBMS), in order to support fast retrieval operations without compromising storage efficiency. To achieve this goal, we propose new image coding techniques which combine a wavelet representation, embedded coding of the wavelet coefficients, and segmentation of image-domain regions in the wavelet domain. A bitstream is generated in which each image region is encoded independently of other regions, without having to explicitly store information describing the regions. Simulation results show that our proposed algorithms achieve coding performance which compares favorably, both perceptually and objectively, to that achieved using state-of-the-art image/video coding techniques while additionally providing region-based support.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：