检索结果-内蒙古大学图书馆

作者： Peng, Bo Qian, Gang Rajko, Stjephan Arts Media and Engineering Program Arizona State University Tempe AZ 85287 United States Department of Electrical Engineering Arizona State University Tempe AZ 85287 United States Department of Computer Science and Engineering Arizona State University Tempe AZ 85287 United States

ISBN: (纸本)9781424421756

In this paper, we propose a video-based full-body gesture recognition system independent of the view angle of the cameras. We performed multilinear analysis on the silhouette images of the static poses making up the gestures by tensor decomposition and projection. Each pair of silhouette images is projected to a view-invariant low dimensional pose coefficient vector space. These pose vectors are then used as input vectors in hid- den Markov model (HMM) for gesture recognition. This system worked effectively in our experiments using real videos. © 2008 IEEE.

关键词： Gesture recognition

来源：评论

学校读者我要写书评

暂无评论

A comparison of molecular approaches for generating sparse and structured multiresolution representations of audio and music signals

A comparison of molecular approaches for generating sparse a...

引用

7th European Conference on Noise Control 2008, EURONOISE 2008

作者： Sturm, B. Shynk, J.J. McLeran, A. Roads, C. Daudet, L. University of California Department of Electrical and Computer Engineering Box 117 Santa Barbara CA 93106 United States University of California Media Arts and Technology Program Santa Barbara CA 93106 United States UPMC Univ Paris 06 LAM IJLRA 11 rue de Lourmel 75015 Paris France

We compare the characteristics and performance of joint (single-step) and sequential (two-step) approaches for creating sparse and structured acoustic signal representations derived using overcomplete methods (OMs). A joint approach, such as molecular matching pursuit (MMP), attempts to find coherent structures in a signal as part of the decomposition process, while a sequential approach, such as agglomerative clustering (AC), attempts to find coherent structures after the signal decomposition. We review each approach, and examine their performance using real audio and music signals.

关键词： Audio acoustics

来源：评论

学校读者我要写书评

暂无评论

Robust Human Pose Recognition Using Unlabelled Markers

Robust Human Pose Recognition Using Unlabelled Markers

引用

IEEE Workshop on Applications of Computer Vision (WACV)

作者： Yi Wang Gang Qian Department of Computer Science and Engineering Arizona State University USA Arts Media and Engineering Program and Department of Electrical Engineering Arizona State University USA

In this paper, we tackle robust human pose recognition using unlabelled markers obtained from an optical marker-based motion capture system. A coarse-to-fine fast pose matching algorithm is presented with the following three steps. Given a query pose, firstly, the majority of the non-matching poses are rejected according to marker distributions along the radius and height dimensions. Secondly, relative rotation angles between the query pose and the remaining candidate poses are estimated using a fast histogram matching method based on circular convolution implemented using the fast Fourier transform. Finally, rotation angle estimates are refined using nonlinear least square minimization through the Levenberg-Marquardt minimization. In the presence of multiple solutions, false poses can be effectively removed by thresholding the minimized matching scores. The proposed framework can handle missing markers caused by occlusion. Experimental results using real motion capture data show the efficacy of the proposed approach.

关键词： Robustness Humans Least squares approximation Labeling Clouds Art Fast Fourier transforms Image recognition Computer science Biomedical optical imaging

来源：评论

学校读者我要写书评

暂无评论

Human pose inference from stereo cameras

Human pose inference from stereo cameras

引用

7th IEEE Workshop on Applications of Computer Vision, WACV 2007

作者： Guo, Feng Qian, Gang Arts Media and Engineering Program Department of Electrical Engineering Arizona State University

ISBN: (纸本)0769527949

In this paper, a Bayesian mixture expert (BME) framework for the estimation of 3D human poses from two uncalibrated wide-baseline cameras is presented. The two cameras will reduce the ambiguities of the pose estimation greatly and is easy to implemented. BME is learnt to conduct multi-modal pose estimation regression. K-means algorithm considering Euclidean distance and maximum-value distance for the joint angle vector is used for the initial clustering in BME learning. This will give the better cluster results to separate the ambiguous poses into different expert. Also a weighted PCA is implemented in an expectation-maximization (EM) framework to learn the parameters of the BME. This can reduce the dimension of the training data more effectively compared with global PCA. The system is trained with synthesized silhouettes from motion capture data. The experimental results on synthesized and real images illustrate that our approach does not need precise camera calibration and can estimate the poses effectively. © 2007 IEEE.

关键词： Face recognition

来源：评论

学校读者我要写书评

暂无评论

A comparison of molecular approaches for generating sparse and structured multiresolution representations of audio and music signals

引用

The Journal of the Acoustical Society of America 2008年第5_SUPPLEMENT期123卷 3801-3801页

作者： Bob Sturm John J. Shynk Aaron McLeran Curtis Roads Laurent Daudet University of California Box 117 Department of Electrical and Computer Engineering Santa Barbara CA 93106 USA boblsturm@ece.ucsb.edu University of California Box 117 Department of Electrical and Computer Engineering Santa Barbara CA 93106 USA shynk@ece.ucsb.edu University of California Media Arts and Technology Program Santa Barbara CA 93106 USA amcleran@*** University of California Media Arts and Technology Program Santa Barbara CA 93106 USA clang@mat.ucsb.edu UPMC Univ Paris 06 LAM / IJLRA 11 rue de Lourmel 75015 Paris France daudet@lam.jussieu.fr

The authors investigate the characteristics and performance of joint (single‐step) and sequential (two‐step) approaches to creating sparse and structured multiresolution representations of audio and music signals derived using sparse overcomplete methods. A joint approach, such as molecular matching pursuit, attempts to find structures in a signal as part of the decomposition process, while a sequential approach, such as agglomerative clustering, attempts to find structures in the completed decomposition of a signal. Each of these approaches have different benefits and drawbacks. For a joint approach, it is computationally convenient that the decomposition and structuring are done simultaneously, but usually only simple structural relations are possible. For a sequential approach, one is working in a parameter space of much smaller dimension than the original signal, but the computation is higher since the decomposition and the structure building are two separate processes. Results from these approaches using real audio and music signals will be compared and contrasted, and will contribute to our goal of creating an enhanced interface between the content of audio and music signals, e.g., onsets, notes, voices, and their multiresolution sparse atomic decompositions.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Human Pose Inference from Stereo Cameras

Human Pose Inference from Stereo Cameras

引用

IEEE Workshop on Applications of Computer Vision (WACV)

作者： Feng Guo Gang Qian Arts Media and Engineering Program and Department of Electrical Engineering Arizona State University USA

In this paper, a Bayesian mixture expert (BME) framework for the estimation of 3D human poses from two uncalibrated wide-baseline cameras is presented. The two cameras will reduce the ambiguities of the pose estimation greatly and is easy to implement. BME is learnt to conduct multimodal pose estimation regression. K-means algorithm considering Euclidean distance and maximum-value distance for the joint angle vector is used for the initial clustering in BME learning. This will give the better cluster results to separate the ambiguous poses into different experts. Also a weighted PCA is implemented in an expectation-maximization (EM) framework to learn the parameters of the BME. This can reduce the dimension of the training data more effectively compared with global PCA. The system is trained with synthesized silhouettes from motion capture data. The experimental results on synthesized and real images illustrate that our approach does not need precise camera calibration and can estimate the poses effectively

关键词： Humans Cameras Principal component analysis Calibration Bayesian methods Clustering algorithms Euclidean distance Training data Joints Visual databases

来源：评论

学校读者我要写书评

暂无评论

The multimodal music stand 07

The multimodal music stand

引用

7th International Conference on New Interfaces for Musical Expression, NIME '07

作者： Bell, Bo Kleban, Jim Overholt, Dan Putnam, Lance Thompson, John Kuchera-Morin, Joann Electrical and Computer Engineering California NanoSystems Institute University of California - Santa Barbara Santa Barbara CA 93106 United States Media Arts and Technology California NanoSystems Institute University of California - Santa Barbara Santa Barbara CA 93106 United States

ISBN: (纸本)9781450378376

We present the Multimodal Music Stand (MMMS) for the untethered sensing of performance gestures and the interactive control of music. Using e-field sensing, audio analysis, and computer vision, the MMMS captures a performer's continuous expressive gestures and robustly identifies discrete cues in a musical performance. Continuous and discrete gestures are sent to an interactive music system featuring custom designed software that performs real-time spectral transformation of audio.

关键词： Music

来源：评论

学校读者我要写书评

暂无评论

Experiential Signal Processing (ESP) and Experiential Telecommunications (ET)

Experiential Signal Processing (ESP) and Experiential Teleco...

引用

Asilomar Conference on Signals, Systems & Computers

作者： Jerry D. Gibson JoAnn Kuchera-Morin Alex Norman Department of Electrical & Computer Engineering and the Media Arts & Technology University of California Santa Barbara USA USA University of California 슠Santa Barbara USA

We define the research fields of experiential signal processing (ESP) and experiential telecommunications (ET), which are concerned with sensing, communicating, and presenting an Environment, Event, or Experience at a distance. We develop our vision of ESP and ET and describe key components and research fields. We highlight the challenges of presenting multichannel, multimedia information and present an example for panoramic video using the Allosphere, a 3-story sphere housed in an anechoic chamber, that has been constructed at UCSB.

关键词： Signal processing Electrostatic precipitators Video signal processing Optical signal processing Anechoic chambers Management training Art Educational products Optical sensors Terminology

来源：评论

学校读者我要写书评

暂无评论

Movement-based interactive dance performance 06

Movement-based interactive dance performance

引用

14th Annual ACM International Conference on Multimedia, MM 2006

作者： James, Jodi Ingalls, Todd Qian, Gang Olsen, Loren Whiteley, Daniel Wong, Siew Rikakis, Thanassis Arts Media and Engineering Program Arizona State University Department of Electrical Engineering Arizona State University

ISBN: (纸本)1595934472

Movement-based interactive dance has recently attracted great interest in the performing arts. While utilizing motion capture technology, the goal of this project was to design the necessary real-time motion analysis engine, staging, and communication systems for the completion of a movement-based interactive multimedia dance performance. The movement analysis engine measured the correlation of dance movement between three people wearing similar sets of retro-reflective markers in a motion capture volume. This analysis provided the framework for the creation of an interactive dance piece, Lucidity, which will be described in detail. Staging such a work also presented additional challenges. These challenges and our proposed solutions will be discussed. We conclude with a description of the final work and a summary of our future research objectives. Copyright 2006 ACM.

关键词： Systems analysis

来源：评论

学校读者我要写书评

暂无评论

Learning and inference of 3D human poses from Gaussian mixture modeled silhouettes

Learning and inference of 3D human poses from Gaussian mixtu...

引用

18th International Conference on Pattern Recognition, ICPR 2006

作者： Feng, Guo Gang, Qian Department of Electrical Engineering and Arts Media and Engineering Program Arizona State University Tempe AZ 85287 United States

ISBN: (纸本)0769525210

In this paper, we present a learning and inference framework for 3D human pose recovery using silhouettes represented by Gaussian mixtures. A Bayesian mixture of experts is learnt to conduct multimodal pose regression. The major contribution of this paper is the use of Gaussian mixtures as silhouette shape descriptor and Kullback-Leibler divergence (KLD) for silhouette distance and kernel computation. Using Gaussian mixtures and KLD makes the learning and inference robust to errors in silhouettes extraction. It also allows likelihood evaluation of different pose estimates. This is done by computing the similarity of the observed silhouette and the predicted silhouettes by a generic body model onto the image plane. The system was trained with silhouettes rendered using animation software driven by motion capture data. Experimental results using both synthetic and real image silhouettes illustrate the usefulness of the proposed framework. © 2006 IEEE.

关键词： Three dimensional computer graphics

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：