检索结果-内蒙古大学图书馆

1966 ieee computer society conference on computer vision and pattern recognition

作者： Gavrila, DM Davis, LS UNIV MARYLAND CFARCOMP VIS LABCOLLEGE PKMD 20742

ISBN: (纸本)0818672587

We present a vision system for the 3-D model-based tracking of unconstrained human movement. Using image sequences acquired simultaneously from multiple views, we recover the 3-D body pose at each time instant without the use of markers. The pose-recovery problem is formulated as a search problem and entails finding the pose parameters of a graphical human model whose synthesized appearance is most similar to the actual appearance of the real human in the multi-view images. The models used for this purpose are acquired from the images. We use a decomposition approach and a best-first technique to search through the high dimensional pose parameter space. A robust variant of chamfer matching is used as a fast similarity measure between synthesized and real edge images. We present initial tracking results from a large new Humans-In-Action (HIA) database containing more than 2500 frames in each of four orthogonal views. They contain subjects involved in a variety of activities, of various degrees of complexity, ranging from the more simple one-person hand waving to the challenging two person close interaction in the Argentine Tango.

关键词： pattern recognition systems

来源：评论

学校读者我要写书评

暂无评论

Neurodata Lab's approach to the Challenge on computer vision for Physiological Measurement

Neurodata Lab's approach to the Challenge on Computer Vision...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Artemyev, Mikhail Churikova, Marina Grinenko, Mikhail Perepelkina, Olga Neurodata Lab LLC Miami FL 33137 USA Lomonosov Moscow State Univ Fac Biol Dept Higher Nervous Act Moscow Russia

ISBN: (纸本)9781728193601

This paper introduces the Neurodata Lab's approach presented at the 1st Challenge on Remote Physiological Signal Sensing (RePSS) organized within CVPR2020. The RePSS challenge was focused on measuring the average heart rate from color facial videos, which is one of the most fundamental problems in the field of computer vision. Our deep learning-based approach includes 3D spatio-temporal attention convolutional neural network for photoplethysmogram extraction and 1D convolutional neural network pre-trained on synthetic data for time series analysis. It provides state-of-the-art results outperforming those of other participants on a mixture of VIPL and OBF databases: MAE=6.94 (12.3% improvement compared to the top-2 result), RMSE=10.68 (24.6% improvement), Pearson R = 0.755 (28.2% improvement).

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

WiCV 2019: The Sixth Women In computer vision Workshop 32

WiCV 2019: The Sixth Women In Computer Vision Workshop

引用

32nd ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Amerini, Irene Balashova, Elena Ebrahimi, Sayna Leonard, Kathryn Nagrani, Arsha Salvador, Amaia Univ Florence Florence Italy Princeton Univ Princeton NJ 08544 USA Univ Calif Berkeley Berkeley CA USA Occident Coll Los Angeles CA USA Univ Oxford Oxford England Univ Politen Catalunya Barcelona Spain

ISBN: (纸本)9781728125060

In this paper we present the Women in computer vision Workshop - WiCV 2019, organized in conjunction with CVPR 2019. This event is meant for increasing the visibility and inclusion of women researchers in computer vision field. computer vision and machine learning have made incredible progress over the past years, but the number of female researchers is still low both in the academia and in the industry. WiCV is organized especially for this reason: to raise visibility of female researchers, to increase collaborations between them, and to provide mentorship to female junior researchers in the field. In this paper, we present a report of trends over the past years, along with a summary of statistics regarding presenters, attendees, and sponsorship for the current workshop.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Embedded Computing Framework for vision-based Real-time Surround Threat Analysis and Driver Assistance 29

Embedded Computing Framework for Vision-based Real-time Surr...

引用

29th ieee conference on computer vision and pattern recognition (CVPR)

作者： Lu, Frankie Lee, Sean Satzoda, Ravi Kumar Trivedi, Mohan Univ Calif San Diego San Diego CA 92103 USA

ISBN: (纸本)9781509014378

In this paper, we present a distributed embedded vision system that enables surround scene analysis and vehicle threat estimation. The proposed system analyzes the surroundings of the ego-vehicle using four cameras, each connected to a separate embedded processor. Each processor runs a set of optimized vision-based techniques to detect surrounding vehicles, so that the entire system operates at real-time speeds. This setup has been demonstrated on multiple vehicle testbeds with high levels of robustness under real-world driving conditions and is scalable to additional cameras. Finally, we present a detailed evaluation which shows over 95% accuracy and operation at nearly 15 frames per second.

关键词： Vehicles

来源：评论

学校读者我要写书评

暂无评论

Key Point-Based Driver Activity recognition

Key Point-Based Driver Activity Recognition

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Vats, Arpita Anastasiu, David C. Santa Clara Univ Santa Clara CA 95053 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

We present a key point-based activity recognition framework, built upon pre-trained human pose estimation and facial feature detection models. Our method extracts complex static and movement-based features from key frames in videos, which are used to predict a sequence of key-frame activities. Finally, a merge procedure is employed to identify robust activity segments while ignoring outlier frame activity predictions. We analyze the different components of our framework via a wide array of experiments and draw conclusions with regards to the utility of the model and ways it can be improved. Results show our model is competitive, taking the 11th place out of 27 teams submitting to Track 3 of the 2022 AI City Challenge.

关键词： computer vision conferences Urban areas Pose estimation Activity recognition Predictive models Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Recognizing Actions from Depth Cameras as Weakly Aligned Multi-Part Bag-of-Poses

Recognizing Actions from Depth Cameras as Weakly Aligned Mul...

引用

26th ieee conference on computer vision and pattern recognition (CVPR)

作者： Seidenari, Lorenzo Varano, Vincenzo Berretti, Stefano Del Bimbo, Alberto Pala, Pietro Univ Florence I-50121 Florence Italy

ISBN: (纸本)9780769549903

Recently released depth cameras provide effective estimation of 3D positions of skeletal joints in temporal sequences of depth maps. In this work, we propose an efficient yet effective method to recognize human actions based on the positions of joints. First, the body skeleton is decomposed in a set of kinematic chains, and the position of each joint is expressed in a locally defined reference system which makes the coordinates invariant to body translations and rotations. A multi-part bag-of-poses approach is then defined, which permits the separate alignment of body parts through a nearest-neighbor classification. Experiments conducted on the Florence 3D Action dataset and the MSR Daily Activity dataset show promising results.

关键词： cameras gesture recognition image representation pattern classification position measurement

来源：评论

学校读者我要写书评

暂无评论

AAFormer: A Multi-Modal Transformer Network for Aerial Agricultural Images

AAFormer: A Multi-Modal Transformer Network for Aerial Agric...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Shen, Yao Wang, Lei Jin, Yue China Pacific Insurance Grp Co Ltd Shanghai Peoples R China East China Normal Univ Shanghai Peoples R China

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

The semantic segmentation of agricultural aerial images is very important for the recognition and analysis of farmland anomaly patterns, such as drydown, endrow, nutrient deficiency, etc. Methods for general semantic segmentation such as Fully Convolutional Networks can extract rich semantic features, but are difficult to exploit the long-range information. Recently, vision Transformer architectures have made outstanding performances in image segmentation tasks, but transformer-based models have not been fully explored in the field of ***, we propose a novel architecture called Agricultural Aerial Transformer (AAFormer) to solve the semantic segmentation of aerial farmland images. We adopt Mix Transformer (MiT) in the encoder stage to enhance the ability of field anomaly pattern recognition and leverage the Squeeze-and-Excitation (SE) module in the decoder stage to improve the effectiveness of key channels. The boundary maps of farmland are introduced into the decoder. Evaluated on the Agriculture-vision validation set, the mIoU of our proposed model reaches 45.44%.

关键词： Image segmentation Image recognition conferences Semantics computer architecture Transformers Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Segmenting visual actions based on spatio-temporal motion patterns

Segmenting visual actions based on spatio-temporal motion pa...

引用

ieee conference on computer vision and pattern recognition (CVPR 2000)

作者： Rui, Y Anandan, P Microsoft Res Redmond WA 98052 USA

ISBN: (纸本)0769506623

The analysis of human action captured in video sequences has been a topic of considerable interest in computer vision. Much of the previous work has focused on the problem of action or activity recognition, but ignored the problem of detecting action boundaries in a video sequence containing unfamiliar and arbitrary visual actions. This paper presents an approach to this problem based on detecting temporal discontinuities of the spatial pattern of image motion that captures the action. We represent frame to frame optical-flow in terms of the coefficients of the most significant principal components computed from all the flow-fields within a given video sequence. We then detect the discontinuities in the temporal trajectories of these coefficients based on three different measures. We compare our segment boundaries against those detected by human observers on the same sequences in a recent independent psychological study of human perception of visual events. We show experimental results on the two sequences that were used in this study. Our experimental results are promising both from visual evaluation and when compared against the results of the psychological study.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Panoramic EPI generation and analysis of video from a moving platform with vibration

引用

Proceedings of the ieee computer society conference on computer vision and pattern recognition 1999年 2卷 531-537页

作者： Zhu, Zhigang Xu, Guangyou Lin, Xueyin Tsinghua Univ Beijing China

This paper presents a novel approach for generating and analyzing epipolar plane images (EPIs) from video sequences taken from a moving platform subject to vibration so that the 3D model of an arbitrary scene can be constructed. Two problems are solved in our approach: (1) how to generate EPIs from video under a more general motion than a pure translation; (2) how to analyze the huge amount of data in the EPIs robustly and efficiently. For the first problem, a 3D image stabilization method is proposed which decouples the vibration from the vehicle's motion so that good EPIs and panoramic view images (PVIs) can be generated. For the second problem, we propose an efficient panoramic EPI analysis (PEPIA) method in which only one scanline of each EPI is processed. The PEPIA combines advantages of PVIs and EPIs and consists of three important steps: locus orientation detection, motion boundary localization, and occlusion/resolution recovery. The output of the PEPIA - a layered 3D panorama, is very useful in visual navigation and virtual reality modeling. Since camera calibration, image segmentation, feature extraction and matching are avoided, all the proposed algorithms are fully automatic and rather general. Results on real image sequences are given.

关键词： computer vision Algorithms Image recording Video cameras Virtual reality Epipolar plane images (EPIs) Panoramic view images (PVIs)

来源：评论

学校读者我要写书评

暂无评论

Internal Diverse Image Completion

Internal Diverse Image Completion

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Alkobi, Noa Shaham, Tamar Rott Michaeli, Tomer Technion Haifa Israel MIT Technion Haifa Israel

ISBN: (纸本)9798350302493

Image completion is widely used in photo restoration and editing applications, e.g. for object removal. Recently, there has been a surge of research on generating diverse completions for missing regions. However, existing methods require large training sets from a specific domain of interest, and often fail on general-content images. In this paper, we propose a diverse completion method that does not require a training set and can thus treat arbitrary images from any domain. Our internal diverse completion (IDC) approach draws inspiration from recent single-image generative models that are trained on multiple scales of a single image, adapting them to the extreme setting in which only a small portion of the image is available for training. We illustrate the strength of IDC on several datasets, using both user studies and quantitative comparisons.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：