检索结果-内蒙古大学图书馆

34th IEEE International Conference on visual communications and image processing, VCIP 2019

作者： Russell, Mosin Zou, Ju Jia Fang, Gu Cai, Weidong Western Sydney University School of Computing Engineering and Mathematics NSW2751 Australia University of Sydney School of Computer Science NSW2006 Australia

ISBN: (纸本)9781728137230

In this paper, a new background subtraction framework is proposed to deal with possible scenarios occurring in natural scenes. In this method, a combination of two feature descriptors, namely color information in HSV color format and global texture descriptor T, are introduced to effectively identify background points under varying conditions. Using these features, an adaptive background model is constructed to automatically adapt to scene changes. The proposed framework is evaluated on common change detection datasets, showing improved performance compared to three well-known methods. © 2019 IEEE.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Kernelized Target Representation for visual Tracking with Sparse Constraint 4

Kernelized Target Representation for Visual Tracking with Sp...

引用

4th IEEE International Conference on image, Vision and Computing, ICIVC 2019

作者： Wang, Yuanyun Wang, Jun Wu, Zhaoming Tian, Wei Deng, Chengzhi Wang, Shengqian Nanchang Institute of Technology Jiangxi Province Key Laboratory of Water Information Cooperative Sensing and Intelligent Processing Nanchang330099 China

ISBN: (纸本)9781728123257

Robust visual tracking is a challenging task due to factors motion blur, fast motion, partial occlusion and illumination variation. Existing tracking algorithms represent a target candidate by templates or a linear combination of them with some constraints such as sparse coding. While the high computational cost restricts the tracking speed with sparse constraint since many trivial templates are introduced. Due to affecting by complicated appearance variations, the relationship between a target candidate and the corresponding target templates is not linear. Maybe it is nonlinear or complex. In this paper, we present a kernelized target representation model with sparsity constraint for visual tracking. Namely, a target candidate is represented by a nonlinear combination of templates with sparsity constraint. The proposed appearance model have the advantages of sparse coding and kernel method, which is robust to outliers and represents a target candidate in a high feature space. A novel tracker is proposed upon the presented appearance model. Superior experimental results are achieved against state-of-the-art trackers on some challenging sequences. © 2019 IEEE.

关键词： Target tracking

来源：评论

学校读者我要写书评

暂无评论

Mode-Dependent Transforms Based on Elliptical Model for High Efficiency Video Coding

Mode-Dependent Transforms Based on Elliptical Model for High...

引用

30th IEEE Conference on visual communications and image processing (VCIP)

作者： Jia, Kaiyuan Chen, Chen Meng, Xiandong Zhu, Shuyuan Zeng, Bing Univ Elect Sci & Technol China Inst Image Proc 2006 Xiyuan Ave Chengdu Sichuan Peoples R China Hong Kong Univ Sci & Technol Dept Elect & Comp Engn Kowloon Hong Kong Peoples R China

ISBN: (纸本)9781509053162

High efficiency video coding (HEVC) defines 35 prediction modes in its intra prediction stage to signal the direction information of residual blocks. Traditionally, separable two-dimension (2-D) transforms (integer DCT and DST) are utilized in a similar manner as in the previous H.264/AVC standards. However, such 2-D transforms cannot yield the best energy compaction for a 2-D directional source where the dominating directional information is other than the horizontal or vertical one. In order to overcome this drawback, we build an elliptical model with directionality and design some non separable transforms based on the Karhunen-Loeve transform in this paper. Specifically, we derive a non-separable transform in closed-form for each intra-prediction mode and replace the default transform in HEVC. Simulation results reveal that 1.7% and 2.0% on average and up to 7.7% and 8.1% BD-rate reduction can be achieved for luma and chroma component, respectively. In the meantime, the test results show that both the encoding time and decoding time increase only about 5%.

关键词： Terms HEVC intra prediction KLT mode-dependent transform non-separable transform elliptical model

来源：评论

学校读者我要写书评

暂无评论

Part Propagation for Local Part Segmentation

Part Propagation for Local Part Segmentation

引用

30th IEEE Conference on visual communications and image processing (VCIP)

作者： Meng, Fanman Li, Hongliang Wu, Qingbo Luo, Bing Cai, Jianfei Huang, Chao Univ Elect Sci & Technol China Sch Elect Engn 2006 Xiyuan Ave Chengdu 611731 Sichuan Peoples R China Nanyang Technol Univ Sch Comp Sci & Engn Nanyang Ave Singapore 639798 Singapore

ISBN: (纸本)9781509053162

Segment propagation transfers object priors among images, which is an important prior generation manner in image segmentation. The existing propagation methods focus on object foreground propagation, while the detailed part propagation is deficiency, which is caused by the challenges that not only the multiple part regions, but also their relationships need to be transferred. In this paper, a part propagation method is proposed. Two level propagations such as object level propagation, and part level propagation are successively used for the part propagation. The object level propagation is to transfer global shape information among images, which is formulated as graph matching based edge fragments matching problem, with dynamic programming solution. The part level propagation is to transfer the more detailed part labels, which is formulated as pixel level structure matching problem, and is efficiently solved by traditional dense pixel matching methods. The proposed method is verified on 15 challenging classes selected from PASCAL 2010 dataset, Bird dataset and Cat-Dog dataset. The experimental results demonstrate the effectiveness of the proposed method.

关键词： Segmentation Propagation Part Segmentation Graph Matching Dense Matching

来源：评论

学校读者我要写书评

暂无评论

DC Coefficient Estimation of Intra-Predicted Residuals in High Efficiency Video Coding

DC Coefficient Estimation of Intra-Predicted Residuals in Hi...

引用

30th IEEE Conference on visual communications and image processing (VCIP)

作者： Chen, Chen Miao, Zexiang Meng, Xiandong Zhu, Shuyuan Zeng, Bing Hong Kong Univ Sci & Technol Dept Elect & Comp Engn Kowloon Hong Kong Peoples R China Univ Elect Sci & Technol China Inst Image Proc 2006 Xiyuan Ave Chengdu Sichuan Peoples R China

ISBN: (纸本)9781509053162

This paper proposes a DC coefficient estimation algorithm for intra-predicted residual blocks in the High Efficiency Video Coding (HEVC) standard. Discarding the DC coefficient in the current coding block leads to a substantial bit-saving but produces at the same time strong discontinuities between this block and its neighboring reconstructed blocks. To overcome this problem, we propose an estimation algorithm for the DC coefficient, which solves an optimal offset in a closed form in the pixel domain to recover the corresponding block edges. Test results show that our algorithm achieves 1.0% and 1.4% BD-rate reduction on average for luma and chroma as compared with HM-16.6, respectively, when the sign-bit-hiding (SBH) technique is disabled. When SDH is set on, namely under the common test condition (CTC), the BD-rate reduction drops slightly to 0.7% and 1.1% for luma and chroma, respectively. In the meantime, the test results show that both encoding time and decoding time increase only slightly (about 10%, without any special optimization on programming our proposed algorithm).

关键词： Terms HEVC video coding intra prediction coefficient estimation R-D performance

来源：评论

学校读者我要写书评

暂无评论

Shape based Co-segmentation repairing by Segment Evaluation and Object Proposals

Shape based Co-segmentation repairing by Segment Evaluation ...

引用

30th IEEE Conference on visual communications and image processing (VCIP)

作者： Shi, Wen Zhu, Hongyuan Yang, Li Luo, Yuanqing Univ Elect Sci & Technol China Coll Elect Engn 2006 Xiyuan Ave West Hitech Zone Chengdu 611731 Sichuan Peoples R China ASTAR I2R Singapore Singapore

ISBN: (纸本)9781509053162

Repairing co-segmentation results by consistency evaluation shows the improvement of the co-segmentation performance. However, the existing co-segmentation refinement methods focus on color feature, while the mid-level features based repairing, such as shape, is ignored. In this paper, we propose a new shape based co-segmentation refinement method. An edge map based segment completeness evaluation and a shape based segment consistency evaluation are firstly proposed. Then, we use the initial segments and their evaluation scores to refine each result by employing the object proposals. By repeating such two evaluation and refinement steps, final refined results are obtained. Compared with traditional methods where only the bad segment is repaired, all segments are simultaneously evaluated and refined in an iteration process in our method to achieve better results. We verify our method based on Icoseg dataset. The results show larger IOU values than the original results.

关键词： co-segmentation segmentation evaluation

来源：评论

学校读者我要写书评

暂无评论

Task Estimation Using Latent Semantic Analysis of visual Scenes and Spoken Words

引用

ELECTRONICS AND communications IN JAPAN 2014年第6期97卷 33-42页

作者： Kimura, Masashi Sawada, Shinta Iribe, Yurie Katsurada, Kouichi Nitta, Tsuneo Toyohashi Univ Technol Toyohashi Aichi Japan Toyohashi Univ Technol Informat & Media Ctr Toyohashi Aichi Japan

In this paper, we propose a task estimation method based on multiple subspaces extracted from multimodal information of image objects in visual scenes and spoken words in dialogue appearing in the same task. The multiple subspaces are obtained by using latent semantic analysis (LSA). In the proposed method, a task vector composed of spoken words and the frequencies of image-object appearances are extracted first, and then similarities among the input task vector and reference subspaces of different tasks are compared. Experiments are conducted on the identification of game tasks. The experimental results show that the proposed method with multimodal information outperforms the method in which only the single modality of image or spoken dialogue is applied. The proposed method achieves accurate performance even if less spoken dialogue is applied.

关键词： multimodal processing latent semantic analysis task estimation

来源：评论

学校读者我要写书评

暂无评论

image super-resolution with sparse representation prior on primitive patches

Image super-resolution with sparse representation prior on p...

引用

Conference on visual communications and image processing (VCIP)

作者： Li, Haifeng Xiong, Hongkai Qian, Liang Shanghai Jiao Tong Univ Dept Elect Engn Shanghai 200240 Peoples R China

ISBN: (纸本)9780819482341

We focus on the problem of single image super-resolution in this paper. Given a low-resolution image, we seek to synthesize its underlying high-resolution details using a learning based method. Inspired by recent progress in compressive sensing, we use sparse representation prior to regularize this ill-posed problem. On the other hand, with natural image statistics taken into consideration, we enforce the prior only on those image patches associated with image primitives rather than on arbitrary ones. Specifically, each patch from primitive layer of the lowresolution image, which can be viewed as a low-dimensional projection of a high-resolution primitive patch, is conjectured to have a sparse representation concerning an over-complete dictionary. Under mild conditions, the sparse representation can be correctly restored from the low-dimensional projection according to the theory of compressive sensing. We also construct a dictionary using image primitive patches which works well on generic input images. Experiment results show the efficiency of our method by outperforming other learning-based methods both subjectively and objectively.

关键词： image super-resolution learning compressive sensing sparse representation image primitives

来源：评论

学校读者我要写书评

暂无评论

违章停车检测与识别算法

引用

吉林大学学报（工学版） 2010年第1期40卷 42-46页

作者：王殿海胡宏宇李志慧曲昭伟吉林大学交通学院长春130022

针对传统违章停车人工检测方式的缺陷,设计了基于图像处理技术的停车违章监控算法。在禁停路段区域设置视觉传感器采集视频图像序列,利用自适应的混合高斯模型实现复杂交通场景下的背景抽取,提取可能运动前景目标。利用像素级时间序列... 详细信息

针对传统违章停车人工检测方式的缺陷,设计了基于图像处理技术的停车违章监控算法。在禁停路段区域设置视觉传感器采集视频图像序列,利用自适应的混合高斯模型实现复杂交通场景下的背景抽取,提取可能运动前景目标。利用像素级时间序列特征检测静止物体,并根据对象级区域特征实现停驶车辆的辨识,获取车辆的违章停车信息。根据不同禁停区域的具体违章要求实现自动警报。最后,通过实际交通场景视频序列对算法进行了验证,结果表明了本文方法的有效性。

关键词：交通运输系统工程智能交通视频监控违章停车背景模型

来源：评论

学校读者我要写书评

暂无评论

Exploring the relationships of regions for visual content understanding

Exploring the relationships of regions for visual content un...

引用

Conference on visual communications and image processing 2008

作者： Liu, Ting Wang, Weiqiang Tian, Yonghong Huang, Tiejun Chinese Acad Sci Inst Comp Technol Beijing 100864 Peoples R China Chinese Acad Sci Grad Univ Beijing Peoples R China Peking Univ Inst Digital Media Beijing Peoples R China

ISBN: (纸本)9780819469946

An image can be considered as a collection of small regions. Most researches of image understanding extract features of these regions, and investigate relationships between these regions and keywords of images that are annotated manually. There are also some researches that explore the ontology of words. However, little attention has been paid to the relationships among regions in an image. In this paper, we make a close study of this type of relationships without the assumption that they are independent for visual content understanding. We first analyze the co-occurrence of regions using a statistical relevance probability model (SRP). Since human attention in the perception process of an image first focuses in one region and then moves on to other relevant regions, we propose a novel model called region sequence prediction model (RSP) to describe it. In RSP, annotation keywords for region sequences of the image and their probabilities are generated by a hidden Markov model. Experimental results of both image annotation and retrieval on the Corel dataset (an open image dataset) show that mining the relationships of image regions will achieve comparative or better performance in visual content understanding.

关键词： visual content understanding automatic image annotation statistical relevance prediction region sequence prediction Hidden Markov Model

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：