Search Results - Inner Mongolia University Library

Joint Optimization for Object Class Segmentation and Dense Stereo Reconstruction

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, vol. 100, no. 2, pp. 122-133

Authors: Ladicky, Lubor; Sturgess, Paul; Russell, Chris; Sengupta, Sunando; Bastanlar, Yalin; Clocksin, William; Torr, Philip H. S.

Affiliations: Univ Oxford, Oxford, England; Oxford Brookes Univ, Oxford OX3 0BP, England; Univ London, London, England; Izmir Inst Technol, Izmir, Turkey; Univ Hertfordshire, Hatfield AL10 9AB, Herts, England

The problems of dense stereo reconstruction and object class segmentation can both be formulated as Random Field labeling problems, in which every pixel in the image is assigned a label corresponding to either its disparity, or an object class such as road or building. While these two problems are mutually informative, no attempt has been made to jointly optimize their labelings. In this work we provide a flexible framework configured via cross-validation that unifies the two problems and demonstrate that, by resolving ambiguities, which would be present in real world data if the two problems were considered separately, joint optimization of the two problems substantially improves performance. To evaluate our method, we augment the Leuven data set (http://***/research/visiongroup/files/***), which is a stereo video shot from a car driving around the streets of Leuven, with 70 hand labeled object class and disparity maps. We hope that the release of these annotations will stimulate further work in the challenging domain of street-view analysis. Complete source code is publicly available (http://***/staff/Philip-Torr/***).

Keywords: object class segmentation; dense stereo reconstruction; random fields
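As a rough illustration of the random field formulation mentioned in the abstract above, a generic pairwise labeling energy over a joint label space can be written as follows. This is a textbook-style sketch, not the paper's actual energy, whose terms are not given in this record. The symbols are notation assumed here for illustration: $x_i$ is the object class of pixel $i$, $d_i$ its disparity, $\psi_i$ and $\psi_{ij}$ are unary and pairwise potentials, and $\mathcal{N}$ is the set of neighboring pixel pairs.

```latex
E(\mathbf{x}, \mathbf{d}) =
  \sum_{i} \psi_i(x_i, d_i)
  + \sum_{(i,j) \in \mathcal{N}} \psi_{ij}\big((x_i, d_i), (x_j, d_j)\big)
```

Under this assumed form, a joint approach minimizes $E$ over $\mathbf{x}$ and $\mathbf{d}$ together, whereas treating the two problems separately would minimize one energy over $\mathbf{x}$ and another over $\mathbf{d}$ with no terms coupling them.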

Interactive Object Class Segmentation for Mobile Devices

27th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

Authors: Gallo, Ignazio; Zamberletti, Alessandro; Noce, Lucia

Affiliations: Univ Insubria, Dept Theoret & Appl Sci, DiSTA, Varese, Italy

ISBN: (print) 9781479942602

In this paper we propose an interactive approach for object class segmentation of natural images on touch-screen capable mobile devices. The key research question to which this paper tries to give an answer is: can we effectively correct the errors committed by an automatic or semi-automatic figure-ground segmentation algorithm while also providing real-time feedback to the user on a low computational power mobile device? Many research works have focused on improving automatic or semi-automatic figure-ground segmentation algorithms, but none tried to take advantage of the existing touch-screen technology integrated in most modern mobile devices to optimize the segmentation results of these algorithms. Our key idea is to use superpixels as interactive buttons that can be quickly tapped by the user to be added to or removed from an initial low-quality segmentation mask, with the aim of correcting the segmentation errors and producing a satisfying final result. We performed an extensive analysis of the proposed approach by implementing it both on a desktop computer and a mid-range Android device; even though our method is extremely simple, the results we obtained are comparable with those achieved by other state-of-the-art interactive segmentation algorithms. As such, we believe that the proposed approach can be exploited by most image editing mobile applications to provide a simple but highly effective method for interactive object class segmentation.

Keywords: interactive image segmentation; object class segmentation; GrabCut segmentation; superpixel segmentation
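The "superpixels as buttons" idea in the abstract above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `toggle_superpixel`, the majority-vote rule for deciding whether a tapped superpixel currently counts as foreground, and the toy 4x4 superpixel map are all hypothetical.

```python
import numpy as np

def toggle_superpixel(mask, superpixels, tap_row, tap_col):
    """Return a copy of `mask` with the tapped superpixel added or removed.

    mask        : boolean array (H, W), current foreground mask
    superpixels : integer array (H, W), one id per superpixel
    """
    # All pixels that belong to the superpixel under the tap.
    region = superpixels == superpixels[tap_row, tap_col]
    # Assumption: the superpixel counts as foreground if most of its
    # pixels are currently inside the mask.
    is_foreground = mask[region].mean() > 0.5
    updated = mask.copy()
    updated[region] = not is_foreground
    return updated

# Toy 4x4 image split into four 2x2 superpixels with ids 0..3.
superpixels = np.repeat(np.repeat(np.array([[0, 1], [2, 3]]), 2, axis=0), 2, axis=1)
mask = superpixels == 0                            # initial mask: superpixel 0 only
mask = toggle_superpixel(mask, superpixels, 0, 3)  # tap in superpixel 1: added
mask = toggle_superpixel(mask, superpixels, 1, 0)  # tap in superpixel 0: removed
```

Each tap touches only the pixels of one superpixel, so the update is a single boolean-array assignment, which is the kind of cheap operation that suits real-time feedback on a low-power device.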

MULTI-LABEL ENERGY MINIMIZATION FOR OBJECT CLASS SEGMENTATION

20th European Signal Processing Conference (EUSIPCO)

Author: Couprie, Camille

Affiliation: NYU, Dept Comp Sci, Courant Inst, New York, NY 10003, USA

ISBN: (print) 9781467310680

The task of associating a semantic class to the objects present in an image is challenging because this problem involves the joint segmentation and recognition of the objects. In this work, we use a recent approach embedding several optimization algorithms into a common framework named Power watershed to perform this task. We show how the fast watershed algorithm can be used to minimize an energy function for which the minimizer corresponds to the desired object class segmentation. Higher order potentials are then added to improve label consistency. We also demonstrate that the random walker algorithm can be successfully applied to semantic class segmentation problems. Comparisons with the Graph Cuts algorithm show that the proposed approaches yield better segmentation results, obtained up to twelve times faster on a very challenging indoor scenes dataset.

Keywords: image processing; object class segmentation; graph-based optimization; graph cuts; random walker; watershed

Inference Methods for CRFs with Co-occurrence Statistics

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, vol. 103, no. 2, pp. 213-225

Authors: Ladicky, L'ubor; Russell, Chris; Kohli, Pushmeet; Torr, Philip H. S.

Affiliations: Univ Oxford, Oxford, England; Univ London, Queen Mary Coll, London, England; Microsoft Res, Cambridge, England; Oxford Brookes Univ, Oxford OX3 0BP, England

The Markov and Conditional random fields (CRFs) used in computer vision typically model only local interactions between variables, as this is generally thought to be the only case that is computationally tractable. In this paper we consider a class of global potentials defined over all variables in the CRF. We show how they can be readily optimised using standard graph cut algorithms at little extra expense compared to a standard pairwise field. This result can be directly used for the problem of class based image segmentation which has seen increasing recent interest within computer vision. Here the aim is to assign a label to each pixel of a given image from a set of possible object classes. Typically these methods use random fields to model local interactions between pixels or super-pixels. One of the cues that helps recognition is global object co-occurrence statistics, a measure of which classes (such as chair or motorbike) are likely to occur in the same image together. There have been several approaches proposed to exploit this property, but all of them suffer from different limitations and typically carry a high computational cost, preventing their application on large images. We find that the new model we propose produces a significant improvement in the labelling compared to just using a pairwise model and that this improvement increases as the number of labels increases.

Keywords: conditional random fields; object class segmentation; optimization

ImageSpirit: Verbal Guided Image Parsing

ACM TRANSACTIONS ON GRAPHICS, 2014, vol. 34, no. 1, pp. 1–11

Authors: Cheng, Ming-Ming; Zheng, Shuai; Lin, Wen-Yan; Vineet, Vibhav; Sturgess, Paul; Crook, Nigel; Mitra, Niloy J.; Torr, Philip

Affiliations: Univ Oxford, Wellington Sq, Oxford OX1 2JD, England; Oxford Brookes Univ, Oxford OX3 0BP, England; UCL, London WC1E 6BT, England; Univ Oxford, Oxford OX1 2JD, England

ISBN: (print) 9781450333313

Humans describe images in terms of nouns and adjectives while algorithms operate on images represented as sets of pixels. Bridging this gap between how humans would like to access images versus their typical representation is the goal of image parsing, which involves assigning object and attribute labels to pixels. In this article we propose treating nouns as object labels and adjectives as visual attribute labels. This allows us to formulate the image parsing problem as one of jointly estimating per-pixel object and attribute labels from a set of training images. We propose an efficient (interactive time) solution. Using the extracted labels as handles, our system empowers a user to verbally refine the results. This enables hands-free parsing of an image into pixel-wise object/attribute labels that correspond to human semantics. Verbally selecting objects of interest enables a novel and natural interaction modality that can possibly be used to interact with new generation devices (e.g., smartphones, Google Glass, livingroom devices). We demonstrate our system on a large number of real-world images with varying complexity. To help understand the trade-offs compared to traditional mouse-based interactions, results are reported for both a large-scale quantitative evaluation and a user study.

Keywords: design; human factors; languages; image parsing; natural language control; speech interface; object class segmentation; visual attributes; multilabel CRF

Filter-Based Mean-Field Inference for Random Fields with Higher-Order Terms and Product Label-Spaces

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, vol. 110, no. 3, pp. 290-307

Authors: Vineet, Vibhav; Warrell, Jonathan; Torr, Philip H. S.

Affiliations: Oxford Brookes Univ, Oxford OX3 0BP, England; MIAS, CSIR, Pretoria, South Africa; Univ Oxford, Dept Engn Sci, Oxford OX1 3PJ, England

Recently, a number of cross bilateral filtering methods have been proposed for solving multi-label problems in computer vision, such as stereo, optical flow and object class segmentation, that show an order of magnitude improvement in speed over previous methods. These methods have achieved good results despite using models with only unary and/or pairwise terms. However, previous work has shown the value of using models with higher-order terms, e.g., to represent label consistency over large regions, or global co-occurrence relations. We show how these higher-order terms can be formulated such that filter-based inference remains possible. We demonstrate our techniques on joint stereo and object labelling problems, as well as object class segmentation, showing in addition for joint object-stereo labelling how our method provides an efficient approach to inference in product label-spaces. We show that we are able to speed up inference in these models around 10-30 times with respect to competing graph-cut/move-making methods, as well as maintaining or improving accuracy in all cases. We show results on PascalVOC-10 for object class segmentation, and Leuven for joint object-stereo labelling.

Keywords: object class segmentation; dense stereo reconstruction; mean-field methods; higher order potentials; bilateral filters; CRF
