ISBN: 9781538604571 (print)
This paper addresses video summarization: the problem of distilling a raw video into a shorter form while still capturing the original story. We show that visual representations supervised by free-form language are a good fit for this application by extending a recent submodular summarization approach [9] with representativeness and interestingness objectives computed on features from a joint vision-language embedding space. We perform an evaluation on two diverse datasets, UT Egocentric [18] and TV Episodes [45], and show that our new objectives give improved summarization ability compared to standard visual features alone. Our experiments also show that the vision-language embedding need not be trained on domain-specific data, but can be learned from standard still-image vision-language datasets and transferred to video. A further benefit of our model is the ability to guide a summary using free-form text input at test time, allowing user customization.
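As a rough illustration of the objectives described above, here is a minimal greedy sketch of submodular summary selection combining a facility-location representativeness term with a text-guided interestingness term. The function name, the `alpha` trade-off, and the assumption of precomputed, L2-normalized embeddings are illustrative, not the paper's actual implementation.

```python
# Greedy maximization of a submodular summary objective built from
# representativeness (facility location) and interestingness (similarity
# to a free-form text query) in a joint vision-language embedding space.
import numpy as np

def summarize(frame_emb, query_emb, budget, alpha=0.5):
    """frame_emb: (n, d) L2-normalized frame embeddings;
    query_emb: (d,) L2-normalized text embedding; budget: summary length."""
    n = frame_emb.shape[0]
    sim = frame_emb @ frame_emb.T            # frame-frame similarity
    interest = frame_emb @ query_emb         # frame-query similarity
    selected, covered = [], np.zeros(n)
    for _ in range(budget):
        # Marginal gain of adding frame i: improvement in coverage of all
        # frames (representativeness) plus its own interestingness score.
        gain = np.maximum(sim - covered, 0).sum(axis=1) + alpha * interest
        gain[selected] = -np.inf             # never re-pick a frame
        j = int(np.argmax(gain))
        selected.append(j)
        covered = np.maximum(covered, sim[j])
    return sorted(selected)
```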
ISBN: 9781538604571 (print)
Typical techniques for sequence classification are designed for well-segmented sequences that have been edited to remove noisy or irrelevant parts. Therefore, such methods cannot be easily applied to the noisy sequences expected in real-world applications. In this paper, we present the Temporal Attention-Gated Model (TAGM), which integrates ideas from attention models and gated recurrent networks to better deal with noisy or unsegmented sequences. Specifically, we extend the concept of the attention model to measure the relevance of each observation (time step) of a sequence. We then use a novel gated recurrent network to learn the hidden representation for the final prediction. An important advantage of our approach is interpretability, since the temporal attention weights provide a meaningful measure of the salience of each time step in the sequence. We demonstrate the merits of our TAGM approach, both for prediction accuracy and interpretability, on three different tasks: spoken digit recognition, text-based sentiment analysis, and visual event recognition.
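A hedged sketch of the attention-gating idea: a scalar relevance weight per time step controls how strongly the recurrent state is updated, so irrelevant steps leave it nearly unchanged. The simple feed-forward attention scorer and the layer sizes here are assumptions; the paper's exact architecture differs.

```python
# Attention-gated recurrence: noisy steps (a_t ~ 0) barely change the state.
import torch
import torch.nn as nn

class TemporalAttentionGatedCell(nn.Module):
    def __init__(self, input_dim, hidden_dim):
        super().__init__()
        self.attn = nn.Sequential(nn.Linear(input_dim, 1), nn.Sigmoid())
        self.candidate = nn.Linear(input_dim + hidden_dim, hidden_dim)

    def forward(self, x):                        # x: (batch, T, input_dim)
        B, T, _ = x.shape
        h = x.new_zeros(B, self.candidate.out_features)
        weights = []
        for t in range(T):
            a = self.attn(x[:, t])               # (B, 1) relevance of step t
            h_cand = torch.tanh(self.candidate(torch.cat([x[:, t], h], dim=1)))
            h = (1 - a) * h + a * h_cand         # attention-gated update
            weights.append(a)
        return h, torch.cat(weights, dim=1)      # final state + per-step salience
```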
ISBN: 9781538604571 (print)
We study the quadratic assignment problem, known in computer vision as graph matching. Two leading solvers for this problem optimize the Lagrange decomposition duals with sub-gradient and dual ascent (also known as message passing) updates. We explore this direction further and propose several additional Lagrangean relaxations of the graph matching problem along with corresponding algorithms, which are all based on a common dual ascent framework. Our extensive empirical evaluation gives several theoretical insights and suggests a new state-of-the-art any-time solver for the considered problem. Our improvement over the state of the art is particularly visible on a new dataset with large-scale sparse problem instances containing more than 500 graph nodes each.
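For concreteness, the objective these solvers optimize can be stated as a tiny brute-force evaluator of the graph matching (QAP) cost over node correspondences; this is purely illustrative and only feasible for toy problem sizes.

```python
# Brute-force quadratic assignment: unary costs c plus pairwise costs d.
import itertools
import numpy as np

def qap_cost(c, d, perm):
    """c: (n, n) unary cost c[i, p] of matching node i to node p;
    d: (n, n, n, n) pairwise cost d[i, j, p, q]; perm: tuple assignment."""
    n = len(perm)
    cost = sum(c[i, perm[i]] for i in range(n))
    cost += sum(d[i, j, perm[i], perm[j]]
                for i, j in itertools.combinations(range(n), 2))
    return cost

def brute_force_match(c, d):
    """Returns the minimum-cost one-to-one correspondence."""
    n = c.shape[0]
    return min(itertools.permutations(range(n)), key=lambda p: qap_cost(c, d, p))
```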
ISBN: 9781538604571 (print)
Affordable sensors have led to increasing interest in acquiring and modeling data with multiple modalities. Learning from multiple modalities has been shown to significantly improve performance in object recognition. However, in practice it is common that the sensing equipment experiences unforeseeable malfunctions or configuration issues, leading to corrupted data with missing modalities. Most existing multi-modal learning algorithms cannot handle missing modalities and would discard either all modalities with missing values or all corrupted data. To leverage the valuable information in the corrupted data, we propose to impute the missing data by exploiting the relatedness among different modalities. Specifically, we propose a novel Cascaded Residual Autoencoder (CRA) to impute missing modalities. By stacking residual autoencoders, CRA grows iteratively to model the residual between the current prediction and the original data. Extensive experiments demonstrate the superior performance of CRA on both the data imputation task and the object recognition task on imputed data.
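A minimal sketch of the cascade, assuming zero-filled missing modalities at the input and illustrative layer widths and stage counts; each stage is an autoencoder that predicts a residual correction to the current imputation.

```python
# Cascaded residual autoencoders: each stage refines the previous imputation.
import torch
import torch.nn as nn

class ResidualAE(nn.Module):
    def __init__(self, dim, hidden):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, dim))

    def forward(self, x):
        return x + self.net(x)       # predict and add the residual

class CascadedResidualAE(nn.Module):
    def __init__(self, dim, hidden=256, n_stages=5):
        super().__init__()
        self.stages = nn.ModuleList(
            [ResidualAE(dim, hidden) for _ in range(n_stages)])

    def forward(self, x_corrupted):
        x = x_corrupted              # missing modality dims filled with zeros
        for stage in self.stages:
            x = stage(x)             # iteratively refine the imputation
        return x
```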
ISBN: 9781538604571 (print)
We propose a general dual ascent framework for Lagrangean decomposition of combinatorial problems. Although methods of this type have shown their efficiency for a number of problems, so far there has been no general algorithm applicable to multiple problem types. In this work, we propose such a general algorithm. It depends on several parameters, which can be used to optimize its performance in each particular setting. We demonstrate the efficacy of our method on graph matching and multicut problems, where it outperforms state-of-the-art solvers, including those based on subgradient optimization and off-the-shelf linear programming solvers.
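A schematic toy of the dual ascent idea on the simplest possible decomposition: two copies of the same variables with unary costs, coupled by Lagrange multipliers that are shifted to equalize min-marginals. The damping parameter and the unary-only setting are simplifications; real instances involve combinatorial subproblems.

```python
# Block-coordinate dual ascent on a two-copy Lagrange decomposition.
import numpy as np

def dual_ascent(costs_a, costs_b, n_iters=50, damping=0.5):
    """costs_a, costs_b: (n, k) unary costs of two subproblems over the same
    n variables with k labels. Returns multipliers and a dual lower bound."""
    lam = np.zeros_like(costs_a)                 # Lagrange multipliers
    for _ in range(n_iters):
        ma = costs_a + lam                       # min-marginals, subproblem A
        mb = costs_b - lam                       # min-marginals, subproblem B
        lam -= damping * (ma - mb)               # shift mass to equalize them
    lower_bound = ((costs_a + lam).min(axis=1).sum()
                   + (costs_b - lam).min(axis=1).sum())
    return lam, lower_bound
```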
ISBN: 9781538604571 (print)
The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an open-world problem: unconstrained natural-language sentences and in-the-wild videos. Our key contributions are: (1) a 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to characters; (2) a curriculum learning strategy to accelerate training and to reduce overfitting; (3) a 'Lip Reading Sentences' (LRS) dataset for visual speech recognition, consisting of over 100,000 natural sentences from British television. The WLAS model trained on the LRS dataset surpasses the performance of all previous work on standard lip reading benchmark datasets, often by a significant margin. This lip reading performance beats a professional lip reader on videos from BBC television, and we also demonstrate that if audio is available, then visual information helps to improve speech recognition performance.
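A loose sketch of one 'attend and spell' decoding step, assuming dot-product attention over encoded mouth-motion features; the dimensions, the attention form, and the module names are assumptions rather than the WLAS specification.

```python
# One step of character decoding with attention over visual encodings.
import torch
import torch.nn as nn

class AttendAndSpell(nn.Module):
    def __init__(self, enc_dim, hid_dim, n_chars):
        super().__init__()
        self.embed = nn.Embedding(n_chars, hid_dim)
        self.rnn = nn.LSTMCell(hid_dim + enc_dim, hid_dim)
        self.query = nn.Linear(hid_dim, enc_dim)
        self.out = nn.Linear(hid_dim + enc_dim, n_chars)

    def step(self, prev_char, state, enc):       # enc: (B, T, enc_dim)
        h, c = state
        scores = torch.bmm(enc, self.query(h).unsqueeze(2))   # (B, T, 1)
        attn = torch.softmax(scores, dim=1)                   # attend over time
        context = (attn * enc).sum(dim=1)                     # (B, enc_dim)
        h, c = self.rnn(torch.cat([self.embed(prev_char), context], dim=1), (h, c))
        logits = self.out(torch.cat([h, context], dim=1))     # next-char scores
        return logits, (h, c)
```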
ISBN: 9781538604571 (print)
In this work, we introduce a new kind of spatial partition tree for efficient nearest-neighbor search. Our approach first identifies a set of useful data splitting directions and then learns a codebook that can be used to encode such directions. We use the product-quantization idea to make the effective codebook large, the evaluation of scalar products between the query and the encoded splitting direction very fast, and the encoding itself compact. As a result, the proposed data structure (Product Split tree) achieves compact clustering of data points while keeping the traversal very efficient. In nearest-neighbor search experiments on high-dimensional data, product split trees achieved state-of-the-art performance, demonstrating a better speed-accuracy tradeoff than other spatial partition trees.
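A sketch of the key trick, under assumed codebook shapes: the split direction at each node is stored as product-quantization codes, so the query-direction scalar product reduces to a few table lookups during traversal. The node layout (`codes`, `threshold`, `left`, `right`, `points`) is hypothetical.

```python
# PQ-encoded split directions: one lookup table per query, reused at every node.
import numpy as np

def build_lookup(query, codebooks):
    """codebooks: (m, k, d_sub) PQ codebooks over m subspaces; query: (d,).
    Returns an (m, k) table of partial dot products."""
    m, k, d_sub = codebooks.shape
    q = query.reshape(m, d_sub)
    return np.einsum('md,mkd->mk', q, codebooks)

def split_dot(codes, table):
    """codes: (m,) PQ codes of a node's split direction.
    Evaluates <query, direction> in m lookups instead of d multiplications."""
    return table[np.arange(len(codes)), codes].sum()

def descend(tree, table):
    """tree: hypothetical nested dicts with 'codes'/'threshold'/'left'/'right'
    at internal nodes and 'points' at leaves. Returns candidate neighbors."""
    node = tree
    while 'points' not in node:
        side = 'left' if split_dot(node['codes'], table) < node['threshold'] else 'right'
        node = node[side]
    return node['points']
```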
ISBN: 9781538604571 (print)
In computer vision, selecting the most informative samples from a huge pool of training data in order to learn a good recognition model is an active research problem. Such selection is also useful for reducing annotation cost, as it is time-consuming to annotate unlabeled samples. In this paper, motivated by theories in data compression, we propose a novel sample selection strategy which exploits the concept of typicality from the domain of information theory. Typicality is a simple and powerful technique which can be applied to compress the training data needed to learn a good classification model. In this work, typicality is used to identify a subset of the most informative samples for labeling, which is then used to update the model using active learning. The proposed model can take advantage of the inter-relationships between data samples. Our approach leads to a significant reduction in manual labeling cost while achieving similar or better recognition performance compared to a model trained with the entire training set. This is demonstrated through rigorous experimentation on five datasets.
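A hedged sketch of one way typicality could drive selection, assuming a Gaussian density model of the feature pool: a sample's atypicality is how far its negative log-likelihood lies from the model entropy. Both the Gaussian model and the choice to prioritize atypical samples are assumptions, not the paper's formulation.

```python
# Typicality under a fitted Gaussian: a sample is typical when its
# negative log-likelihood is close to the differential entropy.
import numpy as np

def atypicality_scores(X):
    """X: (n, d) features. Returns |NLL(x) - H| per sample."""
    mu = X.mean(axis=0)
    d = X.shape[1]
    cov = np.cov(X, rowvar=False) + 1e-6 * np.eye(d)   # regularized covariance
    inv = np.linalg.inv(cov)
    _, logdet = np.linalg.slogdet(cov)
    diff = X - mu
    maha = np.einsum('nd,de,ne->n', diff, inv, diff)   # Mahalanobis distances
    nll = 0.5 * (maha + logdet + d * np.log(2 * np.pi))
    entropy = 0.5 * (d * (1 + np.log(2 * np.pi)) + logdet)  # Gaussian entropy
    return np.abs(nll - entropy)

def select_for_labeling(X, budget):
    return np.argsort(-atypicality_scores(X))[:budget]      # most atypical first
```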
ISBN: 9781538674499 (print)
Deep learning models learn to fit training data while they are highly expected to generalize well to testing data. Most works aim at finding such models by creatively designing architectures and fine-tuning parameters. To adapt to particular tasks, hand-crafted information such as image priors has also been incorporated into end-to-end learning. However, very little progress has been made on investigating how an individual training sample will influence the generalization ability of a model. In other words, to achieve high generalization accuracy, do we really need all the samples in a training dataset? In this paper, we demonstrate that deep learning models such as convolutional neural networks may not favor all training samples, and generalization accuracy can be further improved by dropping those unfavorable samples. Specifically, the influence of removing a training sample is quantifiable, and we propose a Two-Round Training approach aiming to achieve higher generalization accuracy. We locate unfavorable samples after the first round of training, and then retrain the model from scratch with the reduced training dataset in the second round. Since our approach is essentially different from fine-tuning or further training, the computational cost should not be a concern. Our extensive experimental results indicate that, with identical settings, the proposed approach can boost the performance of well-known networks on both high-level computer vision problems, such as image classification, and low-level vision problems, such as image denoising.
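A schematic sketch of the two-round procedure; `train_fn` and `loss_fn` are placeholder callables, and using per-sample loss as the influence proxy plus a fixed `drop_frac` are assumptions for illustration.

```python
# Two-round training: score samples after round 1, retrain from scratch
# on the reduced set in round 2.
import numpy as np

def two_round_training(train_fn, loss_fn, X, y, drop_frac=0.05):
    model_1 = train_fn(X, y)                      # round 1: full training set
    scores = loss_fn(model_1, X, y)               # per-sample influence proxy
    keep = np.argsort(scores)[: int(len(X) * (1 - drop_frac))]
    model_2 = train_fn(X[keep], y[keep])          # round 2: from scratch
    return model_2
```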
ISBN: 9781538604571 (print)
Deep networks have shown impressive performance on many computer vision tasks. Recently, deep convolutional neural networks (CNNs) have been used to learn discriminative texture representations. One of the most successful approaches is the Bilinear CNN model, which explicitly captures the second-order statistics within deep features. However, these networks cut off the first-order information flow in the deep network and make gradient back-propagation difficult. We propose an effective fusion architecture, FASON, that combines the second-order and first-order information flows. Our method allows gradients to back-propagate through both flows freely and can be trained effectively. We then build a multi-level deep architecture to exploit the first- and second-order information within different convolutional layers. Experiments show that our method achieves improvements over state-of-the-art methods on several benchmark datasets.
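A rough sketch of fusing both information flows, assuming bilinear pooling with signed-square-root normalization for the second-order path and global average pooling for the first-order path, concatenated so gradients flow through both; the paper's exact fusion may differ.

```python
# First-order (pooled) and second-order (bilinear) statistics of one feature map.
import torch
import torch.nn as nn

class FirstSecondOrderFusion(nn.Module):
    def forward(self, feat):                     # feat: (B, C, H, W)
        B, C, H, W = feat.shape
        x = feat.reshape(B, C, H * W)
        second = torch.bmm(x, x.transpose(1, 2)) / (H * W)   # (B, C, C) bilinear
        second = second.flatten(1)
        second = torch.sign(second) * torch.sqrt(second.abs() + 1e-8)  # signed sqrt
        first = x.mean(dim=2)                    # (B, C) average-pooled first order
        return torch.cat([first, second], dim=1) # gradients flow through both paths
```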