Search Results - Inner Mongolia University Library

Search query: "Any field = Conference on Computer Vision and Pattern Recognition"

31,061 records in total; showing records 4251-4260

Scaled 360 layouts: Revisiting non-central panoramas

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Berenguel-Baeta, Bruno; Bermudez-Cameo, Jesus; Guerrero, Jose J.

Affiliation: Univ Zaragoza I3A Zaragoza Spain

ISBN: (print) 9781665448994

From a non-central panorama, 3D lines can be recovered by geometric reasoning. However, their sensitivity to noise and the complex geometric modeling required have led to these panoramas being very little investigated. In this work we present a novel approach for 3D layout recovery of indoor environments using single non-central panoramas. We obtain the boundaries of the structural lines of the room from a non-central panorama using deep learning and exploit the properties of non-central projection systems in a new geometrical processing to recover the scaled layout. We solve the problem for Manhattan environments, handling occlusions, and also for Atlanta environments in a unified method. The experiments performed improve the state-of-the-art methods for 3D layout recovery from a single panorama. Our approach is the first work using deep learning with non-central panoramas and recovering the scale of single panorama layouts.

Keywords: Deep learning; Computer vision; Three-dimensional displays; Sensitivity; Conferences; Layout; Neural networks

Virtual Fully-Connected Layer: Training a Large-Scale Face Recognition Dataset with Limited Computational Resources

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Li, Pengyu; Wang, Biao; Zhang, Lei

Affiliations: Alibaba Grp Artificial Intelligence Ctr DAMO Acad Hangzhou Peoples R China; Hong Kong Polytech Univ Dept Comp Hong Kong Peoples R China

ISBN: (print) 9781665445092

Recently, deep face recognition has achieved significant progress because of Convolutional Neural Networks (CNNs) and large-scale datasets. However, training CNNs on a large-scale face recognition dataset with limited computational resources is still a challenge. This is because the classification paradigm needs to train a fully-connected layer as the category classifier, and its parameters will be in the hundreds of millions if the training dataset contains millions of identities. This requires many computational resources, such as GPU memory. The metric learning paradigm is an economical computation method, but its performance is greatly inferior to that of the classification paradigm. To address this challenge, we propose a simple but effective CNN layer called the Virtual fully-connected (Virtual FC) layer to reduce the computational consumption of the classification paradigm. Without bells and whistles, the proposed Virtual FC reduces the parameters by more than 100 times with respect to the fully-connected layer and achieves competitive performance on mainstream face recognition evaluation datasets. Moreover, the performance of our Virtual FC layer on the evaluation datasets is superior to that of the metric learning paradigm by a significant margin. Our code will be released in hopes of disseminating our idea to other domains.

Keywords: Training; Measurement; Computer vision; Codes; Face recognition; Memory management; Graphics processing units

An Empirical Study of Scaling Law for Scene Text Recognition

Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Miao Rang; Zhenni Bi; Chuanjian Liu; Yunhe Wang; Kai Han

Affiliation: Huawei Noah's Ark Lab

ISBN: (digital) 9798350353006

ISBN: (print) 9798350353013

The laws of model size, data volume, computation and model performance have been extensively studied in the field of Natural Language Processing (NLP). However, the scaling laws in Scene Text Recognition (STR) have not yet been investigated. To address this, we conducted comprehensive studies that involved examining the correlations between performance and the scale of models, data volume and computation in the field of text recognition. Conclusively, the study demonstrates smooth power laws between performance and model size, as well as training data volume, when other influencing factors are held constant. Additionally, we have constructed a large-scale dataset called REBU-Syn, which comprises 6 M real samples and 18 M synthetic samples. Based on the disclosed scaling law and new dataset, we successfully trained a scene text recognition model, achieving a new state-of-the-art on 6 common test benchmarks with top-1 average accuracy of 97.42%. The models and dataset are publicly available at ***.

Keywords: Solid modeling; Computer vision; Correlation; Accuracy; Text recognition; Computational modeling; Training data

Towards Robust Classification Model by Counterfactual and Invariant Data Generation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Chang, Chun-Hao; Adam, George Alexandru; Goldenberg, Anna

Affiliation: Univ Toronto Hosp Sick Children Vector Inst Toronto ON Canada

ISBN: (print) 9781665445092

Despite the success of machine learning applications in science, industry, and society in general, many approaches are known to be non-robust, often relying on spurious correlations to make predictions. Spuriousness occurs when some features correlate with labels but are not causal; relying on such features prevents models from generalizing to unseen environments where such correlations break. In this work, we focus on image classification and propose two data generation processes to reduce spuriousness. Given human annotations of the subset of the features responsible (causal) for the labels (e.g. bounding boxes), we modify this causal set to generate a surrogate image that no longer has the same label (i.e. a counterfactual image). We also alter non-causal features to generate images still recognized as the original labels, which helps to learn a model invariant to these features. In several challenging datasets, our data generations outperform state-of-the-art methods in accuracy when spurious correlations break, and increase the saliency focus on causal features providing better explanations.

Keywords: Industries; Computer vision; Correlation; Image recognition; Annotations; Computational modeling; Machine learning

Capsule Network is Not More Robust than Convolutional Network

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Gu, Jindong; Tresp, Volker; Hu, Han

Affiliations: Univ Munich Munich Germany; Microsoft Res Asia Beijing Peoples R China

ISBN: (print) 9781665445092

The Capsule Network is widely believed to be more robust than Convolutional Networks. However, there are no comprehensive comparisons between these two networks, and it is also unknown which components in the CapsNet affect its robustness. In this paper, we first carefully examine the special designs in CapsNet that differ from those of a ConvNet commonly used for image classification. The examination reveals five major new/different components in CapsNet: a transformation process, a dynamic routing layer, a squashing function, a marginal loss other than cross-entropy loss, and an additional class-conditional reconstruction loss for regularization. Along with these major differences, we conduct comprehensive ablation studies on three kinds of robustness, including affine transformation, overlapping digits, and semantic representation. The study reveals that some designs, which are thought critical to CapsNet, actually can harm its robustness, i.e., the dynamic routing layer and the transformation process, while others are beneficial for the robustness. Based on these findings, we propose enhanced ConvNets simply by introducing the essential components behind the CapsNet's success. The proposed simple ConvNets can achieve better robustness than the CapsNet.

Keywords: Computer vision; Aggregates; Semantics; Routing; Robustness; Pattern recognition; Task analysis

Taming Transformers for High-Resolution Image Synthesis

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Esser, Patrick; Rombach, Robin; Ommer, Bjoern

Affiliation: Heidelberg Univ Heidelberg Collaboratory Image Proc IWR Heidelberg Germany

ISBN: (print) 9781665445092

Designed to learn long-range interactions on sequential data, transformers continue to show state-of-the-art results on a wide variety of tasks. In contrast to CNNs, they contain no inductive bias that prioritizes local interactions. This makes them expressive, but also computationally infeasible for long sequences, such as high-resolution images. We demonstrate how combining the effectiveness of the inductive bias of CNNs with the expressivity of transformers enables them to model and thereby synthesize high-resolution images. We show how to (i) use CNNs to learn a context-rich vocabulary of image constituents, and in turn (ii) utilize transformers to efficiently model their composition within high-resolution images. Our approach is readily applied to conditional synthesis tasks, where both non-spatial information, such as object classes, and spatial information, such as segmentations, can control the generated image. In particular, we present the first results on semantically-guided synthesis of megapixel images with transformers. Project page at https://***/JLlvY.

Keywords: Vocabulary; Image segmentation; Computer vision; Image synthesis; Computer architecture; Transformers; Rendering (computer graphics)

MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wang, Huiyu; Zhu, Yukun; Adam, Hartwig; Yuille, Alan; Chen, Liang-Chieh

Affiliations: Johns Hopkins Univ Baltimore MD 21218 USA; Google Res Mountain View CA USA; Google Mountain View CA 94043 USA

ISBN: (print) 9781665445092

We present MaX-DeepLab, the first end-to-end model for panoptic segmentation. Our approach simplifies the current pipeline that depends heavily on surrogate sub-tasks and hand-designed components, such as box detection, non-maximum suppression, thing-stuff merging, etc. Although these sub-tasks are tackled by area experts, they fail to comprehensively solve the target task. By contrast, our MaX-DeepLab directly predicts class-labeled masks with a mask transformer, and is trained with a panoptic quality inspired loss via bipartite matching. Our mask transformer employs a dual-path architecture that introduces a global memory path in addition to a CNN path, allowing direct communication with any CNN layers. As a result, MaX-DeepLab shows a significant 7.1% PQ gain in the box-free regime on the challenging COCO dataset, closing the gap between box-based and box-free methods for the first time. A small variant of MaX-DeepLab improves 3.0% PQ over DETR with similar parameters and M-Adds. Furthermore, MaX-DeepLab, without test time augmentation, achieves new state-of-the-art 51.3% PQ on COCO test-dev set.

Keywords: Computer vision; Merging; Pipelines; Computer architecture; Transformers; Pattern recognition; Task analysis

All You Can Embed: Natural Language based Vehicle Retrieval with Spatio-Temporal Transformers

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Scribano, Carmelo; Sapienza, Davide; Franchini, Giorgia; Verucchi, Micaela; Bertogna, Marko

Affiliations: Univ Modena & Reggio Emilia Modena Italy; Univ Ferrara Ferrara Italy; Univ Parma Parma Italy

ISBN: (print) 9781665448994

Combining Natural Language with vision represents a unique and interesting challenge in the domain of Artificial Intelligence. The AI City Challenge Track 5 for Natural Language-Based Vehicle Retrieval focuses on the problem of combining visual and textual information, applied to a smart-city use case. In this paper, we present All You Can Embed (AYCE), a modular solution to correlate single-vehicle tracking sequences with natural language. The main building blocks of the proposed architecture are (i) BERT to provide an embedding of the textual descriptions, (ii) a convolutional backbone along with a Transformer model to embed the visual information. For the training of the retrieval model, a variation of the Triplet Margin Loss is proposed to learn a distance measure between the visual and language embeddings. The code is publicly available at https://***/cscribano/AYCE_2021.

Keywords: Training; Visualization; Target tracking; Natural languages; Urban areas; Computer architecture; Loss measurement

Essentials for Class Incremental Learning

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Mittal, Sudhanshu; Galesso, Silvio; Brox, Thomas

Affiliation: Univ Freiburg Freiburg Germany

ISBN: (print) 9781665448994

Contemporary neural networks are limited in their ability to learn from evolving streams of training data. When trained sequentially on new or evolving tasks, their accuracy drops sharply, making them unsuitable for many real-world applications. In this work, we shed light on the causes of this well-known yet unsolved phenomenon, often referred to as catastrophic forgetting, in a class-incremental setup. We show that a combination of simple components and a loss that balances intra-task and inter-task learning can already resolve forgetting to the same extent as more complex measures proposed in literature. Moreover, we identify poor quality of the learned representation as another reason for catastrophic forgetting in class-IL. We show that performance is correlated with secondary class information (dark knowledge) learned by the model and it can be improved by an appropriate regularizer. With these lessons learned, class-incremental learning results on CIFAR-100 and ImageNet improve over the state-of-the-art by a large margin, while keeping the approach simple.

Keywords: Learning systems; Computer vision; Art; Conferences; Neural networks; Training data; Boosting

Compact and Effective Representations for Sketch-based Image Retrieval

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Torres, Pablo; Saavedra, Jose M.

Affiliations: Univ Chile DCC Av Beauchef 851 Santiago Chile; Impresee Inc 600 Calif St San Francisco CA USA

ISBN: (print) 9781665448994

Sketch-based image retrieval (SBIR) has attracted increasing interest in the computer vision community, bringing high impact in real applications. For instance, SBIR brings an increased benefit to eCommerce search engines because it allows users to formulate a query just by drawing what they need to buy. However, current methods showing high precision in retrieval work in a high dimensional space, which negatively affects aspects like memory consumption and processing time. Although some authors have also proposed compact representations, these drastically degrade the performance in a low dimension. Therefore, in this work, we present different results of evaluating methods for producing compact embeddings in the context of sketch-based image retrieval. Our main interest is in strategies aiming to keep the local structure of the original space. The recent unsupervised local-topology preserving dimension reduction method UMAP fits our requirements and shows outstanding performance, improving even the precision achieved by SOTA methods. We evaluate six methods in two different datasets. We use Flickr15K and eCommerce datasets; the latter is another contribution of this work. We show that UMAP allows us to have feature vectors of 16 bytes improving precision by more than 35%.

Keywords: Dimensionality reduction; Computer vision; Conferences; Image retrieval; Memory management; Search engines; Pattern recognition

Address: 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
