ISBN (Print): 9781665445092
Cross-modal retrieval aims to learn discriminative and modal-invariant features for data from different modalities. Unlike existing methods, which usually learn from features extracted by offline networks, in this paper we propose an approach to jointly train the components of the cross-modal retrieval framework with metadata, enabling the network to find optimal features. The proposed end-to-end framework is updated with three loss functions: 1) a novel cross-modal center loss to eliminate the cross-modal discrepancy, 2) a cross-entropy loss to maximize inter-class variations, and 3) a mean-square-error loss to reduce modality variations. In particular, the proposed cross-modal center loss minimizes the distances of features from objects belonging to the same class across all modalities. Extensive experiments have been conducted on retrieval tasks across multiple modalities, including 2D image, 3D point cloud, and mesh data. The proposed framework significantly outperforms state-of-the-art methods for both cross-modal and in-domain retrieval of 3D objects on the ModelNet10 and ModelNet40 datasets.
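A minimal sketch of the cross-modal center loss idea in PyTorch, under assumptions not stated in the abstract (class/parameter names, feature dimension, and the exact distance weighting are illustrative, not the released implementation): one learnable center is kept per class and shared by every modality, and features from all modalities are pulled toward the center of their class.

```python
import torch
import torch.nn as nn


class CrossModalCenterLoss(nn.Module):
    def __init__(self, num_classes: int, feat_dim: int):
        super().__init__()
        # One shared center per class, used by all modalities.
        self.centers = nn.Parameter(torch.randn(num_classes, feat_dim))

    def forward(self, feats_per_modality, labels):
        """feats_per_modality: list of (B, feat_dim) tensors, one per modality.
        labels: (B,) class indices shared across the modalities."""
        loss = 0.0
        for feats in feats_per_modality:
            centers = self.centers[labels]                      # (B, feat_dim)
            loss = loss + ((feats - centers) ** 2).sum(dim=1).mean()
        return loss / len(feats_per_modality)


if __name__ == "__main__":
    criterion = CrossModalCenterLoss(num_classes=40, feat_dim=256)
    labels = torch.randint(0, 40, (8,))
    img_feat, pc_feat, mesh_feat = (torch.randn(8, 256) for _ in range(3))
    print(criterion([img_feat, pc_feat, mesh_feat], labels).item())
```

In practice this term would be combined with the cross-entropy and mean-square-error losses mentioned above, with the centers updated jointly with the network parameters.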
ISBN (Print): 9781665445092
Anomaly localization, which aims to segment the anomalous regions within images, is challenging due to the large variety of anomaly types. Existing methods typically train deep models by treating the entire image as a whole, yet put little effort into learning the local distribution, which is vital for this pixel-precise task. In this work, we propose an unsupervised patch-based approach that gives due consideration to both global and local information. More concretely, we employ a Local-Net and a Global-Net to extract features from an individual patch and its surroundings, respectively. The Global-Net is trained to mimic the local feature, so that an abnormal patch can easily be detected when its feature mismatches that from the context. We further introduce an Inconsistency Anomaly Detection (IAD) head and a Distortion Anomaly Detection (DAD) head to fully capture the discrepancy between global and local features. A scoring function derived from the multi-head design facilitates high-precision anomaly localization. Extensive experiments on a couple of real-world datasets suggest that our approach outperforms state-of-the-art competitors by a large margin.
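A minimal sketch of the global/local mismatch idea, under assumed shapes and hypothetical network definitions (the paper's Local-Net, Global-Net, IAD and DAD heads are not reproduced): a local encoder sees only the patch, a global encoder sees the surrounding window with the patch masked out, and the patch is scored by how much the two features disagree.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def make_encoder(out_dim: int = 128) -> nn.Module:
    # Tiny stand-in CNN; the actual Local-Net / Global-Net architectures are assumptions here.
    return nn.Sequential(
        nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
        nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
        nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, out_dim),
    )


@torch.no_grad()
def patch_anomaly_score(local_net, global_net, context, patch_box):
    """context: (1, 3, H, W) window around the patch; patch_box: (y0, y1, x0, x1)."""
    y0, y1, x0, x1 = patch_box
    patch = context[:, :, y0:y1, x0:x1]
    masked = context.clone()
    masked[:, :, y0:y1, x0:x1] = 0.0            # hide the patch from the global encoder
    f_local = F.normalize(local_net(patch), dim=1)
    f_global = F.normalize(global_net(masked), dim=1)
    return (1.0 - (f_local * f_global).sum(dim=1)).item()  # 1 - cosine similarity


if __name__ == "__main__":
    local_net, global_net = make_encoder(), make_encoder()
    window = torch.randn(1, 3, 96, 96)
    print(patch_anomaly_score(local_net, global_net, window, (32, 64, 32, 64)))
```

Training the global encoder to mimic the local one on normal data (e.g., with an L2 or cosine objective) is what makes the score small on normal patches and large on anomalies.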
ISBN (Print): 9781665448994
This paper proposes an attention-based multi-level model with a multi-scale backbone for thermal image super-resolution. The thermal image dataset is provided by PBVS 2020 in its thermal image super-resolution challenge and contains images at three resolution scales (low, medium, high) [1]. However, only the medium- and high-resolution images are used to train the proposed architecture to generate super-resolution images at x2 and x4 scales. The proposed architecture uses Res2Net blocks as the backbone of the network. Along with this, a coordinate convolution layer and dual attention are also used in the architecture. Further, multi-level supervision is applied during training to enforce similarity between the output of each block and the real image. To test the robustness of the proposed model, we evaluate it on the Thermal-6 dataset [20]. The results show that our model achieves state-of-the-art results on the PBVS dataset, and the results on the Thermal-6 dataset show that the model has decent generalization capacity.
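A minimal sketch of a coordinate convolution layer of the kind the abstract mentions (CoordConv-style), written as a generic PyTorch module; the exact layer used in the paper may differ. Normalized x/y coordinate channels are concatenated to the input before a standard convolution, letting the filter condition on spatial position.

```python
import torch
import torch.nn as nn


class CoordConv2d(nn.Module):
    def __init__(self, in_ch: int, out_ch: int, kernel_size: int = 3, padding: int = 1):
        super().__init__()
        # Two extra channels carry the (x, y) coordinate grids.
        self.conv = nn.Conv2d(in_ch + 2, out_ch, kernel_size, padding=padding)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, _, h, w = x.shape
        ys = torch.linspace(-1, 1, h, device=x.device).view(1, 1, h, 1).expand(b, 1, h, w)
        xs = torch.linspace(-1, 1, w, device=x.device).view(1, 1, 1, w).expand(b, 1, h, w)
        return self.conv(torch.cat([x, xs, ys], dim=1))


if __name__ == "__main__":
    layer = CoordConv2d(64, 64)
    print(layer(torch.randn(2, 64, 32, 32)).shape)  # torch.Size([2, 64, 32, 32])
```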
3D object detection is an essential perception task in autonomous driving for understanding the environment. Bird's-Eye-View (BEV) representations have significantly improved the performance of 3D detectors with ...
ISBN (Print): 9781665445092
We show that explicit modeling of composition rules benefits image cropping. Image cropping is considered a promising way to automate aesthetic composition in professional photography. Existing efforts, however, only model such professional knowledge implicitly, e.g., by ranking comparative candidates. Inspired by the observation that natural composition traits always follow a specific rule, we propose to learn such rules in a discriminative manner and, more importantly, to incorporate the learned composition clues explicitly into the model. To this end, we introduce the concept of the key composition map (KCM) to encode the composition rules. The KCM can reveal the common laws hidden behind different composition rules and can inform the cropping model of what is important in composition. With the KCM, we present a novel cropping-by-composition paradigm and instantiate a network to implement composition-aware image cropping. Extensive experiments on two benchmarks justify that our approach enables effective, interpretable, and fast image cropping.
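A minimal sketch of one way an explicit key composition map could drive cropping: the map assigns compositional importance to spatial locations, and each candidate crop is scored by the share of KCM mass it retains. This is an illustrative assumption about how such a map might be consumed, not the paper's actual scoring function or network.

```python
import torch


def score_crops(kcm: torch.Tensor, crops) -> list:
    """kcm: (H, W) non-negative importance map; crops: list of (y0, y1, x0, x1)."""
    total = kcm.sum().clamp(min=1e-8)
    return [(kcm[y0:y1, x0:x1].sum() / total).item() for y0, y1, x0, x1 in crops]


if __name__ == "__main__":
    kcm = torch.rand(240, 320)                 # stand-in for a predicted KCM
    candidates = [(0, 180, 0, 240), (30, 210, 40, 280), (60, 240, 80, 320)]
    scores = score_crops(kcm, candidates)
    best = candidates[max(range(len(scores)), key=scores.__getitem__)]
    print(scores, best)
```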
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
We aim to provide a comprehensive view of the inference efficiency of DETR-style detection models. We explore the effect of basic efficiency techniques and identify the factors that are easy to implement yet effectively improve the efficiency-accuracy trade-off. Specifically, we investigate the effect of input resolution, multi-scale feature enhancement, and backbone pre-training. Our experiments support that 1) adjusting the input resolution is a simple yet effective way to achieve a better efficiency-accuracy trade-off, 2) multi-scale feature enhancement can be lightened with only a marginal decrease in accuracy, and 3) improved backbone pre-training can further improve the trade-off.
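A minimal sketch of the kind of measurement behind the resolution study: time a detector's forward pass at several input resolutions. The model below is a toy placeholder, not a DETR variant, and accuracy would come from a separate COCO evaluation that is omitted here.

```python
import time
import torch
import torch.nn as nn


def measure_latency(model: nn.Module, size: int, iters: int = 20) -> float:
    model.eval()
    x = torch.randn(1, 3, size, size)
    with torch.no_grad():
        for _ in range(3):                      # warm-up runs
            model(x)
        start = time.perf_counter()
        for _ in range(iters):
            model(x)
    return (time.perf_counter() - start) / iters * 1000.0  # ms per image


if __name__ == "__main__":
    # Placeholder stand-in for a detector's backbone + head.
    model = nn.Sequential(nn.Conv2d(3, 64, 7, stride=4), nn.ReLU(),
                          nn.Conv2d(64, 128, 3, stride=2), nn.ReLU())
    for size in (480, 640, 800):
        print(f"input {size}x{size}: {measure_latency(model, size):.1f} ms")
```

Pairing such latency numbers with per-resolution AP gives the efficiency-accuracy curve the abstract refers to.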
ISBN (Print): 9781665445092
Unsupervised representation learning with contrastive learning has achieved great success. This line of methods duplicates each training batch to construct contrastive pairs, so that each training batch and its augmented version are forwarded simultaneously, leading to additional computation. In this paper, we propose a new jigsaw clustering pretext task, which only needs to forward each training batch itself and thus reduces the training cost. Our method makes use of information from both intra- and inter-image sources, and outperforms previous single-batch-based methods by a large margin. It is even comparable to contrastive learning methods when only half of the training batches are used. Our method indicates that multiple batches during training are not necessary, and opens the door for future research on single-batch unsupervised methods. Our models trained on the ImageNet dataset achieve state-of-the-art results with linear classification, outperforming previous single-batch methods by 2.6%. Models transferred to the COCO dataset outperform MoCo v2 by 0.4% with only half of the training batches. Our pretrained models outperform supervised ImageNet-pretrained models on the CIFAR-10 and CIFAR-100 datasets by 0.9% and 4.1%, respectively.
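A minimal sketch of the single-batch jigsaw construction the abstract describes, under an assumed 2x2 patching: every image in the batch is cut into patches, the patches are permuted across the whole batch, and the source-image index and original cell of each patch are kept as targets for the clustering and location-prediction objectives. Montage assembly and the two prediction heads are omitted.

```python
import torch


def build_jigsaw_batch(images: torch.Tensor, grid: int = 2):
    """images: (B, C, H, W) with H and W divisible by grid."""
    b, c, h, w = images.shape
    ph, pw = h // grid, w // grid
    # Cut into (B*grid*grid, C, ph, pw) patches and record where each came from.
    patches = images.unfold(2, ph, ph).unfold(3, pw, pw)
    patches = patches.permute(0, 2, 3, 1, 4, 5).reshape(-1, c, ph, pw)
    src_img = torch.arange(b).repeat_interleave(grid * grid)   # clustering target
    cell = torch.arange(grid * grid).repeat(b)                 # location target
    perm = torch.randperm(patches.size(0))
    return patches[perm], src_img[perm], cell[perm]


if __name__ == "__main__":
    imgs = torch.randn(4, 3, 224, 224)
    shuffled, src, loc = build_jigsaw_batch(imgs)
    print(shuffled.shape, src[:8].tolist(), loc[:8].tolist())
```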
ISBN (Print): 9781665445092
Recently, deep face recognition has achieved significant progress because of Convolutional Neural Networks (CNNs) and large-scale datasets. However, training CNNs on a large-scale face recognition dataset with limited computational resources is still a challenge. This is because the classification paradigm needs to train a fully-connected layer as the category classifier, and its parameters will be in the hundreds of millions if the training dataset contains millions of identities. This requires many computational resources, such as GPU memory. The metric learning paradigm is an economical computation method, but its performance is greatly inferior to that of the classification paradigm. To address this challenge, we propose a simple but effective CNN layer called the Virtual fully-connected (Virtual FC) layer to reduce the computational consumption of the classification paradigm. Without bells and whistles, the proposed Virtual FC reduces the parameters by more than 100 times with respect to the fully-connected layer and achieves competitive performance on mainstream face recognition evaluation datasets. Moreover, the performance of our Virtual FC layer on the evaluation datasets is superior to that of the metric learning paradigm by a significant margin. Our code will be released in hopes of disseminating our idea to other domains.
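A minimal sketch of the parameter-reduction idea behind such a layer: the N-identity classifier is replaced by a much smaller weight matrix whose columns are shared by groups of identities, so the classifier's parameters shrink by roughly the group size. The dynamic regrouping and corrective steps of the actual Virtual FC layer are not reproduced; the grouping scheme, sizes, and names below are illustrative assumptions.

```python
import torch
import torch.nn as nn


class GroupSharedClassifier(nn.Module):
    def __init__(self, feat_dim: int, num_ids: int, group_size: int):
        super().__init__()
        num_groups = (num_ids + group_size - 1) // group_size
        self.weight = nn.Parameter(torch.randn(num_groups, feat_dim) * 0.01)
        # Fixed identity-to-group mapping; the real method updates this online.
        self.register_buffer("id_to_group", torch.arange(num_ids) // group_size)

    def forward(self, feats: torch.Tensor, id_labels: torch.Tensor):
        logits = feats @ self.weight.t()              # (B, num_groups)
        return logits, self.id_to_group[id_labels]    # group-level targets


if __name__ == "__main__":
    full_fc_params = 1_000_000 * 512                  # 1M identities, 512-d features
    clf = GroupSharedClassifier(feat_dim=512, num_ids=1_000_000, group_size=100)
    print(full_fc_params / clf.weight.numel())        # ~100x fewer parameters
    logits, targets = clf(torch.randn(8, 512), torch.randint(0, 1_000_000, (8,)))
    print(logits.shape, targets.shape)
```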
ISBN (Print): 9781665445092
Person search aims to simultaneously localize and identify a query person from realistic, uncropped images, and can be regarded as the unified task of pedestrian detection and person re-identification (re-id). Most existing works employ two-stage detectors like Faster-RCNN, yielding encouraging accuracy but with high computational overhead. In this work, we present the Feature-Aligned Person Search Network (AlignPS), the first anchor-free framework to efficiently tackle this challenging task. AlignPS explicitly addresses the major challenges, which we summarize as misalignment issues at different levels (i.e., scale, region, and task), when accommodating an anchor-free detector for this task. More specifically, we propose an aligned feature aggregation module to generate more discriminative and robust feature embeddings by following a "re-id first" principle. Such a simple design directly improves the baseline anchor-free model on CUHK-SYSU by more than 20% in mAP. Moreover, AlignPS outperforms state-of-the-art two-stage methods at a higher speed. The code is available at https://***/daodaofr/AlignPS.
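A minimal sketch of an anchor-free person-search head in the spirit of the abstract: a single feature map feeds both a detection branch and a per-location re-id embedding branch, so each detected person directly yields an embedding. The aligned feature aggregation module and the "re-id first" training recipe are not reproduced here; the shapes and layer choices are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class AnchorFreePersonSearchHead(nn.Module):
    def __init__(self, in_ch: int = 256, emb_dim: int = 128):
        super().__init__()
        self.cls = nn.Conv2d(in_ch, 1, 3, padding=1)      # person / background score
        self.box = nn.Conv2d(in_ch, 4, 3, padding=1)      # l, t, r, b distances
        self.embed = nn.Conv2d(in_ch, emb_dim, 3, padding=1)

    def forward(self, feat: torch.Tensor):
        scores = self.cls(feat).sigmoid()                  # (B, 1, H, W)
        boxes = F.relu(self.box(feat))                     # (B, 4, H, W)
        embeddings = F.normalize(self.embed(feat), dim=1)  # (B, D, H, W)
        return scores, boxes, embeddings


if __name__ == "__main__":
    head = AnchorFreePersonSearchHead()
    s, b, e = head(torch.randn(1, 256, 50, 88))
    print(s.shape, b.shape, e.shape)
```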
The task of image caption generation aims to automatically produce natural language descriptions that match the content of images, integrating the fields of machine vision and natural language processing, which holds ...