Search Results - Inner Mongolia University Library

Document type

11,745 conference papers
8 journal articles

Collection scope

11,753 electronic documents
0 print holdings

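Subject classification (record counts)

8,139 Engineering
- 7,674 Computer Science and Technology...
- 804 Mechanical Engineering
- 580 Software Engineering
- 376 Electrical Engineering
- 252 Control Science and Engineering
- 208 Optical Engineering
- 85 Bioengineering
- 83 Information and Communication Engineering
- 29 Biomedical Engineering (may confer...
- 23 Electronic Science and Technology (may confer...
- 21 Chemical Engineering and Technology
- 15 Transportation Engineering
- 14 Safety Science and Engineering
- 10 Cyberspace Security
- 8 Instrument Science and Technology
- 6 Materials Science and Engineering (may confer...
- 6 Power Engineering and Engineering Thermo...
3,194 Medicine
- 3,190 Clinical Medicine
- 11 Basic Medicine (may confer medical...
- 7 Public Health and Preventive Medi...
481 Science
- 216 Physics
- 203 Systems Science
- 88 Biology
- 55 Mathematics
- 29 Statistics (may confer science, ...
- 24 Chemistry
55 Management
- 29 Library, Information and Archives Manage...
- 28 Management Science and Engineering (may...
- 12 Business Administration
17 Law
- 15 Sociology
6 Agriculture
4 Education
2 Economics
1 Military Science
1 Art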

Topics (record counts)

5,434 computer vision
2,516 training
2,087 pattern recognit...
1,621 computational mo...
1,435 visualization
1,306 three-dimensiona...
1,060 semantics
981 codes
968 benchmark testin...
898 computer archite...
884 deep learning
762 task analysis
681 feature extracti...
536 face recognition
527 conferences
515 transformers
515 neural networks
479 object detection
466 image segmentati...
454 cameras

Institutions (record counts)

168 univ sci & techn...
144 univ chinese aca...
144 tsinghua univ pe...
143 carnegie mellon ...
135 chinese univ hon...
112 peng cheng lab p...
108 zhejiang univ pe...
97 swiss fed inst t...
92 tsinghua univers...
92 sensetime res pe...
88 shanghai ai lab ...
85 zhejiang univers...
84 shanghai jiao to...
78 peng cheng labor...
77 university of sc...
77 alibaba grp peop...
76 univ hong kong p...
76 tech univ munich...
76 stanford univ st...
73 university of ch...

Authors (record counts)

76 timofte radu
64 van gool luc
50 zhang lei
44 yang yi
40 loy chen change
34 tao dacheng
32 liu yang
32 chen chen
30 zhou jie
30 tian qi
30 sun jian
28 zha zheng-jun
27 qi tian
26 li xin
26 vasconcelos nuno
26 ying shan
25 liu xiaoming
25 luc van gool
25 boxin shi
24 zheng wei-shi

Language

11,746 English
7 Other

Search query: "any field = 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023"

11,753 records in total; records 4881-4890 are listed below.

Curriculum Graph Co-Teaching for Multi-Target Domain Adaptation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Roy, Subhankar; Krivosheev, Evgeny; Zhong, Zhun; Sebe, Nicu; Ricci, Elisa

Affiliations: Univ Trento, Trento, TN, Italy; Fdn Bruno Kessler, Povo, TN, Italy

ISBN (print): 9781665445092

In this paper we address multi-target domain adaptation (MTDA), where given one labeled source dataset and multiple unlabeled target datasets that differ in data distributions, the task is to learn a robust predictor for all the target domains. We identify two key aspects that can help to alleviate multiple domain-shifts in the MTDA: feature aggregation and curriculum learning. To this end, we propose Curriculum Graph Co-Teaching (CGCT) that uses a dual classifier head, with one of them being a graph convolutional network (GCN) which aggregates features from similar samples across the domains. To prevent the classifiers from over-fitting on their own noisy pseudo-labels we develop a co-teaching strategy with the dual classifier head that is assisted by curriculum learning to obtain more reliable pseudo-labels. Furthermore, when the domain labels are available, we propose Domain-aware Curriculum Learning (DCL), a sequential adaptation strategy that first adapts on the easier target domains, followed by the harder ones. We experimentally demonstrate the effectiveness of our proposed frameworks on several benchmarks and advance the state-of-the-art in the MTDA by large margins (e.g. +5.6% on the DomainNet).

Keywords: Deep learning; Computer vision; Computer network reliability; PROM; Collaboration; Pattern recognition; Reliability

What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Baek, Jeonghun; Matsui, Yusuke; Aizawa, Kiyoharu

Affiliations: Univ Tokyo, Tokyo, Japan

ISBN (print): 9781665445092

Scene text recognition (STR) task has a common practice: All state-of-the-art STR models are trained on large synthetic data. In contrast to this practice, training STR models only on fewer real labels (STR with fewer labels) is important when we have to train STR models without synthetic data: for handwritten or artistic texts that are difficult to generate synthetically and for languages other than English for which we do not always have synthetic data. However, there has been implicit common knowledge that training STR models on real data is nearly impossible because real data is insufficient. We consider that this common knowledge has obstructed the study of STR with fewer labels. In this work, we would like to reactivate STR with fewer labels by disproving the common knowledge. We consolidate recently accumulated public real data and show that we can train STR models satisfactorily only with real labeled data. Subsequently, we find simple data augmentation to fully exploit real data. Furthermore, we improve the models by collecting unlabeled data and introducing semi- and self-supervised methods. As a result, we obtain a competitive model to state-of-the-art methods. To the best of our knowledge, this is the first study that 1) shows sufficient performance by only using real labels and 2) introduces semi- and self-supervised methods into STR with fewer labels.

Keywords: Training; Computer vision; Codes; Text recognition; Data models; Task analysis

NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Srinivasan, Pratul P.; Deng, Boyang; Zhang, Xiuming; Tancik, Matthew; Mildenhall, Ben; Barron, Jonathan T.

Affiliations: Google Res, Mountain View, CA 94043, USA; MIT, Cambridge, MA 02139, USA; Univ Calif Berkeley, Berkeley, CA, USA

ISBN (print): 9781665445092

We present a method that takes as input a set of images of a scene illuminated by unconstrained known lighting, and produces as output a 3D representation that can be rendered from novel viewpoints under arbitrary lighting conditions. Our method represents the scene as a continuous volumetric function parameterized as MLPs whose inputs are a 3D location and whose outputs are the following scene properties at that input location: volume density, surface normal, material parameters, distance to the first surface intersection in any direction, and visibility of the external environment in any direction. Together, these allow us to render novel views of the object under arbitrary lighting, including indirect illumination effects. The predicted visibility and surface intersection fields are critical to our model's ability to simulate direct and indirect illumination during training, because the brute-force techniques used by prior work are intractable for lighting conditions outside of controlled setups with a single light. Our method outperforms alternative approaches for recovering relightable 3D scene representations, and performs well in complex lighting settings that have posed a significant challenge to prior work.

Keywords: Training; Reflectivity; Computer vision; Three-dimensional displays; Lighting; Predictive models; Rendering (computer graphics)

Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Oh, Youngmin; Kim, Beomjun; Ham, Bumsub

Affiliations: Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea

ISBN (print): 9781665445092

We address the problem of weakly-supervised semantic segmentation (WSSS) using bounding box annotations. Although object bounding boxes are good indicators to segment corresponding objects, they do not specify object boundaries, making it hard to train convolutional neural networks (CNNs) for semantic segmentation. We find that background regions are perceptually consistent in part within an image, and this can be leveraged to discriminate foreground and background regions inside object bounding boxes. To implement this idea, we propose a novel pooling method, dubbed background-aware pooling (BAP), that focuses more on aggregating foreground features inside the bounding boxes using attention maps. This allows us to extract high-quality pseudo segmentation labels to train CNNs for semantic segmentation, but the labels still contain noise, especially at object boundaries. To address this problem, we also introduce a noise-aware loss (NAL) that makes the networks less susceptible to incorrect labels. Experimental results demonstrate that learning with our pseudo labels already outperforms state-of-the-art weakly- and semi-supervised methods on the PASCAL VOC 2012 dataset, and the NAL further boosts the performance.

Keywords: Training; Image segmentation; Computer vision; Annotations; Computational modeling; Semantics; Feature extraction

Learning a Proposal Classifier for Multiple Object Tracking

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Dai, Peng; Weng, Renliang; Choi, Wongun; Zhang, Changshui; He, Zhangping; Ding, Wei

Affiliations: Tsinghua Univ, Beijing, Peoples R China; Aibee Inc, Beijing, Peoples R China

ISBN (print): 9781665445092

The recent trend in multiple object tracking (MOT) is heading towards leveraging deep learning to boost the tracking performance. However, it is not trivial to solve the data-association problem in an end-to-end fashion. In this paper, we propose a novel proposal-based learnable framework, which models MOT as a proposal generation, proposal scoring and trajectory inference paradigm on an affinity graph. This framework is similar to the two-stage object detector Faster RCNN, and can solve the MOT problem in a data-driven way. For proposal generation, we propose an iterative graph clustering method to reduce the computational cost while maintaining the quality of the generated proposals. For proposal scoring, we deploy a trainable graph-convolutional-network (GCN) to learn the structural patterns of the generated proposals and rank them according to the estimated quality scores. For trajectory inference, a simple deoverlapping strategy is adopted to generate tracking output while complying with the constraints that no detection can be assigned to more than one track. We experimentally demonstrate that the proposed method achieves a clear performance improvement in both MOTA and IDF1 with respect to previous state-of-the-art on two public benchmarks.

Keywords: Deep learning; Computer vision; Detectors; Market research; Trajectory; Computational efficiency; Proposals
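
The trajectory-inference constraint stated in the abstract above (no detection may belong to more than one track) can be illustrated with a minimal sketch. The abstract does not spell out the deoverlapping procedure, so the greedy rule used here (visit proposals from highest to lowest score and drop any proposal that shares a detection with one already kept) is an assumption made for illustration, and the name `deoverlap` and the `(score, detection_ids)` tuple layout are hypothetical.

```python
from typing import List, Set, Tuple

# A scored proposal: (quality score, ids of the detections it groups together).
Proposal = Tuple[float, Set[int]]


def deoverlap(proposals: List[Proposal]) -> List[Proposal]:
    """Greedily keep high-scoring proposals so that no detection id is reused.

    Assumption: a proposal that conflicts with an already kept one is
    discarded entirely; the paper's actual strategy may differ.
    """
    used: Set[int] = set()          # detection ids already assigned to a track
    tracks: List[Proposal] = []
    # Sort by score only, so the sets themselves are never compared.
    for score, detections in sorted(proposals, key=lambda p: p[0], reverse=True):
        if used.isdisjoint(detections):
            tracks.append((score, detections))
            used |= detections
    return tracks


if __name__ == "__main__":
    candidates = [(0.9, {1, 2, 3}), (0.8, {3, 4}), (0.7, {4, 5})]
    # The middle proposal shares detection 3 with the best one and is dropped.
    print(deoverlap(candidates))
```

A greedy pass keeps the sketch short; it enforces the one-track-per-detection constraint but makes no claim about optimality.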

Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images

Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Bowei Du; Yecheng Huang; Jiaxin Chen; Di Huang

Affiliations: State Key Laboratory of Software Development Environment, Beihang University, Beijing, China; School of Computer Science and Engineering, Beihang University, Beijing, China; Hangzhou Innovation Institute, Beihang University, Hangzhou, China

Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform. This paper investigates optimizing the detection head based on the sparse convolution, which proves effective in balancing the accuracy and efficiency. Nevertheless, it suffers from inadequate integration of contextual information of tiny objects as well as clumsy control of the mask ratio in the presence of foreground with varying scales. To address the issues above, we propose a novel global context-enhanced adaptive sparse convolutional network (CEASC). It first develops a context-enhanced group normalization (CE-GN) layer, by replacing the statistics based on sparsely sampled features with the global contextual ones, and then designs an adaptive multi-layer masking strategy to generate optimal mask ratios at distinct scales for compact foreground coverage, promoting both the accuracy and efficiency. Extensive experimental results on two major benchmarks, i.e. VisDrone and UAVDT, demonstrate that CEASC remarkably reduces the GFLOPs and accelerates the inference procedure when plugging into the typical state-of-the-art detection frameworks (e.g. RetinaNet and GFL V1) with competitive performance. Code is available at https://***/Cuogeihong/CEASC.

MTLSegFormer: Multi-task Learning with Transformers for Semantic Segmentation in Precision Agriculture

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023

Authors: Goncalves, Diogo Nunes; Marcato, Jose; Zamboni, Pedro; Pistori, Hemerson; Li, Jonathan; Nogueira, Keiller; Goncalves, Wesley Nunes

Affiliations: Federal University of Mato Grosso Do Sul, Faculty of Computer Science, Av. Costa e Silva, Campo Grande 79070-900, MS, Brazil; Federal University of Mato Grosso Do Sul, Faculty of Engineering, Architecture and Urbanism and Geography, Av. Costa e Silva, Campo Grande 79070-900, MS, Brazil; Dom Bosco Catholic University, INOVISAO, Avenida Tamandaré 6000, Campo Grande 79117-900, MS, Brazil; University of Waterloo, Department of Geography and Environmental Management, Waterloo, ON N2L 3G1, Canada; University of Stirling, Scotland, Stirling FK9 4LA, United Kingdom

ISBN (print): 9798350302493

Multi-task learning has proven to be effective in improving the performance of correlated tasks. Most of the existing methods use a backbone to extract initial features with independent branches for each task, and the exchange of information between the branches usually occurs through the concatenation or sum of the feature maps of the branches. However, this type of information exchange does not directly consider the local characteristics of the image nor the level of importance or correlation between the tasks. In this paper, we propose a semantic segmentation method, MTLSegFormer, which combines multi-task learning and attention mechanisms. After the backbone feature extraction, two feature maps are learned for each task. The first map is proposed to learn features related to its task, while the second map is obtained by applying learned visual attention to locally re-weigh the feature maps of the other tasks. In this way, weights are assigned to local regions of the image of other tasks that have greater importance for the specific task. Finally, the two maps are combined and used to solve a task. We tested the performance in two challenging problems with correlated tasks and observed a significant improvement in accuracy, mainly in tasks with high dependence on the others. © 2023 IEEE.

Keywords: Semantics

FESTA: Flow Estimation via Spatial-Temporal Attention for Scene Point Clouds

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wang, Haiyan; Pang, Jiahao; Lodhi, Muhammad A.; Tian, Yingli; Tian, Dong

Affiliations: InterDigital, Wilmington, DE 19809, USA; CUNY City Coll, New York, NY 10031, USA

ISBN (print): 9781665445092

Scene flow depicts the dynamics of a 3D scene, which is critical for various applications such as autonomous driving, robot navigation, AR/VR, etc. Conventionally, scene flow is estimated from dense/regular RGB video frames. With the development of depth-sensing technologies, precise 3D measurements are available via point clouds which have sparked new research in 3D scene flow. Nevertheless, it remains challenging to extract scene flow from point clouds due to the sparsity and irregularity in typical point cloud sampling patterns. One major issue related to irregular sampling is identified as the randomness during point set abstraction/feature extraction, an elementary process in many flow estimation scenarios. A novel Spatial Abstraction with Attention (SA²) layer is accordingly proposed to alleviate the unstable abstraction problem. Moreover, a Temporal Abstraction with Attention (TA²) layer is proposed to rectify attention in temporal domain, leading to benefits with motions scaled in a larger range. Extensive analysis and experiments verified the motivation and significant performance gains of our method, dubbed as Flow Estimation via Spatial-Temporal Attention (FESTA), when compared to several state-of-the-art benchmarks of scene flow estimation.

Keywords: Computer vision; Three-dimensional displays; Image recognition; Image coding; Navigation; Computational modeling; Estimation

Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Poppi, Samuele; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita

Affiliations: Univ Modena & Reggio Emilia, Modena, Italy

ISBN (print): 9781665448994

As the request for deep learning solutions increases, the need for explainability is even more fundamental. In this setting, particular attention has been given to visualization techniques that try to attribute the right relevance to each input pixel with respect to the output of the network. In this paper, we focus on Class Activation Mapping (CAM) approaches, which provide an effective visualization by taking weighted averages of the activation maps. To enhance the evaluation and the reproducibility of such approaches, we propose a novel set of metrics to quantify explanation maps, which show better effectiveness and simplify comparisons between approaches. To evaluate the appropriateness of the proposal, we compare different CAM-based visualization methods on the entire ImageNet validation set, fostering proper comparisons and reproducibility.

Keywords: Deep learning; Visualization; Computer vision; Protocols; Conferences; Reproducibility of results; Pattern recognition
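
The abstract above describes CAM approaches as weighting and combining activation maps into a single explanation map. A minimal pure-Python sketch of that combination step follows. It illustrates the general idea only and is not the metric proposed in the paper: the function name `class_activation_map` is hypothetical, and the final ReLU and rescaling to [0, 1] are common conventions assumed here, not details given in the abstract.

```python
from typing import List

Matrix = List[List[float]]  # one H x W activation map


def class_activation_map(activations: List[Matrix], weights: List[float]) -> Matrix:
    """Combine K activation maps into one explanation map.

    Each map is multiplied by its weight and the results are summed
    pixel by pixel. Negative evidence is then clipped (ReLU) and the map is
    rescaled so its maximum is 1; both steps are assumptions of this sketch.
    """
    if len(activations) != len(weights):
        raise ValueError("need exactly one weight per activation map")
    height, width = len(activations[0]), len(activations[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for amap, weight in zip(activations, weights):
        for i in range(height):
            for j in range(width):
                cam[i][j] += weight * amap[i][j]
    cam = [[max(value, 0.0) for value in row] for row in cam]   # ReLU
    peak = max(max(row) for row in cam)
    if peak > 0:                                                # avoid dividing by zero
        cam = [[value / peak for value in row] for row in cam]
    return cam


if __name__ == "__main__":
    maps = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 2.0], [2.0, 0.0]]]
    print(class_activation_map(maps, [1.0, -1.0]))
```

In practice the weights come from the network (for example, classifier weights or pooled gradients, depending on the CAM variant); here they are passed in directly so the sketch stays self-contained.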

A Dual Iterative Refinement Method for Non-rigid Shape Matching

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Xiang, Rui; Lai, Rongjie; Zhao, Hongkai

Affiliations: UC, Dept Math, Irvine, CA 92697, USA; Rensselaer Polytech Inst, Dept Math, Troy, NY 12181, USA; Duke Univ, Dept Math, Durham, NC 27706, USA

ISBN (print): 9781665445092

In this work, a robust and efficient dual iterative refinement (DIR) method is proposed for dense correspondence between two nearly isometric shapes. The key idea is to use dual information, such as spatial and spectral, or local and global features, in a complementary and effective way, and extract more accurate information from current iteration to use for the next iteration. In each DIR iteration, starting from current correspondence, a zoom-in process at each point is used to select well matched anchor pairs by a local mapping distortion criterion. These selected anchor pairs are then used to align spectral features (or other appropriate global features) whose dimension adaptively matches the capacity of the selected anchor pairs. Thanks to the effective combination of complementary information in a data-adaptive way, DIR is not only efficient but also robust to render accurate results within a few iterations. By choosing appropriate dual features, DIR has the flexibility to handle patch and partial matching as well. Our comprehensive experiments on various data sets demonstrate the superiority of DIR over other state-of-the-art methods in terms of both accuracy and efficiency.

Keywords: Computer vision; Image analysis; Shape; Stability criteria; Feature extraction; Distortion; Iterative methods


Address: 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region; postal code 010021
