ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
This paper describes the design, outcomes, and top methods of the 2nd annual Multi-modal Aerial View Image Challenge (MAVIC) aimed at cross-modality aerial image translation. The primary objective of this competition is to stimulate research efforts towards the development of models capable of translating co-aligned images between multiple modalities. Specifically, the challenge centers on translation between synthetic aperture radar (SAR), electro-optical (EO), camera (RGB), and infrared (IR) sensor modalities, a budding area of research that has begun to garner attention. While last year’s inaugural challenge demonstrated the feasibility of SAR→EO translation, this year’s challenge made significant improvements in dataset coverage, sensor variation, experimental design, and methods, covering the tasks of SAR→EO, SAR→RGB, SAR→IR, and RGB→IR translation. By introducing a new dataset called Multi-modal Aerial Gathered Image Composites (MAGIC), multimodal image translation is made available for different comparisons. With a more rigorous set of translation performance metrics, winners were determined from an aggregation of L1-norm, LPIPS (Learned Perceptual Image Patch Similarity), and FID (Fréchet Inception Distance) scores. The winning methods included pix2pixHD and the LPIPS metric as loss functions, with aggregated scores roughly 5% better, and were separated by their SAR→EO and RGB→IR translation scores.
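As a rough illustration of how an aggregated ranking over several image-quality metrics might be computed, the sketch below combines precomputed L1, LPIPS, and FID values into one score. The min-max normalization and equal weighting are assumptions for demonstration only, not the challenge's official scoring formula.

```python
import numpy as np

def aggregate_scores(l1, lpips, fid, weights=(1.0, 1.0, 1.0)):
    """Combine per-submission metric values into one ranking score.

    l1, lpips, fid: 1-D arrays, one entry per submission (lower is better).
    The min-max normalization and equal weights are illustrative assumptions.
    """
    normed = []
    for m in (l1, lpips, fid):
        m = np.asarray(m, dtype=float)
        span = m.max() - m.min()
        normed.append((m - m.min()) / span if span > 0 else np.zeros_like(m))
    # Weighted sum of normalized metrics; a lower aggregated score is better.
    return sum(w * n for w, n in zip(weights, normed))

# Toy example: three hypothetical submissions.
scores = aggregate_scores(l1=[0.12, 0.10, 0.15],
                          lpips=[0.30, 0.28, 0.35],
                          fid=[45.0, 40.0, 60.0])
print(np.argsort(scores))  # ranking, best submission first
```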
ISBN (Print): 9781665445092
Integrated Gradients (IG) [29] is a commonly used feature attribution method for deep neural networks. While IG has many desirable properties, the method often produces spurious/noisy pixel attributions in regions that are not related to the predicted class when applied to visual models. While this has been previously noted [27], most existing solutions [25, 17] are aimed at addressing the symptoms by explicitly reducing the noise in the resulting attributions. In this work, we show that one of the causes of the problem is the accumulation of noise along the IG path. To minimize the effect of this source of noise, we propose adapting the attribution path itself - conditioning the path not just on the image but also on the model being explained. We introduce Adaptive Path Methods (APMs) as a generalization of path methods, and Guided IG as a specific instance of an APM. Empirically, Guided IG creates saliency maps better aligned with the model's prediction and the input image that is being explained. We show through qualitative and quantitative experiments that Guided IG outperforms other, related methods in nearly every experiment.
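For context, standard Integrated Gradients accumulates gradients along a straight-line path from a baseline to the input. The sketch below is a minimal PyTorch implementation of that vanilla IG path integral (not the adaptive Guided IG path proposed here); the zero baseline and step count are assumed defaults.

```python
import torch

def integrated_gradients(model, x, target, baseline=None, steps=50):
    """Vanilla Integrated Gradients via a Riemann-sum approximation.

    model:    callable mapping a batch of inputs to class logits
    x:        input tensor of shape (1, C, H, W)
    target:   index of the class being explained
    baseline: reference input (defaults to all zeros)
    """
    if baseline is None:
        baseline = torch.zeros_like(x)
    total_grads = torch.zeros_like(x)
    for i in range(1, steps + 1):
        alpha = i / steps
        # Point on the straight-line path between baseline and input.
        point = (baseline + alpha * (x - baseline)).detach().requires_grad_(True)
        logit = model(point)[0, target]
        grad, = torch.autograd.grad(logit, point)
        total_grads += grad
    # Average path gradient, scaled by the input-baseline difference.
    return (x - baseline) * total_grads / steps
```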
ISBN (Print): 9781665445092
In this paper, we tackle the problem of dynamic scene deblurring. Most existing deep end-to-end learning approaches adopt the same generic model for all unseen test images. These solutions are sub-optimal, as they fail to utilize the internal information within a specific image. On the other hand, a self-supervised approach, SelfDeblur, enables internal training within a test image from scratch, but it does not fully take advantage of large external datasets. In this work, we propose a novel self-supervised meta-auxiliary learning approach to improve the performance of deblurring by integrating both external and internal learning. Concretely, we build a self-supervised auxiliary reconstruction task that shares a portion of the network with the primary deblurring task. The two tasks are jointly trained on an external dataset. Furthermore, we propose a meta-auxiliary training scheme to further optimize the pretrained model as a base learner, which is applicable for fast adaptation at test time. During training, the performance of both tasks is coupled. Therefore, we are able to exploit the internal information at test time via the auxiliary task to enhance the performance of deblurring. Extensive experimental results across evaluation datasets demonstrate the effectiveness of the test-time adaptation of the proposed method.
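The test-time adaptation idea can be illustrated with a short, generic sketch: a pretrained model with a shared encoder and two heads is fine-tuned for a few steps on the self-supervised reconstruction loss of the single test image before producing the deblurred output. The two-output model interface, step count, and learning rate below are illustrative assumptions, not the paper's exact meta-auxiliary scheme.

```python
import copy
import torch
import torch.nn.functional as F

def test_time_adapt(model, blurry, steps=5, lr=1e-5):
    """Adapt a pretrained two-head model on a single blurry test image.

    `model(x)` is assumed to return a (deblurred, reconstruction) pair from
    a shared encoder; this interface is illustrative, not the paper's API.
    """
    adapted = copy.deepcopy(model)          # keep the base learner intact
    opt = torch.optim.Adam(adapted.parameters(), lr=lr)
    for _ in range(steps):
        _, recon = adapted(blurry)
        loss = F.l1_loss(recon, blurry)     # self-supervised auxiliary loss
        opt.zero_grad()
        loss.backward()
        opt.step()
    with torch.no_grad():
        deblurred, _ = adapted(blurry)
    return deblurred
```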
ISBN (Print): 9781665445092
Capturing challenging human motions is critical for numerous applications, but it suffers from complex motion patterns and severe self-occlusion under the monocular setting. In this paper, we propose ChallenCap - a template-based approach to capture challenging 3D human motions using a single RGB camera in a novel learning-and-optimization framework, with the aid of multi-modal references. We propose a hybrid motion inference stage with a generation network, which utilizes a temporal encoder-decoder to extract the motion details from the pair-wise sparse-view reference, as well as a motion discriminator to utilize the unpaired marker-based references to extract specific challenging motion characteristics in a data-driven manner. We further adopt a robust motion optimization stage to increase the tracking accuracy, by jointly utilizing the learned motion details from the supervised multi-modal references as well as the reliable motion hints from the input image reference. Extensive experiments on our new challenging motion dataset demonstrate the effectiveness and robustness of our approach to capture challenging human motions.
ISBN (Print): 9781665445092
Recent advances in label assignment in object detection mainly seek to independently define positive/negative training samples for each ground-truth (gt) object. In this paper, we innovatively revisit the label assignment from a global perspective and propose to formulate the assigning procedure as an Optimal Transport (OT) problem - a well-studied topic in Optimization Theory. Concretely, we define the unit transportation cost between each demander (anchor) and supplier (gt) pair as the weighted summation of their classification and regression losses. After formulation, finding the best assignment solution is converted to solve the optimal transport plan at minimal transportation costs, which can be solved via Sinkhorn-Knopp Iteration. On COCO, a single FCOS-ResNet-50 detector equipped with Optimal Transport Assignment (OTA) can reach 40.7% mAP under 1x scheduler, outperforming all other existing assigning methods. Extensive experiments conducted on COCO and CrowdHuman further validate the effectiveness of our proposed OTA, especially its superiority in crowd scenarios.
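To make the transport formulation concrete, the sketch below runs a generic entropy-regularized Sinkhorn-Knopp iteration on a cost matrix built from per-pair losses. The regularization strength, iteration count, and toy supplies are illustrative assumptions, not OTA's exact settings.

```python
import numpy as np

def sinkhorn_knopp(cost, supply, demand, eps=0.1, iters=50):
    """Entropy-regularized optimal transport via Sinkhorn-Knopp iteration.

    cost:   (m, n) matrix, e.g. cls_loss + reg_loss for each (gt, anchor) pair
    supply: length-m vector of units each gt (supplier) provides
    demand: length-n vector of units each anchor (demander) requests
    Returns the (m, n) transport plan.
    """
    K = np.exp(-cost / eps)                 # Gibbs kernel
    u = np.ones(len(supply))
    v = np.ones(len(demand))
    for _ in range(iters):
        u = supply / (K @ v)                # row scaling
        v = demand / (K.T @ u)              # column scaling
    return u[:, None] * K * v[None, :]

# Toy example: 2 gt objects, 5 anchors, each anchor demanding one unit.
cost = np.random.rand(2, 5)
plan = sinkhorn_knopp(cost, supply=np.array([3.0, 2.0]), demand=np.ones(5))
print(plan.argmax(axis=0))  # assigned gt index per anchor
```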
ISBN (Print): 9781665445092
Recent advances in 3D semantic scene understanding have shown impressive progress in 3D instance segmentation, enabling object-level reasoning about 3D scenes; however, a finer-grained understanding is required to enable interactions with objects and their functional understanding. Thus, we propose the task of part-based scene understanding of real-world 3D environments: from an RGB-D scan of a scene, we detect objects, and for each object predict its decomposition into geometric part masks, which composed together form the complete geometry of the observed object. We leverage an intermediary part graph representation to enable robust completion as well as building of part priors, which we use to construct the final part mask predictions. Our experiments demonstrate that guiding part understanding through part graph to part prior-based predictions significantly outperforms alternative approaches to the task of semantic part completion.
ISBN (Print): 9781665445092
In recent years, knowledge distillation has proven to be an effective solution for model compression. This approach can make lightweight student models acquire the knowledge extracted from cumbersome teacher models. However, previous distillation methods for detection generalize poorly across detection frameworks and rely heavily on ground truth (GT), ignoring the valuable relation information between instances. Thus, we propose a novel distillation method for detection tasks based on discriminative instances, without considering positives or negatives as distinguished by GT, which is called general instance distillation (GID). Our approach contains a general instance selection module (GISM) to make full use of feature-based, relation-based, and response-based knowledge for distillation. Extensive results demonstrate that the student model achieves significant AP improvement and even outperforms the teacher in various detection frameworks. Specifically, RetinaNet with ResNet-50 achieves 39.1% mAP with GID on the COCO dataset, surpassing the 36.2% baseline by 2.9% and even outperforming the ResNet-101-based teacher model at 38.1% AP.
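As a generic illustration of combining the three kinds of knowledge named above, the sketch below mixes feature-based, relation-based, and response-based distillation terms over a set of selected instance features. The specific loss choices, weights, and temperature are assumptions for demonstration, not GID's exact formulation.

```python
import torch
import torch.nn.functional as F

def distillation_loss(stu_feat, tea_feat, stu_logits, tea_logits,
                      w_feat=1.0, w_rel=1.0, w_resp=1.0, temp=2.0):
    """Illustrative instance-level distillation loss.

    stu_feat, tea_feat:     (N, D) features of N selected instances
    stu_logits, tea_logits: (N, C) classification responses
    Weights and temperature are assumed values, not GID's.
    """
    # Feature-based: match instance features directly.
    l_feat = F.mse_loss(stu_feat, tea_feat)
    # Relation-based: match the pairwise-distance structure between instances.
    l_rel = F.mse_loss(torch.cdist(stu_feat, stu_feat),
                       torch.cdist(tea_feat, tea_feat))
    # Response-based: soften logits and match distributions via KL divergence.
    l_resp = F.kl_div(F.log_softmax(stu_logits / temp, dim=-1),
                      F.softmax(tea_logits / temp, dim=-1),
                      reduction="batchmean") * temp ** 2
    return w_feat * l_feat + w_rel * l_rel + w_resp * l_resp
```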
ISBN (Print): 9781665445092
Our objective in this work is fine-grained classification of actions in untrimmed videos, where the actions may be temporally extended or may span only a few frames of the video. We cast this into a query-response mechanism, where each query addresses a particular question and has its own response label set. We make the following four contributions: (i) We propose a new model, the Temporal Query Network, which enables the query-response functionality and a structural understanding of fine-grained actions. It attends to relevant segments for each query with a temporal attention mechanism and can be trained using only the labels for each query. (ii) We propose a new way, a stochastic feature bank update, to train a network on videos of various lengths with the dense sampling required to respond to fine-grained queries. (iii) We compare the TQN to other architectures and text supervision methods, and analyze their pros and cons. Finally, (iv) we evaluate the method extensively on the FineGym and Diving48 benchmarks for fine-grained action classification and surpass the state-of-the-art using only RGB features. Project page: https://***/-vgg/research/tqn/.
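A minimal sketch of the query-response idea follows, assuming learned query vectors that attend over per-frame features and feed query-specific classifiers, each with its own label set. The dimensions, head counts, and label-set sizes are illustrative assumptions, not the TQN architecture itself.

```python
import torch
import torch.nn as nn

class QueryResponseHead(nn.Module):
    """Each learned query attends over temporal features and produces a
    response over its own label set (all sizes here are assumptions)."""

    def __init__(self, dim=256, num_queries=4, labels_per_query=(5, 3, 7, 2)):
        super().__init__()
        self.queries = nn.Parameter(torch.randn(num_queries, dim))
        self.attn = nn.MultiheadAttention(dim, num_heads=4, batch_first=True)
        self.heads = nn.ModuleList(nn.Linear(dim, n) for n in labels_per_query)

    def forward(self, frame_feats):
        # frame_feats: (B, T, dim) features of T sampled video frames.
        q = self.queries.unsqueeze(0).expand(frame_feats.size(0), -1, -1)
        responses, _ = self.attn(q, frame_feats, frame_feats)
        # One classifier per query, each with its own response label set.
        return [head(responses[:, i]) for i, head in enumerate(self.heads)]

# Toy usage: batch of 2 videos, 32 frames each.
logits_per_query = QueryResponseHead()(torch.randn(2, 32, 256))
```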
ISBN (Print): 9781665445092
Temporal receptive fields of models play an important role in action segmentation. Large receptive fields facilitate modeling long-term relations among video clips, while small receptive fields help capture local details. Existing methods construct models with hand-designed receptive fields in their layers. Can we effectively search for receptive field combinations to replace hand-designed patterns? To answer this question, we propose to find better receptive field combinations through a global-to-local search scheme. Our search scheme exploits both a global search to find coarse combinations and a local search to further refine the receptive field combination patterns. The global search finds possible coarse combinations beyond human-designed patterns. On top of the global search, we propose an expectation-guided iterative local search scheme to refine combinations effectively. Our global-to-local search can be plugged into existing action segmentation methods to achieve state-of-the-art performance.
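A generic sketch of a global-to-local search over receptive-field (dilation) combinations is shown below, assuming a user-supplied `evaluate` function that trains and scores a candidate pattern. The random global sampling and single-layer local perturbation are illustrative stand-ins, not the paper's expectation-guided iterative scheme.

```python
import random

def global_to_local_search(evaluate, num_layers=6, dilations=(1, 2, 4, 8, 16),
                           global_trials=20, local_rounds=3):
    """Search for a per-layer dilation pattern maximizing `evaluate(pattern)`.

    `evaluate` is assumed to return a validation score for a tuple of
    per-layer dilation rates (a stand-in for training a segmentation model).
    """
    # Global search: sample coarse combinations beyond hand-designed patterns.
    best = max((tuple(random.choice(dilations) for _ in range(num_layers))
                for _ in range(global_trials)), key=evaluate)
    best_score = evaluate(best)
    # Local search: refine one layer at a time around the best combination.
    for _ in range(local_rounds):
        for layer in range(num_layers):
            for d in dilations:
                cand = best[:layer] + (d,) + best[layer + 1:]
                score = evaluate(cand)
                if score > best_score:
                    best, best_score = cand, score
    return best, best_score

# Toy objective: prefer a pattern whose dilations roughly double layer by layer.
toy = lambda p: -sum(abs(p[i + 1] - 2 * p[i]) for i in range(len(p) - 1))
print(global_to_local_search(toy))
```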
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
This paper discusses the results of the third edition of the Monocular Depth Estimation Challenge (MDEC). The challenge focuses on zero-shot generalization to the challenging SYNS-Patches dataset, featuring complex scenes in natural and indoor settings. As with the previous edition, methods can use any form of supervision, i.e. supervised or self-supervised. The challenge received a total of 19 submissions outperforming the baseline on the test set; 10 of them submitted a report describing their approach, highlighting the widespread use of foundation models such as Depth Anything at the core of their methods. The challenge winners drastically improved 3D F-Score performance, from 17.51% to 23.72%.
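For reference, a 3D F-Score of this kind can be computed from point-to-point distances between predicted and ground-truth point clouds at a fixed correctness threshold. The sketch below is a generic implementation with an assumed threshold value, not the challenge's actual evaluation code.

```python
import numpy as np

def f_score_3d(pred, gt, threshold=0.1):
    """F-Score between two point clouds of shape (N, 3) and (M, 3).

    A predicted point counts as correct if some ground-truth point lies
    within `threshold` (and symmetrically for recall). The threshold is
    an assumed value for illustration.
    """
    dists = np.linalg.norm(pred[:, None, :] - gt[None, :, :], axis=-1)
    precision = (dists.min(axis=1) < threshold).mean()
    recall = (dists.min(axis=0) < threshold).mean()
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```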