Search Results - Inner Mongolia University Library

Search condition: "Any field=IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"

12,859 records in total; showing records 4641-4650

Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Reza Akbarian Bafghi; Nidhin Harilal; Claire Monteleoni; Maziar Raissi
Affiliations: University of Colorado Boulder; INRIA Paris; University of California Riverside

ISBN (digital): 9798350365474

ISBN (print): 9798350365481

Artificial neural networks often suffer from catastrophic forgetting, where learning new concepts leads to a complete loss of previously acquired knowledge. We observe that this issue is particularly magnified in vision transformers (ViTs), where post-pre-training and fine-tuning on new tasks can significantly degrade the model’s original general abilities. For instance, a DINO ViT-Base/16 pre-trained on ImageNet-1k loses over 70% accuracy on ImageNet-1k after just 10 iterations of fine-tuning on CIFAR-100. Overcoming this stability-plasticity dilemma is crucial for enabling ViTs to continuously learn and adapt to new domains while preserving their initial knowledge. In this work, we study two new parameter-efficient fine-tuning strategies: (1) Block Expansion, and (2) Low-rank adaptation (LoRA). Our experiments reveal that using either Block Expansion or LoRA on self-supervised pre-trained ViTs surpasses fully fine-tuned ViTs in new domains while offering significantly greater parameter efficiency. Notably, we find that Block Expansion experiences only a minimal performance drop in the pre-training domain, thereby effectively mitigating catastrophic forgetting in pre-trained ViTs.

Keywords: Knowledge engineering; Computer vision; Adaptation models; Conferences; Learning (artificial intelligence); Artificial neural networks; Transformers

U-MedSAM: Uncertainty-Aware MedSAM for Medical Image Segmentation

International Challenge on Segment Anything in Medical Images on Laptop, held in conjunction with the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024

Authors: Wang, Xin; Liu, Xiaoyu; Huang, Peng; Huang, Pu; Hu, Shu; Zhu, Hongtu
Affiliations: Albany, United States; School of Physics and Electronics, Shandong Normal University, Jinan, China; School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, China; Department of Computer and Information Technology, Purdue University, West Lafayette, United States; University of North Carolina at Chapel Hill, Chapel Hill, United States

ISBN (digital): 9783031818547

ISBN (print): 9783031818530

Medical Image Foundation Models have proven to be powerful tools for mask prediction across various datasets. However, accurately assessing the uncertainty of their predictions remains a significant challenge. To address this, we propose a new model, U-MedSAM, which integrates the MedSAM model with an uncertainty-aware loss function and the Sharpness-Aware Minimization (SharpMin) optimizer. The uncertainty-aware loss function automatically combines region-based, distribution-based, and pixel-based loss designs to enhance segmentation accuracy and robustness. SharpMin improves generalization by finding flat minima in the loss landscape, thereby reducing overfitting. Our method was evaluated in the CVPR24 MedSAM on Laptop challenge, where U-MedSAM demonstrated promising performance. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords: Medical imaging

Curriculum Graph Co-Teaching for Multi-Target Domain Adaptation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Roy, Subhankar; Krivosheev, Evgeny; Zhong, Zhun; Sebe, Nicu; Ricci, Elisa
Affiliations: Univ Trento, Trento, TN, Italy; Fdn Bruno Kessler, Povo, TN, Italy

ISBN (print): 9781665445092

In this paper, we address multi-target domain adaptation (MTDA), where, given one labeled source dataset and multiple unlabeled target datasets that differ in data distributions, the task is to learn a robust predictor for all the target domains. We identify two key aspects that can help to alleviate multiple domain-shifts in the MTDA: feature aggregation and curriculum learning. To this end, we propose Curriculum Graph Co-Teaching (CGCT) that uses a dual classifier head, with one of them being a graph convolutional network (GCN) which aggregates features from similar samples across the domains. To prevent the classifiers from over-fitting on their own noisy pseudo-labels, we develop a co-teaching strategy with the dual classifier head that is assisted by curriculum learning to obtain more reliable pseudo-labels. Furthermore, when the domain labels are available, we propose Domain-aware Curriculum Learning (DCL), a sequential adaptation strategy that first adapts on the easier target domains, followed by the harder ones. We experimentally demonstrate the effectiveness of our proposed frameworks on several benchmarks and advance the state-of-the-art in the MTDA by large margins (e.g. +5.6% on the DomainNet).

Keywords: Deep learning; Computer vision; Computer network reliability; PROM; Collaboration; Pattern recognition; Reliability

Forensic Iris Image Synthesis

IEEE Winter Applications and Computer Vision Workshops (WACVW)

Authors: Rasel Ahmed Bhuiyan; Adam Czajka
Affiliation: Department of Computer Science and Engineering, 384 Fitzpatrick Hall of Engineering, University of Notre Dame, Notre Dame, Indiana, USA

Post-mortem iris recognition is an emerging application of iris-based human identification in a forensic setup, able to correctly identify deceased subjects even three weeks post-mortem. This technique is thus considered an important component of future forensic toolkits. The current advancements in this field are seriously slowed down by exceptionally difficult data collection, which can happen in mortuary conditions, at crime scenes, or in “body farm” facilities. This paper makes a novel contribution to facilitate progress in post-mortem iris recognition by offering a conditional StyleGAN-based iris synthesis model, trained on the largest-available dataset of post-mortem iris samples acquired from more than 350 subjects, generating – through appropriate exploration of StyleGAN latent space – multiple within-class (same identity) and between-class (different new identities) post-mortem iris images, compliant with ISO/IEC 29794-6, and with decomposition deformations controlled by the requested PMI (post-mortem interval). Besides an obvious application to enhance the existing, very sparse, post-mortem iris datasets to advance – among others – iris presentation attack endeavors, we anticipate it may be useful to generate samples that would expose professional forensic human examiners to never-seen-before deformations for various PMIs, increasing their training effectiveness. The source codes and model weights are made available with the paper.

Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object Segmentation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Park, Hyojin; Yoo, Jayeon; Jeong, Seohyeong; Venkatesh, Ganesh; Kwak, Nojun
Affiliations: Seoul Natl Univ, Seoul, South Korea; Facebook Inc, Menlo Pk, CA, USA; AIRS Co, Hyundai Motor Grp, Seoul, South Korea; SNU, Seoul, South Korea

ISBN (print): 9781665445092

Current state-of-the-art approaches for Semi-supervised Video Object Segmentation (Semi-VOS) propagate information from previous frames to generate a segmentation mask for the current frame. This results in high-quality segmentation across challenging scenarios such as changes in appearance and occlusion. But it also leads to unnecessary computations for stationary or slow-moving objects where the change across frames is minimal. In this work, we exploit this observation by using temporal information to quickly identify frames with minimal change and skip the heavyweight mask generation step. To realize this efficiency, we propose a novel dynamic network that estimates change across frames and decides which path - computing a full network or reusing the previous frame's feature - to choose depending on the expected similarity. Experimental results show that our approach significantly improves inference speed without much accuracy degradation on challenging Semi-VOS datasets - DAVIS 16, DAVIS 17, and YouTube-VOS. Furthermore, our approach can be applied to multiple Semi-VOS methods, demonstrating its generality.

Keywords: Degradation; Computer vision; Codes; Computational modeling; Object segmentation; Computer architecture; Logic gates

What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Baek, Jeonghun; Matsui, Yusuke; Aizawa, Kiyoharu
Affiliation: Univ Tokyo, Tokyo, Japan

ISBN (print): 9781665445092

The scene text recognition (STR) task has a common practice: all state-of-the-art STR models are trained on large synthetic data. In contrast to this practice, training STR models only on fewer real labels (STR with fewer labels) is important when we have to train STR models without synthetic data: for handwritten or artistic texts that are difficult to generate synthetically and for languages other than English for which we do not always have synthetic data. However, there has been implicit common knowledge that training STR models on real data is nearly impossible because real data is insufficient. We consider that this common knowledge has obstructed the study of STR with fewer labels. In this work, we would like to reactivate STR with fewer labels by disproving the common knowledge. We consolidate recently accumulated public real data and show that we can train STR models satisfactorily only with real labeled data. Subsequently, we find simple data augmentation to fully exploit real data. Furthermore, we improve the models by collecting unlabeled data and introducing semi- and self-supervised methods. As a result, we obtain a model competitive with state-of-the-art methods. To the best of our knowledge, this is the first study that 1) shows sufficient performance by only using real labels and 2) introduces semi- and self-supervised methods into STR with fewer labels.

Keywords: Training; Computer vision; Codes; Text recognition; Data models; Task analysis

Learning Feature Aggregation for Deep 3D Morphable Models

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Chen, Zhixiang; Kim, Tae-Kyun
Affiliations: Imperial Coll London, London, England; Korea Adv Inst Sci & Technol, Seoul, South Korea

ISBN (print): 9781665445092

3D morphable models are widely used for the shape representation of an object class in computer vision and graphics applications. In this work, we focus on deep 3D morphable models that directly apply deep learning on 3D mesh data with a hierarchical structure to capture information at multiple scales. While great efforts have been made to design the convolution operator, how to best aggregate vertex features across hierarchical levels deserves further attention. In contrast to resorting to mesh decimation, we propose an attention based module to learn mapping matrices for better feature aggregation across hierarchical levels. Specifically, the mapping matrices are generated by a compatibility function of the keys and queries. The keys and queries are trainable variables, learned by optimizing the target objective, and shared by all data samples of the same object class. Our proposed module can be used as a train-only drop-in replacement for the feature aggregation in existing architectures for both downsampling and upsampling. Our experiments show that through the end-to-end training of the mapping matrices, we achieve state-of-the-art results on a variety of 3D shape datasets in comparison to existing morphable models.

Keywords: Training; Deep learning; Solid modeling; Computer vision; Three-dimensional displays; Shape; Convolution

Disentangling Label Distribution for Long-tailed Visual Recognition

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Hong, Youngkyu; Han, Seungju; Choi, Kwanghee; Seo, Seokjun; Kim, Beomsu; Chang, Buru
Affiliation: Hyperconnect, Seoul, South Korea

ISBN (print): 9781665445092

The current evaluation protocol of long-tailed visual recognition trains the classification model on the long-tailed source label distribution and evaluates its performance on the uniform target label distribution. Such a protocol has questionable practicality since the target may also be long-tailed. Therefore, we formulate long-tailed visual recognition as a label shift problem where the target and source label distributions are different. One of the significant hurdles in dealing with the label shift problem is the entanglement between the source label distribution and the model prediction. In this paper, we focus on disentangling the source label distribution from the model prediction. We first introduce a simple but overlooked baseline method that matches the target label distribution by post-processing the model prediction trained by the cross-entropy loss and the Softmax function. Although this method surpasses state-of-the-art methods on benchmark datasets, it can be further improved by directly disentangling the source label distribution from the model prediction in the training phase. Thus, we propose a novel method, LAbel distribution DisEntangling (LADE) loss, based on the optimal bound of Donsker-Varadhan representation. LADE achieves state-of-the-art performance on benchmark datasets such as CIFAR-100-LT, Places-LT, ImageNet-LT, and iNaturalist 2018. Moreover, LADE outperforms existing methods on various shifted target label distributions, showing the general adaptability of our proposed method.

Keywords: Training; Visualization; Computer vision; Protocols; Target recognition; Object detection; Predictive models

Learning-based Image Registration with Meta-Regularization

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Al Safadi, Ebrahim; Song, Xubo
Affiliations: Oregon Hlth & Sci Univ, Portland, OR 97201, USA; Amazon, Seattle, WA 98121, USA

ISBN (print): 9781665445092

We introduce a meta-regularization framework for learning-based image registration. Current learning-based image registration methods use high-resolution architectures such as U-Nets to produce spatial transformations, and impose simple and explicit regularization on the output of the network to ensure that the estimated displacements are smooth. While this approach works well on small deformations, it has been known to struggle when the deformations are large. Our method uses a more advanced form of meta-regularization to increase the generalization ability of learned registration models. We motivate our approach based on Reproducing Kernel Hilbert Space (RKHS) theory, and approximate that framework via a meta-regularization convolutional layer with radially symmetric, positive semi-definite filters that inherit its regularization properties. We then provide a method to learn such regularization filters while also learning to register. Our experiments on synthetic and real datasets as well as ablation analysis show that our method can improve anatomical correspondence compared to competing methods, and reduce the percentage of folding and tear in the large deformation setting, reflecting better regularization and model generalization.

Keywords: Optical filters; Deformable models; Training; Image registration; Computer architecture; Filtering theory; Registers

Keep your Eyes on the Lane: Real-time Attention-guided Lane Detection

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Tabelini, Lucas; Berriel, Rodrigo; Paixao, Thiago M.; Badue, Claudine; De Souza, Alberto F.; Oliveira-Santos, Thiago
Affiliations: Univ Fed Espirito Santo UFES, Vitoria, ES, Brazil; Inst Fed Espirito Santo IFES, Vitoria, ES, Brazil

ISBN (print): 9781665445092

Modern lane detection methods have achieved remarkable performance in complex real-world scenarios, but many have issues maintaining real-time efficiency, which is important for autonomous vehicles. In this work, we propose LaneATT: an anchor-based deep lane detection model, which, akin to other generic deep object detectors, uses the anchors for the feature pooling step. Since lanes follow a regular pattern and are highly correlated, we hypothesize that in some cases global information may be crucial to infer their positions, especially in conditions such as occlusion, missing lane markers, and others. Thus, this work proposes a novel anchor-based attention mechanism that aggregates global information. The model was evaluated extensively on three of the most widely used datasets in the literature. The results show that our method outperforms the current state-of-the-art methods, showing both higher efficacy and efficiency. Moreover, an ablation study is performed along with a discussion on efficiency trade-off options that are useful in practice.

Keywords: Computer vision; Codes; Lane detection; Computational modeling; Aggregates; Detectors; Feature extraction

235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
