检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

11,745 篇 会议
8 篇 期刊文献

馆藏范围

11,753 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

8,139 篇 工学
- 7,674 篇 计算机科学与技术...
- 804 篇 机械工程
- 580 篇 软件工程
- 376 篇 电气工程
- 252 篇 控制科学与工程
- 208 篇 光学工程
- 85 篇 生物工程
- 83 篇 信息与通信工程
- 29 篇 生物医学工程（可授...
- 23 篇 电子科学与技术（可...
- 21 篇 化学工程与技术
- 15 篇 交通运输工程
- 14 篇 安全科学与工程
- 10 篇 网络空间安全
- 8 篇 仪器科学与技术
- 6 篇 材料科学与工程（可...
- 6 篇 动力工程及工程热...
3,194 篇 医学
- 3,190 篇 临床医学
- 11 篇 基础医学(可授医学...
- 7 篇 公共卫生与预防医...
481 篇 理学
- 216 篇 物理学
- 203 篇 系统科学
- 88 篇 生物学
- 55 篇 数学
- 29 篇 统计学（可授理学、...
- 24 篇 化学
55 篇 管理学
- 29 篇 图书情报与档案管...
- 28 篇 管理科学与工程(可...
- 12 篇 工商管理
17 篇 法学
- 15 篇 社会学
6 篇 农学
4 篇 教育学
2 篇 经济学
1 篇 军事学
1 篇 艺术学

主题

5,434 篇 computer vision
2,516 篇 training
2,087 篇 pattern recognit...
1,621 篇 computational mo...
1,435 篇 visualization
1,306 篇 three-dimensiona...
1,060 篇 semantics
981 篇 codes
968 篇 benchmark testin...
898 篇 computer archite...
884 篇 deep learning
762 篇 task analysis
681 篇 feature extracti...
536 篇 face recognition
527 篇 conferences
515 篇 transformers
515 篇 neural networks
479 篇 object detection
466 篇 image segmentati...
454 篇 cameras

机构

168 篇 univ sci & techn...
144 篇 univ chinese aca...
144 篇 tsinghua univ pe...
143 篇 carnegie mellon ...
135 篇 chinese univ hon...
112 篇 peng cheng lab p...
108 篇 zhejiang univ pe...
97 篇 swiss fed inst t...
92 篇 tsinghua univers...
92 篇 sensetime res pe...
88 篇 shanghai ai lab ...
85 篇 zhejiang univers...
84 篇 shanghai jiao to...
78 篇 peng cheng labor...
77 篇 university of sc...
77 篇 alibaba grp peop...
76 篇 univ hong kong p...
76 篇 tech univ munich...
76 篇 stanford univ st...
73 篇 university of ch...

作者

76 篇 timofte radu
64 篇 van gool luc
50 篇 zhang lei
44 篇 yang yi
40 篇 loy chen change
34 篇 tao dacheng
32 篇 liu yang
32 篇 chen chen
30 篇 zhou jie
30 篇 tian qi
30 篇 sun jian
28 篇 zha zheng-jun
27 篇 qi tian
26 篇 li xin
26 篇 vasconcelos nuno
26 篇 ying shan
25 篇 liu xiaoming
25 篇 luc van gool
25 篇 boxin shi
24 篇 zheng wei-shi

语言

11,746 篇 英文
7 篇 其他

检索条件"任意字段=2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023"

共 11753 条记录，以下是11-20 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

U-MedSAM: Uncertainty-Aware MedSAM for Medical Image Segmentation

U-MedSAM: Uncertainty-Aware MedSAM for Medical Image Segmen...

引用

International Challenge on Segment Anything in Medical Images on Laptop held in conjunction with the ieee/cvf conference on computer vision and pattern recognition, cvpr 2024

作者： Wang, Xin Liu, Xiaoyu Huang, Peng Huang, Pu Hu, Shu Zhu, Hongtu Albany United States School of Physics and Electronics Shandong Normal University Jinan China School of Computing and Artificial Intelligence Southwest Jiaotong University Chengdu China Department of Computer and Information Technology Purdue University West Lafayette United States University of North Carolina at Chapel Hill Chapel Hill United States

ISBN: (纸本)9783031818530

Medical Image Foundation Models have proven to be powerful tools for mask prediction across various datasets. However, accurately assessing the uncertainty of their predictions remains a significant challenge. To address this, we propose a new model, U-MedSAM, which integrates the MedSAM model with an uncertainty-aware loss function and the Sharpness-Aware Minimization (SharpMin) optimizer. The uncertainty-aware loss function automatically combines region-based, distribution-based, and pixel-based loss designs to enhance segmentation accuracy and robustness. SharpMin improves generalization by finding flat minima in the loss landscape, thereby reducing overfitting. Our method was evaluated in the cvpr24 MedSAM on Laptop challenge, where U-MedSAM demonstrated promising performance. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Medical imaging

来源：评论

学校读者我要写书评

暂无评论

TRH2TQA: Table recognition with Hierarchical Relationships to Table Question-Answering on Business Table Images

TRH2TQA: Table Recognition with Hierarchical Relationships t...

引用

2025 ieee/cvf Winter conference on Applications of computer vision, WACV 2025

作者： Jirachanchaisiri, Pongsakorn Ly, Nam Tuan Takasu, Atsuhiro National Institute of Informatics Tokyo Japan Kanagawa Japan Tokyo University of Agriculture and Technology Tokyo Japan

ISBN: (纸本)9798331510831

Despite advancements in visual question answering, challenges persist with documents like financial reports, often structured in complicated tabular structures with complex numerical computations. An alternative approach, the pipeline-driven methodology, includes table recognition (TR) and table question-answering (TQA). Recent advancements in TR support this approach with better accuracy and interpretability. However, real-world tables usually represent hierarchical tables. They pose additional challenges due to merged cells and indents, necessitating a specific approach for hierarchical relationship extraction. In this paper, we propose TRH2TQA (Table recognition with Hierarchical Relationships to Table Question-Answering) for business table images. It consists of three modules on table images with question-answer pairs. First, the TR module extracts structure and textual content from table images into HTML format. Second, post-structure extraction is applied to identify header and hierarchical relationships using predicted column span and bounding box. Finally, this information is combined with natural language questions in the TQA module to generate the answer through the decoder. In extensive experiments, TRH2TQA outperforms in questionanswering performance on the VQAonBD 2023 dataset. © 2025 ieee.

关键词： Character recognition

来源：评论

学校读者我要写书评

暂无评论

Sign Language recognition: A Large-scale Multi-view Dataset and Comprehensive Evaluation

Sign Language Recognition: A Large-scale Multi-view Dataset ...

引用

2025 ieee/cvf Winter conference on Applications of computer vision, WACV 2025

作者： Dinh, Nguyen Son Nguyen, Tuan Dung Tran, Duc Tri Huy Pham, Nguyen Dang Tran, Thuan Hieu Tong, Ngoc Anh Hoang, Quang Huy Nguyen, Phi Le Hanoi University of Science and Technology Viet Nam Hanoi - Amsterdam High School for the Gifted Viet Nam

ISBN: (纸本)9798331510831

vision-based sign language recognition is an extensively researched problem aimed at advancing communication be-tween deaf and hearing individuals. Numerous Sign Lan-guage recognition (SLR) datasets have been introduced to promote research in this field, spanning multiple languages, vocabulary sizes, and signers. However, most existing pop-ular datasets focus predominantly on the frontal view of signers, neglecting visual information from other perspec-tives. In practice, many sign languages contain words that have similar hand movements and expressions, making it challenging to differentiate between them from a single frontal view. Although a few studies have proposed sign language datasets using multi-view data, these datasets remain limited in vocabulary size and scale, hindering their gener-alizability and practicality. To address this issue, we in-troduce a new large-scale, multi-view sign language recog-nition dataset spanning 1,000 glosses and 30 signers, re-sulting in over 84,000 multi-view videos. To the best of our knowledge, this is the first multi-view sign language recognition dataset of this scale. In conjunction with of-fering a comprehensive dataset, we perform extensive ex-periments to assess the performance of state-of-the-art Sign Language recognition models utilizing on our dataset. The findings indicate that utilizing multi-view data substantially enhances model accuracy across all models, with a maxi-mum performance improvement of up to 19.75% compared to models trained on single-view data. Our dataset and baseline models are publicly accessible on GitHub11Available at https://***/Etdihatthoc/Multi-VSL_WACV_2025. © 2025 ieee.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Modality-Specific Strategies for Medical Image Segmentation Using Lightweight SAM Architectures

Modality-Specific Strategies for Medical Image Segmentation...

引用

International Challenge on Segment Anything in Medical Images on Laptop held in conjunction with the ieee/cvf conference on computer vision and pattern recognition, cvpr 2024

作者： Dao, Thuy Ye, Xincheng Scarsbrook, Joshua Balarupan, Gowrienanthan Ribeiro, Fernanda L. Bollmann, Steffen School of Electrical Engineering and Computer Science University of Queensland Brisbane Australia Queensland Digital Health Centre University of Queensland Brisbane Australia

ISBN: (纸本)9783031818530

Medical image segmentation tasks are often intricate and require medical domain expertise. Recent advancements in deep learning have expedited these demanding tasks, transitioning from specialized models tailored to each task to versatile foundation models capable of accommodating various image modalities. However, many of these foundation models are optimized for GPU computation, necessitating significant computational resources and constraining their practical utility in clinical settings. Furthermore, their variable accuracy across modalities and novel domains undermines their reliability in clinical practice. To address these limitations, we undertake a comparative investigation into deploying medical image segmentation models on CPU, focusing on accuracy and runtime efficiency, as part of the "cvpr 2024: Segment Anything In Medical Images On Laptop" challenge. Our methodology employs different models customized for each modality, including pre-trained EfficientViT-SAM and LiteMedSAM to yield the most precise and efficient outcomes. Additionally, to bolster model performance for datasets featuring small regions of interest, such as PET scans, we integrate a majority voting mechanism. We optimize runtime using the OpenVINO format within a C++ inference script. This approach improves inference runtime while maintaining competitive accuracy, achieving an average DSC score of 0.86 on the validation set and 0.75 on the testing set with an average runtime of 4.61 s on the testing set. Notably, given that most modalities are evaluated in a zero-shot manner, our findings suggest that the zero-shot capability of foundation models can be further refined through dataset-specific inference strategies. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： C++ (programming language)

来源：评论

学校读者我要写书评

暂无评论

Semantically Conditioned Prompts for Visual recognition Under Missing Modality Scenarios

Semantically Conditioned Prompts for Visual Recognition Unde...

引用

2025 ieee/cvf Winter conference on Applications of computer vision, WACV 2025

作者： Pipoli, Vittorio Bolelli, Federico Sarto, Sara Cornia, Marcella Baraldi, Lorenzo Grana, Costantino Cucchiara, Rita Ficarra, Elisa University of Modena and Reggio Emilia Italy University of Pisa Italy

ISBN: (纸本)9798331510831

This paper tackles the domain of multimodal prompting for visual recognition, specifically when dealing with missing modalities through multimodal Transformers. It presents two main contributions: (i) we introduce a novel prompt learning module which is designed to produce sample-specific prompts and (ii) we show that modalityagnostic prompts can effectively adjust to diverse missing modality scenarios. Our model, termed SCP, exploits the semantic representation of available modalities to query a learnable memory bank, which allows the generation of prompts based on the semantics of the input. Notably, SCP distinguishes itself from existing methodologies for its capacity of self-adjusting to both the missing modality scenario and the semantic context of the input, without prior knowledge about the specific missing modality and the number of modalities. Through extensive experiments, we show the effectiveness of the proposed prompt learning framework and demonstrate enhanced performance and robustness across a spectrum of missing modality cases. Our source code is available at https://***/vittoriopipoli/SCP_WACV2025. © 2025 ieee.

关键词： missing modalities multimodal learning prompt learning transformer

来源：评论

学校读者我要写书评

暂无评论

Segment Anything in Medical Images with nnUNet

Segment Anything in Medical Images with nnUNet

引用

International Challenge on Segment Anything in Medical Images on Laptop held in conjunction with the ieee/cvf conference on computer vision and pattern recognition, cvpr 2024

作者： Stock, Raphael Kirchhoff, Yannick Rokuss, Maximilian R. Ravindran, Ashis Maier-Hein, Klaus Heidelberg Germany Faculty of Mathematics and Computer Science Heidelberg University Heidelberg Germany HIDSS4Health - Helmholtz Information and Data Science School for Health Karlsruhe Germany HIDSS4Health - Helmholtz Information and Data Science School for Health Heidelberg Germany Pattern Analysis and Learning Group Department of Radiation Oncology Heidelberg University Hospital Heidelberg Germany

ISBN: (纸本)9783031818530

In this paper, we present an enhanced medical image segmentation approach leveraging the nnUNet framework, specifically tailored to integrate bounding box prompts for improved segmentation accuracy in resource-constrained environments. By incorporating these prompts as binary masks in an additional input channel, we enable more precise and context-aware segmentation. Our methodology employs a 2D slice-wise approach optimized for CPU-based inference through just-in-time (JIT) compiled functions, ensuring efficient processing on standard clinical equipment. Our solution demonstrates robust performance, achieving an average Dice Similarity Coefficient (DSC) of 80.98% and a Normalized Surface Dice (NSD) of 83.23% across multiple modalities in the validation set. This indicates its practical applicability and effectiveness in real-world clinical settings, where computational resources may be limited. By focusing on both accuracy and efficiency, our approach makes advanced segmentation technology accessible to a broader range of healthcare providers, facilitating enhanced clinical decision-making and patient care. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

TRH2TQA: Table recognition with Hierarchical Relationships to Table Question-Answering on Business Table Images

TRH2TQA: Table Recognition with Hierarchical Relationships t...

引用

ieee Workshop on Applications of computer vision (WACV)

作者： Pongsakorn Jirachanchaisiri Nam Tuan Ly Atsuhiro Takasu National Institute of Informatics Tokyo Japan The Graduate University for Advanced Studies (SOKENDAI) Kanagawa Japan Tokyo University of Agriculture and Technology Tokyo Japan

ISBN: (数字)9798331510831

ISBN: (纸本)9798331510848

关键词： Measurement Visualization Image recognition Pipelines Natural languages Transformers Question answering (information retrieval) Decoding Logic Business

来源：评论

学校读者我要写书评

暂无评论

Differentiable Shadow Mapping for Efficient Inverse Graphics

Differentiable Shadow Mapping for Efficient Inverse Graphics

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Worchel, Markus Alexa, Marc TU Berlin Berlin Germany

ISBN: (纸本)9798350301298

We show how shadows can be efficiently generated in differentiable rendering of triangle meshes. Our central observation is that pre-filtered shadow mapping, a technique for approximating shadows based on rendering from the perspective of a light, can be combined with existing differentiable rasterizers to yield differentiable visibility information. We demonstrate at several inverse graphics problems that differentiable shadow maps are orders of magnitude faster than differentiable light transport simulation with similar accuracy - while differentiable rasterization without shadows often fails to converge.

关键词： vision + graphics

来源：评论

学校读者我要写书评

暂无评论

Initialization Noise in Image Gradients and Saliency Maps

Initialization Noise in Image Gradients and Saliency Maps

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Woerl, Ann-Christin Disselhoff, Jan Wand, Michael Johannes Gutenberg Univ Mainz Inst Comp Sci Mainz Germany

ISBN: (纸本)9798350301298

In this paper, we examine gradients of logits of image classification CNNs by input pixel values. We observe that these fluctuate considerably with training randomness, such as the random initialization of the networks. We extend our study to gradients of intermediate layers, obtained via GradCAM, as well as popular network saliency estimators such as DeepLIFT, SHAP, LIME, Integrated Gradients, and SmoothGrad. While empirical noise levels vary, qualitatively different attributions to image features are still possible with all of these, which comes with implications for interpreting such attributions, in particular when seeking data-driven explanations of the phenomenon generating the data. Finally, we demonstrate that the observed artefacts can be removed by marginalization over the initialization distribution by simple stochastic integration.

关键词： Explainable computer vision

来源：评论

学校读者我要写书评

暂无评论

Exploring and Utilizing pattern Imbalance

Exploring and Utilizing Pattern Imbalance

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Mei, Shibin Zhao, Chenglong Yuan, Shengchao Ni, Bingbing Shanghai Jiao Tong Univ Shanghai 200240 Peoples R China

ISBN: (纸本)9798350301298

In this paper, we identify pattern imbalance from several aspects, and further develop a new training scheme to avert pattern preference as well as spurious correlation. In contrast to prior methods which are mostly concerned with category or domain granularity, ignoring the potential finer structure that existed in datasets, we give a new definition of seed category as an appropriate optimization unit to distinguish different patterns in the same category or domain. Extensive experiments on domain generalization datasets of diverse scales demonstrate the effectiveness of the proposed method.

关键词： Datasets and evaluation

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 1 2 3 4 5 6 7 8 9 10 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：