Search Results - Inner Mongolia University Library


Document type

29,426 conference papers
1,400 books
235 journal articles

Holdings

31,059 electronic documents
2 print holdings

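Discipline classification

17,311 records: Engineering
- 13,652 records: Computer Science and Technology...
- 5,219 records: Software Engineering
- 2,970 records: Mechanical Engineering
- 2,647 records: Optical Engineering
- 1,413 records: Control Science and Engineering
- 1,412 records: Electrical Engineering
- 1,334 records: Information and Communication Engineering
- 658 records: Bioengineering
- 576 records: Instrument Science and Technology
- 514 records: Biomedical Engineering (may confer...
- 466 records: Electronic Science and Technology (may...
- 251 records: Chemical Engineering and Technology
- 216 records: Safety Science and Engineering
- 143 records: Transportation Engineering
- 134 records: Architecture
- 122 records: Materials Science and Engineering (may...
- 120 records: Civil Engineering
5,070 records: Science
- 3,136 records: Physics
- 2,409 records: Mathematics
- 826 records: Biology
- 803 records: Statistics (may confer Science, ...
- 299 records: Systems Science
- 228 records: Chemistry
3,832 records: Medicine
- 3,801 records: Clinical Medicine
- 187 records: Basic Medicine (may confer Medicine...
- 140 records: Pharmacy (may confer Medicine, ...
1,065 records: Management
- 618 records: Library, Information and Archival Management...
- 471 records: Management Science and Engineering (may...
- 148 records: Business Administration
373 records: Art Studies
- 373 records: Design (may confer Art Studies...
117 records: Law
82 records: Agriculture
48 records: Education
44 records: Economics
18 records: Military Science
8 records: Literature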

Topic

12,609 records: computer vision
5,703 records: pattern recognit...
3,181 records: training
2,263 records: cameras
2,179 records: computational mo...
2,116 records: feature extracti...
2,051 records: image segmentati...
1,971 records: visualization
1,967 records: shape
1,642 records: robustness
1,491 records: layout
1,476 records: three-dimensiona...
1,442 records: computer science
1,339 records: computer archite...
1,296 records: object detection
1,221 records: semantics
1,144 records: face recognition
1,107 records: conferences
1,077 records: benchmark testin...
1,056 records: humans

Institution

137 records: univ sci & techn...
134 records: tsinghua univers...
134 records: univ chinese aca...
118 records: chinese univ hon...
101 records: microsoft resear...
97 records: zhejiang univers...
95 records: national laborat...
94 records: shanghai jiao to...
93 records: zhejiang univ pe...
85 records: university of sc...
79 records: shanghai ai lab ...
78 records: swiss fed inst t...
66 records: microsoft res as...
62 records: adobe research
62 records: computer vision ...
61 records: peking univ peop...
58 records: univ oxford oxfo...
57 records: google mountain ...
57 records: hong kong univ s...
56 records: google res mount...

Author

107 records: umapada pal
82 records: van gool luc
70 records: zhang lei
59 records: timofte radu
41 records: yang yi
37 records: loy chen change
37 records: hanqing lu
33 records: liu yang
32 records: nassir navab
32 records: wang liang
32 records: xiaoou tang
30 records: tian qi
29 records: h. bischof
29 records: jan-michael frah...
29 records: vittorio murino
29 records: darrell trevor
28 records: ling haibin
28 records: chen chen
27 records: li xin
27 records: vasconcelos nuno

Language

30,810 records: English
181 records: Other
100 records: Chinese
6 records: Turkish
2 records: Japanese
2 records: Russian

Search query: "Any field = Conference on Computer Vision and Pattern Recognition"

31,061 records in total; showing records 4491-4500

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liao, Yuan-Hong; Kar, Amlan; Fidler, Sanja. Affiliations: Univ Toronto, Toronto, ON, Canada; Vector Inst, Toronto, ON, Canada; NVIDIA, Santa Clara, CA, USA

ISBN: (print) 9781665445092

Data is the engine of modern computer vision, which necessitates collecting large-scale datasets. This is expensive, and guaranteeing the quality of the labels is a major challenge. In this paper, we investigate efficient annotation strategies for collecting multi-class classification labels for a large collection of images. While methods that exploit learnt models for labeling exist, a surprisingly prevalent approach is to query humans for a fixed number of labels per datum and aggregate them, which is expensive. Building on prior work on online joint probabilistic modeling of human annotations and machine-generated beliefs, we propose modifications and best practices aimed at minimizing human labeling effort. Specifically, we make use of advances in self-supervised learning, view annotation as a semi-supervised learning problem, identify and mitigate pitfalls, and ablate several key design choices to propose effective guidelines for labeling. Our analysis is done in a more realistic simulation that involves querying human labelers, which uncovers issues with evaluation using existing worker simulation methods. Simulated experiments on a 125k image subset of the ImageNet100 show that it can be annotated to 80% top-1 accuracy with 0.35 annotations per image on average, a 2.7x and 6.7x improvement over prior work and manual annotation, respectively.

Keywords: Computer vision; Analytical models; Annotations; Manuals; Semisupervised learning; Probabilistic logic; Pattern recognition

Student-Teacher Learning from Clean Inputs to Noisy Inputs

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Hong, Guanzhe; Mao, Zhiyuan; Lin, Xiaojun; Chan, Stanley H. Affiliation: Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907, USA

ISBN: (print) 9781665445092

Feature-based student-teacher learning, a training method that encourages the student's hidden features to mimic those of the teacher network, is empirically successful in transferring the knowledge from a pre-trained teacher network to the student network. Furthermore, recent empirical results demonstrate that the teacher's features can boost the student network's generalization even when the student's input sample is corrupted by noise. However, there is a lack of theoretical insights into why and when this method of transferring knowledge can be successful between such heterogeneous tasks. We analyze this method theoretically using deep linear networks, and experimentally using nonlinear networks. We identify three vital factors to the success of the method: (1) whether the student is trained to zero training loss; (2) how knowledgeable the teacher is on the clean-input problem; (3) how the teacher decomposes its knowledge in its hidden features. Lack of proper control in any of the three factors leads to failure of the student-teacher learning method.

Keywords: Training; Learning systems; Knowledge engineering; Computer vision; Systematics; Protocols; Numerical analysis

Harnessing Vision Transformers for Precise and Explainable Breast Cancer Diagnosis

27th International Conference on Pattern Recognition, ICPR 2024

Authors: Balaha, Hossam Magdy; Ali, Khadiga M.; Gondim, Dibson; Ghazal, Mohammed; El-Baz, Ayman. Affiliations: Bioengineering Department, J.B. Speed School of Engineering, University of Louisville, Louisville, KY, United States; Pathology Department, Faculty of Medicine, Mansoura University, Mansoura, Egypt; Department of Pathology and Laboratory Medicine, University of Louisville, Louisville, KY, United States; Electrical, Computer and Biomedical Engineering Department, Abu Dhabi University, Abu Dhabi, United Arab Emirates

ISBN: (print) 9783031781940

Breast cancer (BC) remains a significant global health challenge, impacting millions of lives annually. Traditional histopathological analysis, while essential, can be subjective and time-consuming, potentially leading to diagnostic inaccuracies. This study proposes a novel Computer-Aided Diagnosis (CAD) framework utilizing Vision Transformers (ViTs) for BC diagnosis from histopathology slides. ViTs excel in capturing global dependencies within images, offering enhanced diagnostic accuracy compared to conventional methods. The framework integrates ViTs with advanced decision-making techniques like 2-tier majority fusion and SHapley Additive exPlanations (SHAP) for improved interpretability. Experimental results on a dataset of post-neoadjuvant therapy breast cancer samples demonstrate the efficacy of the proposed approach, achieving high performance metrics and providing insights into model predictions. The proposed approach achieves state-of-the-art performance with an accuracy exceeding 97%, surpassing existing methods both on the utilized dataset and on an external benchmark, specifically the Breast Cancer Histopathological Database (BreakHis). Time complexity analysis suggests that the proposed framework offers computational efficiency, with the dominant factors influencing overall complexity being the number of patches, sequence length, and number of layers in the ViT model. This study contributes a robust methodology towards enhancing BC diagnostic precision and efficiency through cutting-edge AI technologies. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords: Diseases

Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Salvador, Amaia; Gundogdu, Erhan; Bazzani, Loris; Donoser, Michael. Affiliation: Amazon, Seattle, WA 98109, USA

ISBN: (print) 9781665445092

Cross-modal recipe retrieval has recently gained substantial attention due to the importance of food in people's lives, as well as the availability of vast amounts of digital cooking recipes and food images to train machine learning models. In this work, we revisit existing approaches for cross-modal recipe retrieval and propose a simplified end-to-end model based on well-established and high-performing encoders for text and images. We introduce a hierarchical recipe Transformer which attentively encodes individual recipe components (titles, ingredients and instructions). Further, we propose a self-supervised loss function computed on top of pairs of individual recipe components, which is able to leverage semantic relationships within recipes, and enables training using both image-recipe and recipe-only samples. We conduct a thorough analysis and ablation studies to validate our design choices. As a result, our proposed method achieves state-of-the-art performance in the cross-modal recipe retrieval task on the Recipe1M dataset. We make code and models publicly available.

Keywords: Training; Computer vision; Codes; Computational modeling; Semantics; Machine learning; Transformers

Dynamic Cues-Assisted Transformer for Robust Point Cloud Registration

Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Hong Chen; Pei Yan; Sihe Xiang; Yihua Tan. Affiliation: Hubei Engineering Research Center of Machine Vision and Intelligent Systems, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China

ISBN: (digital) 9798350353006

ISBN: (print) 9798350353013

Point Cloud Registration is a critical and challenging task in computer vision. Recent advancements have predominantly embraced a coarse-to-fine matching mechanism, with the key to matching the superpoints located in patches with interframe consistent structures. However, previous methods still face challenges with ambiguous matching, because the interference information aggregated from irrelevant regions may disturb the capture of interframe consistency relations, leading to wrong matches. To address this issue, we propose Dynamic Cues-Assisted Transformer (DCATr). Firstly, the interference from irrelevant regions is greatly reduced by constraining attention to certain cues, i.e., regions with highly correlated structures of potential corresponding superpoints. Secondly, cues-assisted attention is designed to mine the interframe consistency relations, while more attention is assigned to pairs with high consistent confidence in feature aggregation. Finally, a dynamic updating fashion is proposed to facilitate mining richer consistency information, further improving aggregated features' distinctiveness and relieving matching ambiguity. Extensive evaluations on indoor and outdoor standard benchmarks demonstrate that DCATr outperforms all state-of-the-art methods.

Keywords: Point cloud compression; Computer vision; Interference; Benchmark testing; Transformers; Feature extraction; Pattern recognition

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wang, Gu; Manhardt, Fabian; Tombari, Federico; Ji, Xiangyang. Affiliations: Tsinghua Univ, BNRist, Beijing, Peoples R China; Tech Univ Munich, Munich, Germany; Google, Mountain View, CA 94043, USA

ISBN: (print) 9781665445092

6D pose estimation from a single RGB image is a fundamental task in computer vision. The current top-performing deep learning-based methods rely on an indirect strategy, i.e., first establishing 2D-3D correspondences between the coordinates in the image plane and object coordinate system, and then applying a variant of the PnP/RANSAC algorithm. However, this two-stage pipeline is not end-to-end trainable, and is thus hard to employ for many tasks requiring differentiable poses. On the other hand, methods based on direct regression are currently inferior to geometry-based methods. In this work, we perform an in-depth investigation on both direct and indirect methods, and propose a simple yet effective Geometry-guided Direct Regression Network (GDR-Net) to learn the 6D pose in an end-to-end manner from dense correspondence-based intermediate geometric representations. Extensive experiments show that our approach remarkably outperforms state-of-the-art methods on LM, LM-O and YCB-V datasets. Code is available at https://***/GDR-Net.

Keywords: Learning systems; Convolutional codes; Computer vision; Pose estimation; Pipelines; Real-time systems; Pattern recognition

Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Kim, Hyunsu; Choi, Yunjey; Kim, Junho; Yoo, Sungjoo; Uh, Youngjung. Affiliations: NAVER AI Lab, Seoul, South Korea; Seoul Natl Univ, Seoul, South Korea; Yonsei Univ, Seoul, South Korea

ISBN: (print) 9781665445092

Generative adversarial networks (GANs) synthesize realistic images from random latent vectors. Although manipulating the latent vectors controls the synthesized outputs, editing real images with GANs suffers from either i) time-consuming optimization for projecting real images to the latent vectors or ii) inaccurate embedding through an encoder. We propose StyleMapGAN: the intermediate latent space has spatial dimensions, and a spatially variant modulation replaces AdaIN. It makes the embedding through an encoder more accurate than existing optimization-based methods while maintaining the properties of GANs. Experimental results demonstrate that our method significantly outperforms state-of-the-art models in various image manipulation tasks such as local editing and image interpolation. Last but not least, conventional editing methods on GANs are still valid on our StyleMapGAN.

Keywords: Interpolation; Computer vision; Codes; Computational modeling; Modulation; Generative adversarial networks; Real-time systems

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Miangoleh, S. Mahdi H.; Dille, Sebastian; Mai, Long; Paris, Sylvain; Aksoy, Yagiz. Affiliations: Simon Fraser Univ, Burnaby, BC, Canada; Adobe Res, Bangalore, Karnataka, India

ISBN: (print) 9781665445092

Neural networks have shown great abilities in estimating depth from a single image. However, the inferred depth maps are well below one-megapixel resolution and often lack fine-grained details, which limits their practicality. Our method builds on our analysis of how the input resolution and the scene structure affect depth estimation performance. We demonstrate that there is a trade-off between a consistent scene structure and the high-frequency details, and merge low- and high-resolution estimations to take advantage of this duality using a simple depth merging network. We present a double estimation method that improves the whole-image depth estimation and a patch selection method that adds local details to the final result. We demonstrate that by merging estimations at different resolutions with changing context, we can generate multi-megapixel depth maps with a high level of detail using a pre-trained model.

Keywords: Location awareness; Image segmentation; Computer vision; Image resolution; Merging; Neural networks; Estimation

Guidance Network with Staged Learning for Image Enhancement

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liang, Luming; Zharkov, Ilya; Amjadi, Faezeh; Joze, Hamid Reza Vaezi; Pradeep, Vivek. Affiliation: Microsoft, One Microsoft Way, Redmond, WA 98052, USA

ISBN: (print) 9781665448994

Many important yet not fully resolved problems in computational photography and image enhancement, e.g. generating well-lit images from their low-light counterparts or producing RGB images from their RAW camera inputs, share a common nature: discovering a color mapping from input pixels to output pixels based on both global information and local details. We propose a novel deep neural network architecture to learn the RAW to RGB mapping based on this common nature. This architecture consists of both global and local sub-networks, where the first sub-network focuses on determining illumination and color mapping, and the second sub-network deals with recovering image details. The result of the global network serves as guidance to the local network to form the final RGB images. Our method outperforms the state of the art with a significantly smaller size of network features on various image enhancement tasks.

Keywords: Training; Photography; Image color analysis; Lighting; Computer architecture; Cameras; Service-oriented architecture

Convolutional Hough Matching Networks

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Min, Juhong; Cho, Minsu. Affiliations: POSTECH CSE, Pohang, South Korea; POSTECH GSAI, Pohang, South Korea

ISBN: (print) 9781665445092

Despite advances in feature representation, leveraging geometric relations is crucial for establishing reliable visual correspondences under large variations of images. In this work, we introduce a Hough transform perspective on convolutional matching and propose an effective geometric matching algorithm, dubbed Convolutional Hough Matching (CHM). The method distributes similarities of candidate matches over a geometric transformation space and evaluates them in a convolutional manner. We cast it into a trainable neural layer with a semi-isotropic high-dimensional kernel, which learns non-rigid matching with a small number of interpretable parameters. To validate the effect, we develop the neural network with CHM layers that perform convolutional matching in the space of translation and scaling. Our method sets a new state of the art on standard benchmarks for semantic visual correspondence, proving its strong robustness to challenging intra-class variations.

Keywords: Visualization; Computer vision; Semantics; Neural networks; Transforms; Benchmark testing; Robustness

235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region; postal code 010021
