ISBN (print): 9781665487399
Neural architecture search (NAS) has proven its worth in discovering new neural networks. Because it can satisfy multiple objectives in a single search, it is especially useful for getting the most out of embedded devices with limited resources. However, research into small and efficient neural networks predates NAS. We investigate the influence of combining this pre-existing knowledge with NAS techniques, and propose to hot-start the NAS search with a human-designed optimal network. Our experiments show that doing so speeds up the NAS process significantly, but the resulting optimal model is only marginally better. Since embedded devices are often used for a specific task, we also explore the impact of using a task-specific dataset in the NAS process. Our experiments demonstrate that for a constrained problem, a smaller network can be found than for a general problem.
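The hot-start idea can be illustrated with a toy evolutionary search: seed the initial population with a known-good, human-designed architecture instead of purely random ones. This is a minimal sketch, not the authors' implementation; the encoding, mutation operator, and fitness function below are hypothetical.

```python
import random

# Toy architecture encoding: (depth, width) pairs. The fitness is a
# hypothetical stand-in for a multi-objective score (accuracy vs. size).
def fitness(arch):
    depth, width = arch
    accuracy_proxy = 1.0 - 1.0 / (1 + depth * width)  # grows with capacity
    size_penalty = 0.001 * depth * width              # penalizes big models
    return accuracy_proxy - size_penalty

def mutate(arch):
    depth, width = arch
    return (max(1, depth + random.choice([-1, 0, 1])),
            max(8, width + random.choice([-8, 0, 8])))

def evolve(population, generations=50):
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: len(population) // 2]
        population = parents + [mutate(random.choice(parents)) for _ in parents]
    return max(population, key=fitness)

random.seed(0)
# Cold start: random architectures only.
cold = [(random.randint(1, 20), random.randrange(8, 256, 8)) for _ in range(20)]
# Hot start: replace one random member with a human-designed network
# (a hypothetical stand-in for, e.g., a MobileNet-like design).
human_designed = (12, 64)
hot = cold[:-1] + [human_designed]

print("cold-start best:", evolve(cold))
print("hot-start best: ", evolve(hot))
```

With the good seed in the population, selection converges in fewer generations, mirroring the abstract's observation that hot-starting mainly buys search speed rather than a better final model.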
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Neural networks are ubiquitous in many tasks, but trusting their predictions is an open issue. Uncertainty quantification is required for many applications, and disentangled aleatoric and epistemic uncertainties are the most informative. In this paper, we generalize methods for producing disentangled uncertainties so that they work with different uncertainty quantification methods, and we evaluate their capability to produce disentangled uncertainties. Our results show that there is an interaction between learning aleatoric and epistemic uncertainty, which is unexpected and violates assumptions about aleatoric uncertainty; that some methods, like Flipout, produce zero epistemic uncertainty; that aleatoric uncertainty is unreliable in the out-of-distribution setting; and that Ensembles provide overall the best disentangling quality. We also explore the error introduced by the number-of-samples hyper-parameter in the sampling softmax function, recommending N > 100 samples. We expect our formulation and results to help practitioners and researchers choose uncertainty methods and expand the use of disentangled uncertainties, as well as motivate additional research into this topic.
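One standard way to disentangle the two uncertainties from Monte Carlo samples (e.g., ensemble members or sampling-softmax draws) is the entropy decomposition: total predictive entropy splits into expected entropy (aleatoric) plus mutual information (epistemic). A minimal NumPy sketch under that assumption; it is a common formulation, not necessarily the paper's exact one.

```python
import numpy as np

def disentangle(probs, eps=1e-12):
    """probs: (N, C) array of N Monte Carlo softmax samples over C classes
    (e.g., ensemble members or N > 100 sampling-softmax draws)."""
    mean = probs.mean(axis=0)                          # predictive distribution
    total = -(mean * np.log(mean + eps)).sum()         # total uncertainty H[E[p]]
    aleatoric = -(probs * np.log(probs + eps)).sum(axis=1).mean()  # E[H[p]]
    epistemic = total - aleatoric                      # mutual information
    return total, aleatoric, epistemic

rng = np.random.default_rng(0)
# 200 samples over 5 classes; a Dirichlet stands in for a real model's draws.
samples = rng.dirichlet(alpha=[2, 1, 1, 1, 1], size=200)
print(disentangle(samples))
```

The epistemic term is the gap between total and expected entropy, which is why too few samples (small N) biases it: the Monte Carlo estimate of the mean distribution is noisy.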
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Current methods for pruning neural network weights iteratively apply magnitude-based pruning and re-train the resulting model to recover lost accuracy. In this work, we show that such strategies do not allow erroneously pruned weights to recover. To enable weight recovery, we propose a simple strategy called cyclical pruning, which makes the pruning schedule periodic and allows weights pruned erroneously in one cycle to recover in subsequent ones. Experimental results on both linear models and large-scale deep neural networks show that cyclical pruning outperforms existing pruning algorithms, especially at high sparsity ratios. Our approach is easy to tune and can be readily incorporated into existing pruning pipelines to boost performance.
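The core mechanism, recomputing the magnitude mask as sparsity ramps up within each cycle so that mistakenly pruned weights can grow back, can be sketched as follows. The toy "training step" is a placeholder gradient, and the schedule details are assumptions, not the paper's exact recipe.

```python
import numpy as np

rng = np.random.default_rng(0)

def magnitude_mask(w, sparsity):
    """Boolean mask keeping the largest-magnitude (1 - sparsity) fraction of w."""
    k = int(sparsity * w.size)                 # number of weights to prune
    if k == 0:
        return np.ones(w.shape, dtype=bool)
    cutoff = np.partition(np.abs(w).ravel(), k - 1)[k - 1]
    return np.abs(w) > cutoff

def cyclical_prune(w, target=0.9, cycles=3, steps=100, lr=0.1):
    for _ in range(cycles):
        for step in range(steps):
            # Sparsity ramps 0 -> target within each cycle, so every cycle
            # starts dense and erroneously pruned weights may recover.
            sparsity = target * (step + 1) / steps
            mask = magnitude_mask(w, sparsity)         # recomputed every step
            grad = rng.normal(scale=0.01, size=w.shape)  # placeholder gradient
            w = (w - lr * grad) * mask
    return w

w = cyclical_prune(rng.normal(size=(64, 64)))
print("final sparsity:", np.mean(w == 0))
```

The contrast with one-shot iterative pruning is the periodic reset: a weight zeroed in one cycle re-enters the ranking at the start of the next, instead of staying permanently masked.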
ISBN (print): 9798350353006
This paper explores the problem of Generalist Anomaly Detection (GAD), aiming to train one single detection model that can generalize to detect anomalies in diverse datasets from different application domains without any further training on the target data. Some recent studies have shown that large pre-trained Visual-Language Models (VLMs) like CLIP have strong generalization capabilities for detecting industrial defects across various datasets, but their methods rely heavily on handcrafted text prompts about defects, making them difficult to generalize to anomalies in other applications, e.g., medical image anomalies or semantic anomalies in natural images. In this work, we propose to train a GAD model with few-shot normal images as sample prompts for AD on diverse datasets on the fly. To this end, we introduce a novel approach that learns an in-context residual learning model for GAD, termed InCTRL. It is trained on an auxiliary dataset to discriminate anomalies from normal samples based on a holistic evaluation of the residuals between query images and few-shot normal sample prompts. Regardless of the dataset, larger residuals are by definition expected for anomalies than for normal samples, thereby enabling InCTRL to generalize across different domains without further training. Comprehensive experiments on nine AD datasets are performed to establish a GAD benchmark that encapsulates the detection of industrial defect anomalies, medical anomalies, and semantic anomalies in both one-vs-all and multi-class settings, on which InCTRL is the best performer and significantly outperforms state-of-the-art competing methods. Code is available at https://***/mala-lab/InCTRL.
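The residual idea can be sketched independently of InCTRL's architecture: embed the query and the few-shot normal prompts with a frozen encoder, then score the query by its feature residual to the nearest prompt. The encoder and scoring below are hypothetical stand-ins, not the paper's learned in-context model.

```python
import numpy as np

def encode(images):
    """Hypothetical frozen encoder (stand-in for a CLIP-like backbone):
    a fixed random projection, which roughly preserves relative distances."""
    rng = np.random.default_rng(42)            # fixed weights = frozen encoder
    proj = rng.normal(size=(images.shape[1], 128)) / np.sqrt(images.shape[1])
    return images @ proj

def anomaly_score(query, normal_prompts):
    """Residual-based score: distance from the query feature to its closest
    few-shot normal prompt feature (larger residual = more anomalous)."""
    q, p = encode(query), encode(normal_prompts)
    residuals = np.linalg.norm(q[:, None, :] - p[None, :, :], axis=-1)
    return residuals.min(axis=1)

rng = np.random.default_rng(0)
normal_prompts = rng.normal(0.0, 1.0, size=(8, 256))  # K = 8 normal samples
query_normal = rng.normal(0.0, 1.0, size=(1, 256))    # in-distribution query
query_anomaly = rng.normal(3.0, 1.0, size=(1, 256))   # shifted distribution
print("normal query score:   ", anomaly_score(query_normal, normal_prompts))
print("anomalous query score:", anomaly_score(query_anomaly, normal_prompts))
```

Because the score only compares the query against whatever normal prompts are supplied at test time, nothing dataset-specific is baked in, which is the property the abstract exploits for cross-domain generalization.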
ISBN (print): 9798350353006
Multi-modality image fusion is a technique that combines information from different sensors or modalities, enabling the fused image to retain complementary features from each modality, such as functional highlights and texture details. However, effective training of such fusion models is challenging due to the scarcity of ground truth fusion data. To tackle this issue, we propose the Equivariant Multi-Modality imAge fusion (EMMA) paradigm for end-to-end self-supervised learning. Our approach is rooted in the prior knowledge that natural imaging responses are equivariant to certain transformations. Consequently, we introduce a novel training paradigm that encompasses a fusion module, a pseudo-sensing module, and an equivariant fusion module. These components enable the network training to follow the principles of the natural sensing-imaging process while satisfying the equivariant imaging prior. Extensive experiments confirm that EMMA yields high-quality fusion results for infrared-visible and medical images, concurrently facilitating downstream multi-modal segmentation and detection tasks. The code is available at https://***/Zhaozixiang1228/MMIF-EMMA.
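The equivariance prior can be expressed as a self-supervised consistency condition: fusing transformed inputs should match transforming the fused output, fuse(T(a), T(b)) ≈ T(fuse(a, b)). A minimal sketch with a trivial fusion operator and 90-degree rotations as the transformation; the loss shape illustrates the general paradigm, not EMMA's exact training objective.

```python
import numpy as np

def fuse(a, b):
    """Toy fusion operator (stand-in for the learned fusion module):
    pixel-wise maximum keeps the brighter structure from either modality."""
    return np.maximum(a, b)

def transform(x, k):
    """A sample transformation: rotate the image by k * 90 degrees."""
    return np.rot90(x, k)

def equivariance_loss(a, b, k=1):
    """|| fuse(T(a), T(b)) - T(fuse(a, b)) ||^2 averaged over pixels.
    Zero here because max commutes with rotation; for a learned fusion
    network this residual becomes the self-supervised training signal."""
    lhs = fuse(transform(a, k), transform(b, k))
    rhs = transform(fuse(a, b), k)
    return np.mean((lhs - rhs) ** 2)

rng = np.random.default_rng(0)
infrared = rng.random((64, 64))   # stand-in infrared image
visible = rng.random((64, 64))    # stand-in visible image
print("equivariance residual:", equivariance_loss(infrared, visible))
```

The appeal of this kind of loss is that it needs no ground-truth fused image, which is exactly the data-scarcity problem the abstract identifies.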
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Multi-class cell detection (cancer or non-cancer) from a whole slide image (WSI) is an important task for pathological diagnosis. Cancer and non-cancer cells often have a similar appearance, so it is difficult even for experts to classify a cell from a patch image of an individual cell. They usually identify the cell type not only from the appearance of a single cell but also from the context of the surrounding cells. To use such information, we propose a multi-class cell-detection method that introduces a modified self-attention to aggregate the surrounding image features of both classes. Experimental results demonstrate the effectiveness of the proposed method; it achieved the best performance compared with a baseline that simply uses standard self-attention.
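The context-aggregation step can be sketched as plain scaled dot-product self-attention over the feature vectors of a cell and its detected neighbors; the paper's specific modification to standard self-attention is not shown here, and the dimensions are illustrative assumptions.

```python
import numpy as np

def self_attention(x, wq, wk, wv):
    """Scaled dot-product self-attention: each cell's feature is updated as
    a weighted sum of all neighboring cell features."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(k.shape[1])
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)     # softmax over neighbors
    return weights @ v

rng = np.random.default_rng(0)
d = 32
cells = rng.normal(size=(10, d))   # features of 10 nearby detected cells
wq, wk, wv = (rng.normal(size=(d, d)) * d**-0.5 for _ in range(3))
context_aware = self_attention(cells, wq, wk, wv)
print(context_aware.shape)         # (10, 32): each cell now encodes its context
```

After this step, each cell's representation mixes in its neighbors, mimicking how a pathologist reads a cell against its surroundings rather than in isolation.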
ISBN (print): 9798350353006
Utilizing large language models (LLMs) to compose off-the-shelf visual tools represents a promising avenue of research for developing robust visual assistants capable of addressing diverse visual tasks. However, these methods often overlook the potential for continual learning, typically by freezing the utilized tools, thus limiting their adaptation to environments requiring new knowledge. To tackle this challenge, we propose CLOVA, a Closed-LOop Visual Assistant, which operates within a framework encompassing inference, reflection, and learning phases. During the inference phase, LLMs generate programs and execute corresponding tools to complete assigned tasks. In the reflection phase, a multimodal global-local reflection scheme analyzes human feedback to determine which tools require updating. Lastly, the learning phase employs three flexible approaches to automatically gather training data and introduces a novel prompt tuning scheme to update the tools, allowing CLOVA to efficiently acquire new knowledge. Experimental findings demonstrate that CLOVA surpasses existing tool-usage methods by 5% in visual question answering and multiple-image reasoning, by 10% in knowledge tagging, and by 20% in image editing. These results underscore the significance of the continual learning capability in general visual assistants.
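The closed loop reduces to a simple control-flow skeleton; every function below is a hypothetical placeholder for CLOVA's LLM program generation, tool execution, reflection, and prompt-tuning components, not its actual interfaces.

```python
# Skeleton of an inference -> reflection -> learning loop (all placeholders).
def generate_program(task, tools):
    # An LLM would compose a tool-use program here; we just pick one tool.
    return [tools[0]]

def execute(program, task):
    return [tool(task) for tool in program]

def reflect(task, result, feedback):
    # Global-local reflection: decide which tools caused the failure.
    return [0] if "wrong" in feedback else []

def update_tool(tool, task):
    # Stand-in for gathering data + prompt tuning; returns an "improved" tool.
    return lambda t: f"updated answer for: {t}"

def closed_loop(task, tools, feedback):
    program = generate_program(task, tools)           # inference phase
    result = execute(program, task)
    for idx in reflect(task, result, feedback):       # reflection phase
        tools[idx] = update_tool(tools[idx], task)    # learning phase
    return execute(generate_program(task, tools), task)

tools = [lambda t: f"initial answer for: {t}"]
print(closed_loop("count the red objects", tools, feedback="wrong count"))
```

The key contrast with frozen-tool pipelines is the third phase: feedback flows back into the tools themselves, not just into the prompt for the next query.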
ISBN (print): 9798350353006
3D city generation is a desirable yet challenging task, since humans are more sensitive to structural distortions in urban environments. Additionally, generating 3D cities is more complex than 3D natural scenes, since buildings, as objects of the same class, exhibit a wider range of appearances than the relatively consistent appearance of objects like trees in natural scenes. To address these challenges, we propose CityDreamer, a compositional generative model designed specifically for unbounded 3D cities. Our key insight is that 3D city generation should be a composition of different types of neural fields: 1) various building instances, and 2) background stuff, such as roads and greenery. Specifically, we adopt the bird's-eye-view scene representation and employ a volumetric renderer for both instance-oriented and stuff-oriented neural fields. The generative hash grid and periodic positional embedding are tailored as scene parameterizations to suit the distinct characteristics of building instances and background stuff. Furthermore, we contribute a suite of CityGen Datasets, including OSM and GoogleEarth, which comprises a vast amount of real-world city imagery to enhance the realism of the generated 3D cities both in their layouts and appearances. CityDreamer achieves state-of-the-art performance not only in generating realistic 3D cities but also in localized editing within the generated cities.
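Of the two scene parameterizations, the periodic positional embedding is the easier one to sketch: scene coordinates are lifted to sines and cosines at octave frequencies before being fed to the stuff-oriented field. Frequencies and dimensions here are illustrative assumptions, not CityDreamer's actual settings.

```python
import numpy as np

def periodic_embedding(coords, num_freqs=6):
    """Map (N, 3) scene coordinates to sin/cos features at octave frequencies,
    a common parameterization for neural fields."""
    freqs = 2.0 ** np.arange(num_freqs) * np.pi   # pi, 2*pi, 4*pi, ...
    angles = coords[..., None] * freqs            # (N, 3, F)
    emb = np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)
    return emb.reshape(coords.shape[0], -1)       # (N, 3 * 2F)

pts = np.random.default_rng(0).uniform(-1, 1, size=(4, 3))  # sampled points
print(periodic_embedding(pts).shape)                        # (4, 36)
```

A periodic embedding suits repetitive background stuff like road grids, while a hash grid gives each building instance its own high-frequency capacity, which is presumably why the two are paired with the two field types.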
ISBN (print): 9798350353006
We study the visual semantic embedding problem for image-text matching. Most existing work utilizes a tailored cross-attention mechanism to perform local alignment across the image and text modalities. While more powerful than the unimodal dual-encoder approach, this is computationally expensive. This work introduces a dual-encoder image-text matching model, leveraging a scene graph to represent captions with nodes for objects and attributes interconnected by relational edges. Utilizing a graph attention network, our model efficiently encodes object-attribute and object-object semantic relations, resulting in a robust and fast-performing system. Representing the caption as a scene graph lets us exploit the strong relational inductive bias of graph neural networks to learn these relations effectively. To train the model, we propose losses that align the image and caption both at the holistic level (image-caption) and the local level (image-object entity), which we show is key to the success of the model. Our model is termed Composition model for Object Relations and Attributes, CORA. Experimental results on two prominent image-text retrieval benchmarks, Flickr30K and MS-COCO, demonstrate that CORA outperforms existing state-of-the-art, computationally expensive cross-attention methods in recall while retaining the fast computation speed of a dual encoder. Our code is available at https://***/vkhoi/cora_cvpr24
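The two-level training signal, aligning images with whole captions and with individual object entities, can be sketched as a pair of InfoNCE-style contrastive losses over batch embeddings. The shapes, temperature, and loss weighting are assumptions; the scene-graph encoder itself is omitted.

```python
import numpy as np

def info_nce(a, b, temperature=0.07):
    """Contrastive loss (one direction): matched rows of a and b are
    positives; all other pairings in the batch are negatives."""
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    logits = a @ b.T / temperature
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))

rng = np.random.default_rng(0)
B, D = 16, 64
img = rng.normal(size=(B, D))               # image embeddings
cap = img + 0.1 * rng.normal(size=(B, D))   # matched caption (graph) embeddings
obj = img + 0.3 * rng.normal(size=(B, D))   # matched object-entity embeddings

# Holistic (image-caption) plus local (image-object) alignment.
loss = info_nce(img, cap) + 0.5 * info_nce(img, obj)
print("total alignment loss:", loss)
```

Because both losses only require independently computed embeddings, retrieval stays a cheap nearest-neighbor search, the dual-encoder speed advantage the abstract emphasizes.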
ISBN (print): 9798350353013; 9798350353006
At the core of portrait photography is the search for ideal lighting and viewpoint. The process often requires advanced knowledge in photography and an elaborate studio setup. In this work, we propose Holo-Relighting, a volumetric relighting method that is capable of synthesizing novel viewpoints and novel lighting from a single image. Holo-Relighting leverages the pretrained 3D GAN (EG3D) to reconstruct geometry and appearance from an input portrait as a set of 3D-aware features. We design a relighting module conditioned on a given lighting to process these features and predict a relit 3D representation in the form of a tri-plane, which can render to an arbitrary viewpoint through volume rendering. Besides viewpoint and lighting control, Holo-Relighting also takes the head pose as a condition to enable head-pose-dependent lighting effects. With these novel designs, Holo-Relighting can generate complex non-Lambertian lighting effects (e.g., specular highlights and cast shadows) without using any explicit physical lighting priors. We train Holo-Relighting with data captured with a light stage, and propose two data-rendering techniques to improve the data quality for training the volumetric relighting system. Through quantitative and qualitative experiments, we demonstrate that Holo-Relighting achieves state-of-the-art relighting quality with better photorealism, 3D consistency, and controllability.
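The conditioning pattern, modulating 3D-aware features with a lighting code and head pose before decoding to a tri-plane, can be sketched as feature-wise affine (FiLM-style) modulation. Everything below is a schematic stand-in with random weights, not the actual EG3D-based pipeline or its dimensions.

```python
import numpy as np

rng = np.random.default_rng(0)

def linear(in_dim, out_dim):
    """Random affine layer standing in for a learned one."""
    w = rng.normal(size=(in_dim, out_dim)) / np.sqrt(in_dim)
    b = np.zeros(out_dim)
    return lambda x: x @ w + b

D, L, P = 256, 27, 6                  # feature, lighting (e.g. SH), pose dims
to_scale = linear(L + P, D)           # condition -> per-channel scale
to_shift = linear(L + P, D)           # condition -> per-channel shift
to_triplane = linear(D, 3 * 32 * 32)  # decode to a tiny 3-plane representation

def relight(features, lighting, head_pose):
    cond = np.concatenate([lighting, head_pose])
    modulated = features * (1 + to_scale(cond)) + to_shift(cond)  # FiLM-style
    return to_triplane(modulated).reshape(3, 32, 32)  # one plane per axis pair

feats = rng.normal(size=D)            # 3D-aware features from GAN inversion
sh_light = rng.normal(size=L)         # target lighting code
pose = rng.normal(size=P)             # head-pose condition
print(relight(feats, sh_light, pose).shape)   # (3, 32, 32)
```

The point of conditioning before the tri-plane decode is that lighting then affects the whole 3D representation, so renders from any viewpoint stay consistent with the chosen illumination.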