ISBN (digital): 9798350365474; ISBN (print): 9798350365481
Large Language Models have demonstrated remarkable performance across various tasks, exhibiting the capacity to swiftly acquire new skills, such as through In-Context Learning (ICL) with minimal demonstration examples. In this work, we present a comprehensive framework for investigating Multimodal ICL (M-ICL) in the context of Large Multimodal Models. We consider the best open-source multimodal models (e.g., IDEFICS, OpenFlamingo) and a wide range of multimodal tasks. Our study unveils several noteworthy findings: (1) M-ICL primarily relies on text-driven mechanisms, showing little to no influence from the image modality. (2) When used with an advanced ICL strategy (such as RICES), M-ICL is no better than a simple strategy based on majority voting over context examples. Moreover, we identify several biases and limitations of M-ICL that warrant consideration prior to deployment. Code available at ***/folbaeni/multimodal-icl
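As an illustration of the majority-voting baseline referred to above, the sketch below predicts the most frequent label among the retrieved in-context demonstrations while ignoring the query image entirely; the function name and the (image, text, label) demonstration format are assumptions made for this example, not taken from the paper.

```python
from collections import Counter

def majority_vote_baseline(demonstrations):
    """Predict the most common label among the in-context demonstrations.

    `demonstrations` is a list of (image, text, label) tuples, e.g. the
    examples retrieved by a similarity-based strategy such as RICES.
    The query itself is deliberately ignored, which is what makes this a
    pure context-driven baseline.
    """
    labels = [label for _, _, label in demonstrations]
    return Counter(labels).most_common(1)[0][0]

# Hypothetical usage: four retrieved demonstrations, three labelled "cat".
demos = [(None, "a photo of", "cat"),
         (None, "a photo of", "cat"),
         (None, "a photo of", "dog"),
         (None, "a photo of", "cat")]
print(majority_vote_baseline(demos))  # -> "cat"
```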
This paper discusses strategies for object detection in marine images from the perspective of a practitioner working with real-world, long-tail distributed datasets and a large amount of additional unlabeled data on hand. The paper discusses the benefits of separating the localization and classification stages, making the case for robust localization through the amalgamation of additional datasets, inspired by an approach widely used by practitioners in the camera-trap literature. For the classification stage, the paper compares strategies for using the additional unlabeled data, contrasting supervised, iteratively supervised, self-supervised, and semi-supervised pre-training approaches. Our findings reveal that semi-supervised pre-training, followed by supervised fine-tuning, yields a significantly improved balanced performance across the long-tail distribution, albeit occasionally with a trade-off in overall accuracy. These insights are validated through experiments on two real-world long-tailed underwater datasets collected by the Monterey Bay Aquarium Research Institute (MBARI).
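The abstract does not specify which semi-supervised recipe is used, so the sketch below shows one common possibility (FixMatch-style pseudo-labelling with a confidence threshold) purely as an illustration of combining labelled long-tail data with unlabelled data; the helper name and hyper-parameters are assumptions, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def semi_supervised_step(model, x_lab, y_lab, x_unlab_weak, x_unlab_strong,
                         threshold=0.95, lambda_u=1.0):
    """One FixMatch-style training step: supervised loss on labelled crops
    plus a consistency loss on confidently pseudo-labelled unlabelled crops."""
    # Supervised loss on the labelled long-tail data.
    loss_sup = F.cross_entropy(model(x_lab), y_lab)

    # Pseudo-labels from weakly augmented unlabelled images.
    with torch.no_grad():
        probs = F.softmax(model(x_unlab_weak), dim=1)
        conf, pseudo = probs.max(dim=1)
        mask = (conf >= threshold).float()

    # Consistency loss on strongly augmented views, counted only where confident.
    loss_unsup = (F.cross_entropy(model(x_unlab_strong), pseudo,
                                  reduction="none") * mask).mean()
    return loss_sup + lambda_u * loss_unsup

# Hypothetical usage with a toy classifier and random tensors.
model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 10))
loss = semi_supervised_step(model,
                            torch.randn(8, 3, 32, 32), torch.randint(0, 10, (8,)),
                            torch.randn(16, 3, 32, 32), torch.randn(16, 3, 32, 32))
loss.backward()
```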
Operators devoid of multiplication, such as Shift and Add, have gained prominence for their compatibility with hardware. However, neural networks (NNs) employing these operators typically exhibit lower accuracy compared to conventional NNs with identical structures. ShiftAddAug uses costly multiplication to augment efficient but less powerful multiplication-free operators, improving performance without any inference overhead. It embeds a ShiftAdd tiny NN into a large multiplicative model and trains it as a sub-model to obtain additional supervision. To solve the weight discrepancy problem between hybrid operators, a new weight-sharing method is proposed. Additionally, a novel two-stage neural architecture search is used to obtain better augmentation effects for smaller but stronger multiplication-free tiny neural networks. The superiority of ShiftAddAug is validated through experiments on image classification and semantic segmentation, consistently delivering noteworthy enhancements. Remarkably, it secures up to a 4.95% increase in accuracy on CIFAR-100 compared to its directly trained counterparts, even surpassing the performance of multiplicative NNs.
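For readers unfamiliar with multiplication-free operators, the following sketch shows a DeepShift-style linear layer whose weights are rounded to signed powers of two, so each multiply can be realised in hardware as a sign flip plus a bit shift. This is illustrative background only, not ShiftAddAug's implementation, and it omits the multiplicative augmentation and weight-sharing scheme described above.

```python
import torch
import torch.nn as nn

class ShiftLinear(nn.Module):
    """Linear layer whose weights are quantised to signed powers of two."""

    def __init__(self, in_features, out_features):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.1)
        self.bias = nn.Parameter(torch.zeros(out_features))

    def forward(self, x):
        w = self.weight
        # Round |w| to the nearest power of two, keep the sign.
        shift = torch.round(torch.log2(w.abs().clamp(min=1e-8)))
        w_q = torch.sign(w) * torch.pow(2.0, shift)
        # Straight-through estimator: quantised weights in the forward pass,
        # dense-weight gradients in the backward pass.
        w_ste = w + (w_q - w).detach()
        return nn.functional.linear(x, w_ste, self.bias)

# Hypothetical usage.
layer = ShiftLinear(16, 4)
print(layer(torch.randn(2, 16)).shape)  # torch.Size([2, 4])
```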
The state of the art of many learning tasks, e.g., image classification, is advanced by collecting larger datasets and then training larger models on them. As a result, the increasing computational cost is becoming unaffordable. In this paper, we investigate how to prune large-scale datasets to produce an informative subset for training sophisticated deep models with a negligible performance drop. We propose a simple yet effective dataset pruning method that exploits both prediction uncertainty and training dynamics. We study dataset pruning by measuring the variation of predictions during the whole training process on large-scale datasets, i.e., ImageNet-1K and ImageNet-21K, and advanced models, i.e., Swin Transformer and ConvNeXt. Extensive experimental results indicate that our method outperforms the state of the art and achieves a 25% lossless pruning ratio on both ImageNet-1K and ImageNet-21K. The code and pruned datasets are available at https://***/BAAI-DCAI/Dataset-Pruning.
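A minimal sketch of pruning by prediction variation is given below: each sample is scored by how much its correctness fluctuates across training checkpoints, and the lowest-scoring fraction is dropped. The scoring rule, the `keep_ratio` value, and the function name are simplifications assumed for illustration rather than the paper's exact criterion.

```python
import numpy as np

def prune_by_prediction_variation(correct_history, keep_ratio=0.75):
    """Keep the samples whose predictions flip most during training.

    `correct_history` is a (num_checkpoints, num_samples) boolean array
    recording whether each sample was classified correctly at each recorded
    checkpoint. Samples whose correctness fluctuates the most are treated as
    the most informative; consistently easy (or consistently wrong) samples
    are pruned first.
    """
    # Variance of per-checkpoint correctness is high for samples the model
    # keeps flipping on, and zero for always-right / always-wrong samples.
    scores = correct_history.astype(np.float32).var(axis=0)
    num_keep = int(keep_ratio * correct_history.shape[1])
    keep_idx = np.argsort(-scores)[:num_keep]
    return np.sort(keep_idx)

# Hypothetical usage: 10 checkpoints of a 1000-sample dataset.
history = np.random.rand(10, 1000) > 0.3
subset = prune_by_prediction_variation(history, keep_ratio=0.75)
print(len(subset))  # 750
```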
We introduce a new technique for generating retinal fundus images with anatomically accurate vascular structures, using diffusion models. We generate artery/vein masks to create the vascular structure, on which we then condition to produce retinal fundus images. The proposed method can generate high-quality images with more realistic vascular structures and, owing to the strengths of the diffusion model, can create a diverse range of images. We present quantitative evaluations demonstrating the performance improvement when using our method for data augmentation on vessel segmentation and artery/vein classification. We also present Turing test results from clinical experts, showing that our generated images are difficult to distinguish from real images. We believe our method can be applied to construct stand-alone datasets free of patient privacy concerns.
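One straightforward way to condition a diffusion model on an artery/vein mask is to append the mask as extra input channels of the denoiser, as sketched below. Whether the paper uses channel concatenation or another conditioning mechanism is not stated in the abstract, so treat this as an assumed illustration; the toy denoiser and the noise schedule are placeholders.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def conditioned_diffusion_loss(denoiser, fundus, av_mask, T=1000):
    """One DDPM-style training step where the artery/vein mask is supplied
    as extra input channels, so the denoiser learns to generate fundus
    images consistent with a given vascular structure."""
    b = fundus.size(0)
    t = torch.randint(0, T, (b,), device=fundus.device)
    # Simple linear beta schedule and its cumulative products.
    betas = torch.linspace(1e-4, 0.02, T, device=fundus.device)
    alpha_bar = torch.cumprod(1.0 - betas, dim=0)[t].view(b, 1, 1, 1)
    noise = torch.randn_like(fundus)
    x_t = alpha_bar.sqrt() * fundus + (1 - alpha_bar).sqrt() * noise
    # Concatenate the condition (artery/vein mask) along the channel axis.
    pred = denoiser(torch.cat([x_t, av_mask], dim=1), t)
    return F.mse_loss(pred, noise)

# Hypothetical usage with a toy denoiser that ignores the timestep.
denoiser = lambda x, t: nn.Conv2d(4, 3, 3, padding=1)(x)
loss = conditioned_diffusion_loss(denoiser,
                                  torch.randn(2, 3, 64, 64),
                                  torch.randn(2, 1, 64, 64))
print(float(loss))
```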
This paper presents an innovative approach to multi-view generation that offers comprehensive control over both perspective (viewpoint) and non-perspective attributes (such as depth maps). Our controllable dual-branch pipeline, named Depth Guided Branched Diffusion (DGBD), leverages depth maps and perspective information to generate images from alternative viewpoints while preserving shape and size fidelity. In the first DGBD branch, we fine-tune a pre-trained diffusion model on multi-view data, introducing a regularized batch-aware self-attention mechanism for multi-view consistency and generalization. Direct control over perspective is then achieved through cross-attention conditioned on camera position. Meanwhile, the second DGBD branch introduces non-perspective control using depth maps. Qualitative and quantitative experiments validate the effectiveness of our approach, which surpasses or matches the performance of state-of-the-art novel-view and multi-view synthesis methods.
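One plausible reading of the batch-aware self-attention mentioned above is to fold the view axis into the token axis so that tokens from all views of a scene attend to each other, as in the sketch below. The regularization term and the exact token layout used by DGBD are not given in the abstract, so this is an assumption made for illustration.

```python
import torch
import torch.nn as nn

class BatchAwareSelfAttention(nn.Module):
    """Self-attention that lets tokens from all views of the same scene
    attend to each other, by folding the view axis into the token axis."""

    def __init__(self, dim, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, tokens, num_views):
        # tokens: (batch * num_views, num_tokens, dim)
        bv, n, d = tokens.shape
        b = bv // num_views
        joint = tokens.reshape(b, num_views * n, d)   # join views per scene
        out, _ = self.attn(joint, joint, joint)       # cross-view attention
        return out.reshape(bv, n, d)

# Hypothetical usage: 2 scenes x 4 views, 256 tokens of width 64.
layer = BatchAwareSelfAttention(dim=64)
x = torch.randn(2 * 4, 256, 64)
print(layer(x, num_views=4).shape)  # torch.Size([8, 256, 64])
```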
Identifying robust and accurate correspondences across images is a fundamental problem in computer vision that enables various downstream tasks. Recent semi-dense matching methods emphasize the effectiveness of fusing relevant cross-view information through Transformers. In this paper, we propose several improvements upon this paradigm. Firstly, we introduce affine-based local attention to model cross-view deformations. Secondly, we present selective fusion to merge local and global messages from cross attention. Apart from the network structure, we also identify the importance of enforcing spatial smoothness in the loss design, which has been omitted by previous works. Based on these augmentations, our network demonstrates strong matching capacity under different settings. The full version of our network achieves state-of-the-art performance among semi-dense matching methods at a cost similar to LoFTR, while the slim version reaches the LoFTR baseline's performance with only 15% of the computation cost and 18% of the parameters.
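The abstract only states that spatial smoothness is enforced in the loss; a simple first-order, total-variation style penalty on a dense correspondence field, shown below, illustrates one way such a term can look. The tensor layout and the function are assumptions, not the paper's exact formulation.

```python
import torch

def spatial_smoothness_loss(flow):
    """First-order smoothness penalty on a dense correspondence field.

    `flow` has shape (B, 2, H, W) and stores, for every pixel in image A,
    the 2-D offset of its predicted match in image B. Penalising differences
    between neighbouring offsets encourages locally coherent matches.
    """
    dx = (flow[:, :, :, 1:] - flow[:, :, :, :-1]).abs().mean()
    dy = (flow[:, :, 1:, :] - flow[:, :, :-1, :]).abs().mean()
    return dx + dy

# Hypothetical usage.
flow = torch.randn(1, 2, 60, 80)
print(float(spatial_smoothness_loss(flow)))
```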
This paper outlines the advancements and results of the Fifth Thermal Image Super-Resolution challenge, hosted at the Perception Beyond the Visible Spectrum CVPR 2024 workshop. The challenge employed a novel benchmark cross-spectral dataset consisting of 1000 thermal images, each paired with its corresponding registered RGB image. The challenge featured two tracks: Track-1 focused on Single Thermal Image Super-Resolution with an ×8 upscale factor, while Track-2 extended its evaluation to include both ×8 and ×16 scaling factors, utilizing high-resolution RGB images to guide the super-resolution process for low-resolution thermal images. The participation of over 175 teams highlights the research community’s strong engagement and dedication to enhancing image resolution techniques across both single and cross-spectral methodologies. This year’s challenge sets new benchmarks and provides valuable insights into future directions for research in thermal image super-resolution.
The assessment of rehabilitation exercises for neurological and musculoskeletal disorders is crucial for recovery. Traditionally, assessment methods have been subjective, with inherent uncertainty and limitations. This paper introduces a novel multi-modality dataset named FineRehab to promote the study of rehabilitation movement analysis, leveraging advancements in sensor technology and artificial intelligence. FineRehab collects 16 actions from 50 participants, including both patients with musculoskeletal disorders and healthy individuals, and consists of 4,215 action samples captured by two Kinect cameras and 17 IMUs. To benchmark FineRehab, we present a reliable approach to analyzing rehabilitation exercises and conduct experiments to evaluate comprehensive movement quality across multiple dimensions. Comparative experimental analyses have verified the validity of our dataset in distinguishing between the movements of healthy participants and patients, which can offer a quantifiable basis for personalized rehabilitation feedback. The introduction of FineRehab will encourage researchers to apply, develop, and adapt various methods for rehabilitation exercise analysis.
Event cameras are a new type of vision sensor that incorporates asynchronous and independent pixels, offering advantages over traditional frame-based cameras such as high dynamic range and minimal motion blur. However, their output is not easily understandable by humans, making the reconstruction of intensity images from event streams a fundamental task in event-based vision. While recent deep learning-based methods have shown promise in video reconstruction from events, this problem is not completely solved yet. To facilitate comparison between different approaches, standardized evaluation protocols and diverse test datasets are essential. This paper proposes a unified evaluation methodology and introduces an open-source framework called EVREAL to comprehensively benchmark and analyze various event-based video reconstruction methods from the literature. Using EVREAL, we give a detailed analysis of the state-of-the-art methods for event-based video reconstruction, and provide valuable insights into the performance of these methods under varying settings, challenging scenarios, and downstream tasks.
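As a rough illustration of how a benchmark such as EVREAL scores reconstructions against ground-truth frames, the snippet below averages two standard full-reference metrics over a sequence. EVREAL itself also covers no-reference metrics, challenging scenarios, and downstream tasks; the function here is a hypothetical stand-in rather than its actual API.

```python
import numpy as np
from skimage.metrics import structural_similarity, peak_signal_noise_ratio

def evaluate_reconstruction(recon_frames, gt_frames):
    """Score reconstructed intensity frames against ground truth with
    full-reference metrics, averaged over the sequence."""
    psnr, ssim = [], []
    for rec, gt in zip(recon_frames, gt_frames):
        psnr.append(peak_signal_noise_ratio(gt, rec, data_range=1.0))
        ssim.append(structural_similarity(gt, rec, data_range=1.0))
    return {"PSNR": float(np.mean(psnr)), "SSIM": float(np.mean(ssim))}

# Hypothetical usage with random frames in [0, 1].
gt = [np.random.rand(180, 240).astype(np.float32) for _ in range(5)]
rec = [np.clip(f + 0.05 * np.random.randn(*f.shape), 0, 1).astype(np.float32)
       for f in gt]
print(evaluate_reconstruction(rec, gt))
```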