ISBN: 9798350365474 (digital), 9798350365481 (print)
In this report, we introduce the NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of the 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state of the art in both accuracy and fairness. Through the challenge, the image captioning models were tested using a new evaluation dataset that includes a large variety of visual concepts from many domains. No specific training data was provided for the challenge, so the entries were required to adapt to new types of image descriptions that had not been seen during training. This report includes information on the newly proposed NICE dataset, the evaluation methods, the challenge results, and technical details of the top-ranking entries. We expect that the outcomes of the challenge will contribute to the improvement of AI models on various vision-language tasks.
ISBN: 9798350365474 (digital), 9798350365481 (print)
Neural networks are notorious for being overconfident predictors, posing a significant challenge to their safe deployment in real-world applications. While feature normalization has garnered considerable attention within the deep learning literature, current train-time regularization methods for Out-of-Distribution (OOD) detection are yet to fully exploit this potential. Indeed, the naive incorporation of feature normalization within neural networks does not guarantee substantial improvement in OOD detection performance. In this work, we introduce T2FNorm, a novel approach that transforms features to hyperspherical space during training while employing the non-transformed space for OOD scoring. This method yields a surprising enhancement in OOD detection capabilities without compromising model accuracy on in-distribution (ID) data. Our investigation demonstrates that the proposed technique substantially diminishes the norm of the features of all samples, more so for out-of-distribution samples, thereby addressing the prevalent concern of overconfidence in neural networks. The proposed method also significantly improves various post-hoc OOD detection methods.
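A minimal PyTorch-style sketch of the scheme the abstract describes: normalize features onto the unit hypersphere for the classifier during training, but score OOD-ness on the raw feature norm at test time. The backbone split, feature_dim, temperature tau, and the norm-based score are illustrative assumptions, not the authors' exact implementation.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class T2FNormSketch(nn.Module):
        """Train on hyperspherical features; score OOD on raw feature norms."""

        def __init__(self, backbone: nn.Module, feature_dim: int,
                     num_classes: int, tau: float = 0.1):
            super().__init__()
            self.backbone = backbone                 # any feature extractor
            self.fc = nn.Linear(feature_dim, num_classes)
            self.tau = tau                           # temperature (assumed)

        def forward(self, x):
            z = self.backbone(x)                     # raw (non-transformed) features
            z_sphere = F.normalize(z, dim=-1)        # hyperspherical transform, train-time
            logits = self.fc(z_sphere) / self.tau
            return logits, z

        @torch.no_grad()
        def ood_score(self, x):
            # Under this training scheme, OOD samples end up with smaller
            # feature norms, so the raw norm serves as an ID-ness score.
            _, z = self.forward(x)
            return z.norm(dim=-1)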
ISBN: 9798350365474 (digital), 9798350365481 (print)
The Segment Anything Model (SAM) and CLIP are remarkable vision foundation models (VFMs). SAM, a prompt-driven segmentation model, excels in segmentation tasks across diverse domains, while CLIP is renowned for its zero-shot recognition capabilities. However, their unified potential has not yet been explored in medical image segmentation. To adapt SAM to medical imaging, existing methods primarily rely on tuning strategies that require extensive data or prior prompts tailored to the specific task, making it particularly challenging when only a limited number of data samples are available. This work presents an in-depth exploration of integrating SAM and CLIP into a unified framework for medical image segmentation. Specifically, we propose a simple unified framework, SaLIP, for organ segmentation. Initially, SAM is used for part-based segmentation within the image, followed by CLIP to retrieve the mask corresponding to the region of interest (ROI) from the pool of SAM's generated masks. Finally, SAM is prompted with the retrieved ROI to segment the specific organ. Thus, SaLIP is training- and fine-tuning-free and does not rely on domain expertise or labeled data for prompt engineering. Our method shows substantial enhancements in zero-shot segmentation, with notable improvements in DICE scores across diverse segmentation tasks such as brain (63.46%), lung (50.11%), and fetal head (30.82%) segmentation, when compared to un-prompted SAM. Code and text prompts are available at SaLIP.
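A hedged sketch of the three-stage pipeline described above, using the public segment-anything and CLIP packages. The checkpoint path, input file, and text prompt are assumptions for illustration; the paper's exact prompt design may differ.

    import numpy as np
    import torch
    import clip
    from PIL import Image
    from segment_anything import sam_model_registry, SamAutomaticMaskGenerator, SamPredictor

    sam = sam_model_registry["vit_b"](checkpoint="sam_vit_b.pth")   # assumed path
    clip_model, clip_preprocess = clip.load("ViT-B/32", device="cpu")

    image = np.array(Image.open("scan.png").convert("RGB"))         # assumed input

    # 1) SAM: unprompted, part-based segmentation of the whole image.
    masks = SamAutomaticMaskGenerator(sam).generate(image)

    # 2) CLIP: score each mask crop against a text description of the ROI.
    text = clip.tokenize(["a photo of a lung"])                     # assumed prompt
    scores = []
    with torch.no_grad():
        text_feat = clip_model.encode_text(text)
        for m in masks:
            x, y, w, h = [int(v) for v in m["bbox"]]                # XYWH boxes
            crop = clip_preprocess(Image.fromarray(image[y:y+h, x:x+w])).unsqueeze(0)
            img_feat = clip_model.encode_image(crop)
            scores.append(torch.cosine_similarity(img_feat, text_feat).item())
    best = masks[int(np.argmax(scores))]

    # 3) SAM again: prompt with the retrieved ROI's box to segment the organ.
    predictor = SamPredictor(sam)
    predictor.set_image(image)
    x, y, w, h = best["bbox"]
    organ_mask, _, _ = predictor.predict(box=np.array([x, y, x + w, y + h]))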
ISBN: 9798350365474 (digital), 9798350365481 (print)
Robots paired with computer vision are widely used in precision agriculture. Simulations are critical for safety and performance estimation, verifying robot routines in a virtual world before real-world testing and deployment. However, many simulators used for agricultural robots lack photorealism in their virtual worlds compared to the real world. We used Unreal Engine 5 (UE5) and the Robot Operating System (ROS) to develop a robot simulator tailored to agricultural tasks and synthetic data generation with RGB, segmentation, and depth images. We designed a method for assigning multiple segmentation labels within a single plant mesh. We experimented with a semi-spherical routine for two robot arms to perform 3D point cloud reconstruction across 10 plant assets. We show that our simulator produces much more accurate segmentation images and reconstructions than existing UE5 solutions. We extend our results with Neural Radiance Field (NeRF) reconstructions. The packaged simulator, UE5 project, and ROS package with the Python routine can be found at https://***/NCSU-BAE-ARLab/AgriRoboSimUE5.
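A small NumPy sketch of what a semi-spherical viewpoint routine of this kind might look like: sample camera positions on a hemisphere around a plant and aim each camera at the plant's center. The function, parameters, and angle ranges are hypothetical; the paper's routine drives two robot arms through UE5/ROS.

    import numpy as np

    def hemisphere_viewpoints(center, radius, n_azimuth=12, n_elevation=4):
        """Yield (position, look_at) pairs covering the upper hemisphere."""
        for el in np.linspace(np.deg2rad(15), np.deg2rad(75), n_elevation):
            for az in np.linspace(0.0, 2 * np.pi, n_azimuth, endpoint=False):
                offset = radius * np.array([
                    np.cos(el) * np.cos(az),
                    np.cos(el) * np.sin(az),
                    np.sin(el),
                ])
                yield np.asarray(center) + offset, np.asarray(center)

    for position, target in hemisphere_viewpoints(center=[0.0, 0.0, 0.3], radius=0.6):
        pass  # send the pose to the arm controller and capture RGB/depth here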
In this paper we propose a novel, highly flexible camera. The camera consists of an image detector and a special aperture, but no lens. The aperture is a set of parallel light attenuating layers whose transmittances are controllable in space and time. By applying different transmittance patterns to this aperture, it is possible to modulate the incoming light in useful ways and capture images that are impossible to capture with conventional lens-based cameras. For example, the camera can pan and tilt its field of view without the use of any moving parts. It can also capture disjoint regions of interest in the scene without having to capture the regions in between them. In addition, the camera can be used as a computational sensor, where the detector measures the end result of computations performed by the attenuating layers on the scene radiance values. These and other imaging functionalities can be implemented with the same physical camera and the functionalities can be switched from one video frame to the next via software. We have built a prototype camera based on this approach using a bare image detector and a liquid crystal modulator for the aperture. We discuss in detail the merits and limitations of lensless imaging using controllable apertures.
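A toy NumPy sketch of the "computational sensor" idea described above: with a lensless, controllable aperture, each detector reading can be modeled as a linear functional of scene radiance, so a sequence of transmittance patterns implements a matrix-vector product optically. All dimensions and values are illustrative.

    import numpy as np

    n_scene = 64          # scene radiance samples (flattened)
    n_patterns = 64       # transmittance patterns applied over time

    scene = np.random.rand(n_scene)            # unknown scene radiance
    A = np.random.rand(n_patterns, n_scene)    # one transmittance pattern per row

    measurements = A @ scene                   # what the bare detector records

    # If A is well-conditioned, the scene can be recovered computationally
    # from the modulated measurements.
    recovered = np.linalg.solve(A, measurements)
    assert np.allclose(recovered, scene)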
Face motion is the sum of rigid motion related to face pose and non-rigid motion related to facial expression. The two motions are coupled in the captured image, so they cannot easily be recovered from the image directly. In this paper, a novel technique is proposed to recover 3D face pose and facial expression simultaneously from a monocular video sequence in real time. First, twenty-eight salient facial features are detected and tracked robustly under various face orientations and facial expressions. Second, after modelling the coupling between face pose and facial expression in the 2D image as a nonlinear function, a normalized SVD (N-SVD) decomposition technique is proposed to recover the pose and expression parameters analytically. A nonlinear technique is subsequently utilized to refine the solution obtained from the N-SVD technique by imposing the orthonormality constraint on the pose parameters. Compared to the original SVD technique proposed in [1], which is very sensitive to image noise and numerically unstable in practice, the proposed method recovers the face pose and facial expression robustly and accurately. Finally, the performance of the proposed technique is evaluated in experiments using both synthetic and real image sequences.
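A hedged sketch of the orthonormality-constraint step mentioned above: project a noisy pose (rotation) estimate onto the nearest orthonormal matrix via SVD. This illustrates only the constraint; the paper's N-SVD decomposition of the coupled pose/expression parameters is more involved.

    import numpy as np

    def nearest_orthonormal(R_est: np.ndarray) -> np.ndarray:
        """Return the orthonormal matrix closest to R_est in Frobenius norm."""
        U, _, Vt = np.linalg.svd(R_est)
        R = U @ Vt
        if np.linalg.det(R) < 0:      # keep it a proper rotation
            U[:, -1] *= -1
            R = U @ Vt
        return R

    noisy = np.eye(3) + 0.1 * np.random.randn(3, 3)   # noisy pose estimate
    R = nearest_orthonormal(noisy)
    assert np.allclose(R @ R.T, np.eye(3), atol=1e-8)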
ISBN: 9798350365474 (digital), 9798350365481 (print)
To compete with existing mobile architectures, MobileViG introduced Sparse Vision Graph Attention (SVGA), a fast token-mixing operator based on the principles of GNNs. However, MobileViG scales poorly with model size, falling up to 1% behind models of similar latency. This paper introduces Mobile Graph Convolution (MGC), a new vision graph neural network (ViG) module that solves this scaling problem. Our proposed mobile vision architecture, MobileViGv2, uses MGC to demonstrate the effectiveness of our approach. MGC improves on SVGA by increasing graph sparsity and introducing conditional positional encodings to the graph operation. Our smallest model, MobileViGv2-Ti, achieves 77.7% top-1 accuracy on ImageNet-1K, 2% higher than MobileViG-Ti, with 0.9 ms inference latency on the iPhone 13 Mini NPU. Our largest model, MobileViGv2-B, achieves 83.4% top-1 accuracy, 0.8% higher than MobileViG-B, with 2.7 ms inference latency. Besides image classification, we show that MobileViGv2 generalizes well to other tasks. For object detection and instance segmentation on MS COCO 2017, MobileViGv2-M outperforms MobileViG-M by 1.2 AP^box and 0.7 AP^mask, and MobileViGv2-B outperforms MobileViG-B by 1.0 AP^box and 0.7 AP^mask. For semantic segmentation on ADE20K, MobileViGv2-M achieves 42.9% mIoU and MobileViGv2-B achieves 44.3% mIoU.
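A brief PyTorch sketch of a conditional positional encoding of the kind MGC adds to the graph operation: a depthwise convolution whose output is added back to the token grid, so positional information depends on local content. Layer sizes are illustrative, not MobileViGv2's exact configuration.

    import torch
    import torch.nn as nn

    class ConditionalPosEnc(nn.Module):
        def __init__(self, dim: int, kernel_size: int = 3):
            super().__init__()
            # Depthwise conv: each channel sees only its own spatial neighborhood.
            self.dwconv = nn.Conv2d(dim, dim, kernel_size,
                                    padding=kernel_size // 2, groups=dim)

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            # x: (batch, channels, height, width) token grid
            return x + self.dwconv(x)

    tokens = torch.randn(1, 64, 14, 14)
    out = ConditionalPosEnc(dim=64)(tokens)
    assert out.shape == tokens.shape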
ISBN: 9798350365474 (digital), 9798350365481 (print)
Existing fine-grained hashing methods typically lack code interpretability, as they compute hash code bits holistically using both global and local features. To address this limitation, we propose ConceptHash, a novel method that achieves sub-code-level interpretability. In ConceptHash, each sub-code corresponds to a human-understandable concept, such as an object part, and these concepts are discovered automatically without human annotations. Specifically, we leverage a Vision Transformer architecture and introduce concept tokens as visual prompts, alongside the image patch tokens, as model inputs. Each concept is then mapped to a specific sub-code at the model output, providing natural sub-code interpretability. To capture subtle visual differences among highly similar sub-categories (e.g., bird species), we incorporate language guidance to ensure that the learned hash codes are distinguishable within fine-grained object classes while maintaining semantic alignment. This approach allows us to develop hash codes that exhibit similarity within families of species while remaining distinct from species in other families. Extensive experiments on four fine-grained image retrieval benchmarks demonstrate that ConceptHash outperforms previous methods by a significant margin, offering unique sub-code interpretability as an additional benefit. Code at: https://***/kamwoh/concepthash.
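A minimal PyTorch sketch of the concept-token idea: learnable concept tokens are prepended to the patch tokens, and each concept's output embedding is mapped to one sub-code of the hash. The encoder choice, token counts, bit widths, and tanh relaxation are assumptions for illustration, not ConceptHash's exact architecture or training losses.

    import torch
    import torch.nn as nn

    class ConceptTokenHasher(nn.Module):
        def __init__(self, encoder: nn.Module, dim: int,
                     num_concepts: int = 4, bits_per_concept: int = 12):
            super().__init__()
            self.concept_tokens = nn.Parameter(torch.randn(1, num_concepts, dim))
            self.encoder = encoder                   # any token-sequence encoder
            self.to_subcode = nn.Linear(dim, bits_per_concept)

        def forward(self, patch_tokens: torch.Tensor) -> torch.Tensor:
            # patch_tokens: (batch, num_patches, dim)
            b = patch_tokens.shape[0]
            concepts = self.concept_tokens.expand(b, -1, -1)
            x = self.encoder(torch.cat([concepts, patch_tokens], dim=1))
            concept_out = x[:, : concepts.shape[1]]  # one embedding per concept
            # tanh relaxes the binary code for training; sign() binarizes at test.
            subcodes = torch.tanh(self.to_subcode(concept_out))
            return subcodes.flatten(1)               # (batch, num_concepts * bits)

    enc = nn.TransformerEncoder(
        nn.TransformerEncoderLayer(d_model=64, nhead=4, batch_first=True),
        num_layers=2)
    codes = ConceptTokenHasher(enc, dim=64)(torch.randn(2, 196, 64))  # (2, 48)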
Video surveillance applications such as smart rooms and security systems are prevalent nowadays. Camera calibration information (e.g., camera position, orientation, and focal length) is very useful for such surveillance systems because it can provide scene knowledge and limit the search space for object detection and tracking. In this paper, we describe a camera calibration tool based on vanishing points that does not require any calibration object or specific geometric objects. In urban environments, vanishing points are easily obtainable, since many parallel lines, such as street lines, light poles, and building edges, exist in both outdoor and indoor scene images. Experimental results from various surveillance cameras are presented.
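A sketch of the classic calibration step such vanishing-point methods rely on: given the two vanishing points u and v of orthogonal scene directions, and assuming zero skew, square pixels, and the principal point p at the image center, the orthogonality constraint (u - p) . (v - p) + f^2 = 0 yields the focal length. The specific coordinates below are invented for illustration.

    import numpy as np

    def focal_from_orthogonal_vps(u, v, principal_point):
        u, v, p = map(np.asarray, (u, v, principal_point))
        d = np.dot(u - p, v - p)
        if d >= 0:
            raise ValueError("vanishing points inconsistent with orthogonal directions")
        return float(np.sqrt(-d))

    # Example: a 640x480 image with the principal point at its center.
    f = focal_from_orthogonal_vps(u=(900.0, 240.0), v=(-150.0, 260.0),
                                  principal_point=(320.0, 240.0))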
We present a novel multiscale approach that combines segmentation with classification to detect abnormal brain structures in medical imagery, and demonstrate its utility in detecting multiple sclerosis lesions in 3D MRI data. Our method uses segmentation to obtain a hierarchical decomposition of a multi-channel, anisotropic MRI scan. It then produces a rich set of features describing the segments in terms of intensity, shape, location, and neighborhood relations. These features are fed into a decision-tree-based classifier, trained with data labeled by experts, enabling the detection of lesions at all scales. Unlike common approaches that use voxel-by-voxel analysis, our system can utilize regional properties that are often important for characterizing abnormal brain structures. We provide experiments showing successful detection of lesions in both simulated and real MR images.
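A compact sketch of the regions-plus-classifier idea: describe each segment with region-level features and train a decision tree on expert labels. The feature names and random data below are placeholders, not the paper's actual feature set or training data.

    import numpy as np
    from sklearn.tree import DecisionTreeClassifier

    rng = np.random.default_rng(0)

    # One row per segment: [mean intensity, volume, shape compactness, location]
    X = rng.random((200, 4))
    y = rng.integers(0, 2, 200)        # expert label: 1 = lesion, 0 = normal

    clf = DecisionTreeClassifier(max_depth=5).fit(X, y)
    lesion_prob = clf.predict_proba(X)[:, 1]   # per-segment lesion score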