Colorectal polyps are prevalent precursors to colorectal cancer, making their accurate characterization essential for timely intervention and patient outcomes. Deep learning-based computer-aided diagnosis (CADx) systems have shown promising performance in the automated detection and categorization of colorectal polyps (CRPs) using endoscopic images. However, alongside advances in diagnostic accuracy, the need for reliable and accurate uncertainty estimates within these systems has become increasingly important. The primary focus of this study is on improving the reliability of computer-aided diagnosis of CRPs in clinical practice. We investigate widely used model calibration techniques and how they translate to clinical applications, specifically for CRP categorization data. The experiments reveal that the Variational Inference method excels in intra-dataset calibration but lacks efficiency and inter-dataset generalization. Laplace approximation and temperature scaling methods offer improved calibration across datasets.
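
Of the calibration methods compared above, temperature scaling is the simplest to reproduce. Below is a minimal sketch (not the authors' code) that fits a single scalar temperature on held-out validation logits by minimizing the negative log-likelihood, which is the standard recipe for this technique:

```python
import torch
import torch.nn as nn

def fit_temperature(val_logits, val_labels, max_iter=50):
    """Fit a single scalar temperature T on held-out validation logits
    by minimizing the NLL of softmax(logits / T)."""
    temperature = nn.Parameter(torch.ones(1))
    nll = nn.CrossEntropyLoss()
    optimizer = torch.optim.LBFGS([temperature], lr=0.01, max_iter=max_iter)

    def closure():
        optimizer.zero_grad()
        loss = nll(val_logits / temperature, val_labels)
        loss.backward()
        return loss

    optimizer.step(closure)
    return temperature.item()

# At test time, calibrated probabilities are softmax(test_logits / T).
```

Because only one parameter is fitted, temperature scaling leaves the model's predicted class ranking unchanged; it only reshapes the confidence distribution, which is consistent with its good cross-dataset behavior reported above.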
We analyze various factors affecting the proper functioning of MIA and MINT, two research lines aimed at detecting the data used to train a model. The difference between these lines lies in the environmental conditions, while the fundamental bases are similar for both. As evident in the literature, this detection task is far from straightforward and poses an ongoing challenge for the scientific community. Specifically, in this work, we conclude that factors such as the number of times data passes through the original network, the loss function, or dropout significantly impact detection outcomes. Therefore, it is crucial to consider them when developing these methods and during the training of any neural network, both to avoid (MIA) and to enhance (MINT) this detection. We evaluate the AdaFace facial recognition model using five databases with over 22 million images, modifying the different factors under analysis and defining a suitable protocol for their examination. State-of-the-art accuracy of up to 87% is achieved, surpassing existing methods.
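
For context, a common baseline in this line of work (not the method evaluated above) is a simple loss-threshold test: samples the model was trained on tend to incur lower loss than unseen samples, and the number of training passes directly sharpens this gap. A sketch, assuming per-sample losses have already been computed:

```python
import numpy as np

def loss_threshold_detection(member_losses, nonmember_losses):
    """Baseline membership detection: samples with low loss are
    predicted to be training members. Returns the best accuracy
    over a sweep of candidate thresholds."""
    losses = np.concatenate([member_losses, nonmember_losses])
    labels = np.concatenate([np.ones_like(member_losses),
                             np.zeros_like(nonmember_losses)])
    best = 0.0
    for t in np.quantile(losses, np.linspace(0.0, 1.0, 101)):
        preds = (losses < t).astype(float)  # low loss -> predicted member
        best = max(best, float((preds == labels).mean()))
    return best
```

Factors like dropout or a different loss function change the shape of the member/non-member loss distributions, which is why they matter both for hindering (MIA) and strengthening (MINT) detection.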
The inherent complexity and uncertainty of Machine Learning (ML) make it difficult for ML-based computer vision (CV) approaches to become prevalent in safety-critical domains like autonomous driving, despite their high performance. A crucial challenge in these domains is the safety assurance of ML-based systems. To address this, recent safety standardization in the automotive domain has introduced an ML safety lifecycle following an iterative development process. While this approach facilitates safety assurance, its iterative nature requires frequent adaptation and optimization of the ML function, which might include costly retraining of the ML model and is not guaranteed to converge to a safe AI solution. In this paper, we propose a modular ML approach which allows for more efficient and targeted measures for each of the modules and process steps. Each module of the modular concept model represents one visual concept and is aggregated with the other modules' outputs into a task output. The design choices of a modular concept model can be categorized into the selection of the concept modules, the aggregation of their outputs, and the training of the concept modules. Using the example of traffic sign classification, we present each step of the involved design choices and the corresponding targeted measures to take in an iterative development process for engineering safe AI.
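
A minimal sketch of what such a modular concept model could look like, assuming a shared feature backbone; the concept names and dimensions are hypothetical, and the paper's actual design choices may differ:

```python
import torch
import torch.nn as nn

class ModularConceptClassifier(nn.Module):
    """Sketch of a modular concept model: one small head per visual
    concept (e.g. sign shape, border color, pictogram), whose outputs
    are aggregated into the final task prediction."""
    def __init__(self, backbone_dim, concept_dims, num_classes):
        super().__init__()
        # one independently trainable head per concept
        self.concept_heads = nn.ModuleList(
            nn.Linear(backbone_dim, d) for d in concept_dims
        )
        # aggregation: here a simple linear layer over concatenated outputs
        self.aggregator = nn.Linear(sum(concept_dims), num_classes)

    def forward(self, features):
        concept_outputs = [head(features) for head in self.concept_heads]
        return self.aggregator(torch.cat(concept_outputs, dim=-1))
```

Because each concept head is a separate module, a targeted measure in one iteration (say, retraining only the color-concept head on newly collected data) leaves the remaining modules and their safety evidence untouched.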
Monitoring dietary intake is a crucial aspect of promoting healthy living. In recent years, advances in computer vision technology have facilitated dietary intake monitoring through the use of images and depth cameras. However, current state-of-the-art image-based food portion estimation algorithms assume that users take images of their meals one or two times, which can be inconvenient and fail to capture food items that are not visible from a top-down perspective, such as ingredients submerged in a stew. To address these limitations, we introduce an innovative solution that utilizes stationary user-facing cameras to track food items on utensils, requiring no change of camera perspective after installation. The shallow depth of utensils provides a more favorable angle for capturing food items, and tracking them on the utensil's surface offers a significantly more accurate estimation of dietary intake without the need for post-meal image capture. The system reliably estimates the nutritional content of liquid-solid heterogeneous mixtures such as soups and stews. Through a series of experiments, we demonstrate the exceptional potential of our method as a non-invasive, user-friendly, and highly accurate dietary intake monitoring tool.
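
As an illustration of the tracking idea, here is a sketch of a per-frame accumulation loop; the two helper functions are hypothetical stand-ins for the detection and volume-estimation components, not the authors' implementation:

```python
def estimate_intake_ml(frames, detect_food_on_utensil, estimate_volume_ml):
    """Accumulate dietary intake from a stationary user-facing camera.
    detect_food_on_utensil(frame) -> detection or None (hypothetical)
    estimate_volume_ml(detection) -> float volume in ml (hypothetical)
    A bite is counted when a loaded utensil leaves the camera view."""
    total_ml = 0.0
    loaded_volume = 0.0
    for frame in frames:
        detection = detect_food_on_utensil(frame)
        if detection is not None:
            # utensil visible: keep the latest volume estimate
            loaded_volume = estimate_volume_ml(detection)
        elif loaded_volume > 0.0:
            # utensil left the view while loaded: count one bite
            total_ml += loaded_volume
            loaded_volume = 0.0
    return total_ml
```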
Human faces encode a vast amount of information, including not only uniquely distinctive features of the individual but also demographic information such as a person's age, gender, and weight. Such information is referred to as soft biometrics: physical, behavioral, or adhered human characteristics, classifiable in pre-defined human-compliant categories. As the saying goes, 'one look is worth a thousand words.' Vision Transformers have emerged as a powerful deep learning architecture able to achieve accurate classifications for different computer vision tasks, but these models have not yet been applied to soft biometrics. In this work, we propose the Bidirectional Encoder Face representation from image Transformers (BEFiT), a model that leverages multi-attention mechanisms to capture local and global features and produce a multi-purpose face embedding. This unique embedding enables the estimation of different demographics without having to re-train the model for each soft-biometric trait, ensuring high efficiency without compromising accuracy. Our approach explores the use of visible and thermal images to achieve powerful face embeddings in different light spectra. We demonstrate that the BEFiT embeddings can capture essential information for gender, age, and weight estimation, surpassing the performance of dedicated deep learning structures for the estimation of a single soft-biometric trait. The code of the BEFiT implementation is publicly available.
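
The single multi-purpose embedding is the key efficiency argument: one backbone forward pass serves several soft-biometric estimators. A sketch of how per-trait heads could sit on top of a shared, frozen embedding (head types and dimensions are assumptions, not the BEFiT architecture):

```python
import torch.nn as nn

class MultiTraitHeads(nn.Module):
    """Lightweight per-trait heads over one shared face embedding:
    gender as classification, age and weight as regression. Only the
    heads are trained; the embedding backbone stays fixed."""
    def __init__(self, embed_dim):
        super().__init__()
        self.gender = nn.Linear(embed_dim, 2)   # two-class logits
        self.age = nn.Linear(embed_dim, 1)      # years (regression)
        self.weight = nn.Linear(embed_dim, 1)   # kg (regression)

    def forward(self, embedding):
        return {
            "gender": self.gender(embedding),
            "age": self.age(embedding),
            "weight": self.weight(embedding),
        }
```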
Resource-constrained hardware, such as edge devices or cell phones, often relies on cloud servers to provide the required computational resources for inference in deep vision models. However, transferring image and video data from an edge or mobile device to a cloud server requires coding to deal with network constraints. The use of standardized codecs, such as JPEG or H.264, is prevalent and required to ensure interoperability. This paper aims to examine the implications of employing standardized codecs within deep vision pipelines. We find that using JPEG and H.264 coding significantly deteriorates accuracy across a broad range of vision tasks and models. For instance, strong compression rates reduce semantic segmentation accuracy by more than 80% in mIoU. In contrast to previous findings, our analysis extends beyond image and action classification to localization and dense prediction tasks, thus providing a more comprehensive perspective.
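
The degradation measurement is straightforward to reproduce for JPEG: compress each input at a given quality factor, decode it, and run the unchanged model on the result. A minimal round-trip helper using Pillow (a sketch, not the paper's evaluation pipeline):

```python
import io
from PIL import Image

def jpeg_roundtrip(image: Image.Image, quality: int) -> Image.Image:
    """Encode and decode an image with JPEG at the given quality,
    standing in for network-constrained transmission to the cloud."""
    buf = io.BytesIO()
    image.save(buf, format="JPEG", quality=quality)
    buf.seek(0)
    return Image.open(buf).convert("RGB")

# Sweep quality factors and compare task metrics on compressed inputs:
# for q in (90, 50, 10):
#     degraded = jpeg_roundtrip(img, q)
#     ...evaluate the unchanged vision model on `degraded`...
```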
We study a limited label problem and present a novel approach to Single-Positive Multi-label Learning. In the multi-label learning setting, a model learns to predict multiple labels or categories for a single input image. This contrasts with standard multi-class image classification, where the task is to predict a single label from many possible labels for an image. Single-Positive Multi-label Learning specifically considers learning to predict multiple labels when there is only one annotation per image in the training data. Multi-label learning is a more natural task than single-label learning because real-world data often involves instances belonging to multiple categories simultaneously; however, most computer vision datasets contain single labels due to the inherent complexity and cost of collecting multiple high-quality annotations per image. We propose a novel approach called Vision-Language Pseudo-Labeling, which uses a vision-language model, CLIP, to suggest strong positive and negative pseudo-labels. Experimental results demonstrate the effectiveness of the proposed approach. Our code and data will be made publicly available at https://***/mvrl/VLPL.
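
A sketch of how CLIP similarities can be turned into positive and negative pseudo-labels for this setting; the prompt template and thresholds below are hypothetical illustrations, not the VLPL method's actual values:

```python
import clip
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

@torch.no_grad()
def suggest_pseudo_labels(image, class_names, pos_thresh=0.3, neg_thresh=0.01):
    """Rank all candidate classes by CLIP image-text similarity and
    split them into strong positives and strong negatives."""
    image_feat = model.encode_image(preprocess(image).unsqueeze(0).to(device))
    text = clip.tokenize([f"a photo of a {c}" for c in class_names]).to(device)
    text_feat = model.encode_text(text)
    image_feat = image_feat / image_feat.norm(dim=-1, keepdim=True)
    text_feat = text_feat / text_feat.norm(dim=-1, keepdim=True)
    probs = (100.0 * image_feat @ text_feat.T).softmax(dim=-1).squeeze(0)
    probs = probs.tolist()
    positives = [c for c, p in zip(class_names, probs) if p > pos_thresh]
    negatives = [c for c, p in zip(class_names, probs) if p < neg_thresh]
    return positives, negatives
```

The single annotated positive can then be combined with these suggested labels to supervise a multi-label classifier despite the one-annotation-per-image constraint.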
We propose a content-based system for matching video and background music. The system aims to address the challenges of music recommendation for new users or new music given short-form videos. To this end, we propose a cross-modal framework, VMCML (Video and Music Matching via Cross-Modality Lifting), that finds a shared embedding space between video and music representations. To ensure the embedding space can be effectively shared by both representations, we leverage the CosFace loss, a margin-based cosine similarity loss. Furthermore, to address the limitations of previous datasets, we collect videos and music from a well-known multimedia platform, ensuring that the music is not the original audio of the video and that more than one video is matched to the same music track. We establish a large-scale dataset called MSV, which provides 390 individual music tracks and the corresponding 150,000 matched videos. We conduct extensive experiments on the YouTube-8M and MSV datasets. Our quantitative and qualitative results demonstrate the effectiveness of our proposed framework, which achieves state-of-the-art video and music matching performance.
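
The CosFace loss mentioned above has a compact form: the cosine similarity to the target class is reduced by a margin m before the scaled softmax, tightening each class's embeddings. A standard implementation sketch (not the VMCML code; s and m use commonly cited defaults):

```python
import torch
import torch.nn.functional as F

def cosface_loss(embeddings, targets, class_weight, s=64.0, m=0.35):
    """Large-margin cosine loss (CosFace): subtract margin m from the
    target-class cosine similarity, then apply a scaled softmax."""
    emb = F.normalize(embeddings, dim=1)
    w = F.normalize(class_weight, dim=1)
    cos = emb @ w.t()                          # (batch, num_classes)
    one_hot = F.one_hot(targets, cos.size(1)).float()
    logits = s * (cos - m * one_hot)           # margin on target class only
    return F.cross_entropy(logits, targets)
```

Applied to a shared video/music embedding space, the margin forces matched pairs to be separated from non-matching tracks by at least m in cosine similarity.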
Combining Convolutional Neural Networks (CNNs) or Vision Transformers (ViTs) with Recurrent Neural Networks (RNNs) for spatiotemporal forecasting has yielded unparalleled results in predicting temporal and spatial dynamics. However, modeling extensive global information remains a formidable challenge; CNNs are limited by their narrow receptive fields, and ViTs struggle with the intensive computational demands of their attention mechanisms. The emergence of recent Mamba-based architectures has been met with enthusiasm for their exceptional long-sequence modeling capabilities, surpassing established vision models in efficiency and accuracy, which motivates us to develop an innovative architecture tailored for spatiotemporal forecasting. In this paper, we propose the VMRNN cell, a new recurrent unit that integrates the strengths of Vision Mamba blocks with LSTM. We construct a network centered on VMRNN cells to tackle spatiotemporal prediction tasks effectively. Our extensive evaluations show that our proposed approach secures competitive results on a variety of tasks while maintaining a smaller model size. Our code is available at https://***/yyyujintang/VMRNN-PyTorch.
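
A rough sketch of the gating idea, with the Mamba-style block abstracted as an injected sequence mixer; this illustrates the general pattern of driving LSTM gates with globally mixed features, not the authors' VMRNN implementation:

```python
import torch
import torch.nn as nn

class RecurrentMixerCell(nn.Module):
    """Illustrative recurrent cell: a sequence-mixing block (e.g. a
    Vision Mamba block supplied by the caller) produces globally mixed
    features that drive standard LSTM gating over the hidden state."""
    def __init__(self, dim, mixer: nn.Module):
        super().__init__()
        self.mixer = mixer                       # spatial/sequence mixing
        self.gates = nn.Linear(2 * dim, 4 * dim)

    def forward(self, x, state):
        h, c = state                             # hidden and cell state
        mixed = self.mixer(x)                    # global token mixing
        gate_in = torch.cat([mixed, h], dim=-1)
        i, f, g, o = self.gates(gate_in).chunk(4, dim=-1)
        c = torch.sigmoid(f) * c + torch.sigmoid(i) * torch.tanh(g)
        h = torch.sigmoid(o) * torch.tanh(c)
        return h, (h, c)
```

The appeal of a Mamba-style mixer here is its linear-time scan over tokens, avoiding both the narrow receptive field of convolutions and the quadratic cost of attention.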
In the last few years, research interest in Vision-and-Language Navigation (VLN) has grown significantly. VLN is a challenging task that involves an agent following human instructions and navigating in a previously unknown environment to reach a specified goal. Recent work in the literature focuses on different ways to augment the available datasets of instructions for improving navigation performance by exploiting synthetic training data. In this work, we propose AIGeN, a novel architecture inspired by Generative Adversarial Networks (GANs) that produces meaningful and well-formed synthetic instructions to improve navigation agents' performance. The model is composed of a Transformer decoder (GPT-2) and a Transformer encoder (BERT). During the training phase, the decoder generates sentences for a sequence of images describing the agent's path to a particular point, while the encoder discriminates between real and fake instructions. Experimentally, we evaluate the quality of the generated instructions and perform extensive ablation studies. Additionally, we generate synthetic instructions for 217K trajectories using AIGeN on the Habitat-Matterport 3D Dataset (HM3D) and show an improvement in the performance of an off-the-shelf VLN method. The validation analysis is conducted on REVERIE and R2R and highlights the promising aspects of our proposal, achieving state-of-the-art performance.
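
A sketch of the two components described above using Hugging Face Transformers; the conditioning on the visual path and the adversarial training loop are omitted, so this only illustrates the generator/discriminator pairing, not the AIGeN training procedure:

```python
from transformers import (GPT2LMHeadModel, GPT2Tokenizer,
                          BertForSequenceClassification, BertTokenizer)

# Decoder: generates candidate navigation instructions.
gen = GPT2LMHeadModel.from_pretrained("gpt2")
gen_tok = GPT2Tokenizer.from_pretrained("gpt2")

# Encoder: discriminates real instructions from generated ones.
disc = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)
disc_tok = BertTokenizer.from_pretrained("bert-base-uncased")

def generate_instruction(prompt, max_new_tokens=40):
    """Sample a synthetic instruction continuation from the decoder."""
    ids = gen_tok(prompt, return_tensors="pt").input_ids
    out = gen.generate(ids, max_new_tokens=max_new_tokens, do_sample=True,
                       pad_token_id=gen_tok.eos_token_id)
    return gen_tok.decode(out[0], skip_special_tokens=True)

def discriminator_score(instruction):
    """Probability that an instruction is real, per the BERT critic."""
    inputs = disc_tok(instruction, return_tensors="pt", truncation=True)
    return disc(**inputs).logits.softmax(-1)[0, 1].item()
```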