ISBN: (Print) 9798350365474
Semantic segmentation is a key task within applications of machine learning for medical imaging, requiring large amounts of medical scans annotated by clinicians. The high cost of data annotation means that models need to make the most of all available ground truth masks; yet many models consider two false positive (or false negative) pixel predictions as 'equally wrong' regardless of the individual pixels' relative position to the ground truth mask. These methods also have no sense of whether a pixel is solitary or belongs to a contiguous group. We propose the Hairy transform, a novel method to enhance ground truths using 3D 'hairs' to represent each pixel's position relative to objects in the ground truth. We illustrate its effectiveness using a mainstream model and loss function on a commonly used cardiac MRI dataset, as well as a set of synthetic data constructed to highlight the effect of the method during training. The overall improvement in segmentation results comes at the small cost of a one-off pre-processing step, and the method can easily be integrated into any standard machine learning model. Rather than looking to make minute improvements for mostly correct 'standard' masks, we instead show how this method helps improve robustness against catastrophic failures for edge cases.
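The abstract leaves the exact construction of the 'hairs' open, so the following is only a plausible sketch of the idea: each pixel of the ground truth is augmented with a 3D vector (offset toward the nearest object pixel plus a signed distance), so a loss can tell a stray false positive far from the mask apart from one hugging its boundary. The function name and channel layout are invented for illustration.

```python
import numpy as np
from scipy.ndimage import distance_transform_edt

def hairy_ground_truth(mask: np.ndarray) -> np.ndarray:
    """mask: binary (H, W) ground truth -> (H, W, 3) 'hair' map."""
    # Distance to, and index of, the nearest foreground pixel (for background pixels).
    dist_out, nearest = distance_transform_edt(mask == 0, return_indices=True)
    iy, ix = nearest
    # Distance to background for foreground pixels, so the distance is signed.
    dist_in = distance_transform_edt(mask == 1)
    ys, xs = np.indices(mask.shape)
    # Channels: (dy, dx) pointing at the nearest object pixel, plus signed distance.
    return np.stack([iy - ys, ix - xs, dist_out - dist_in], axis=-1).astype(np.float32)

demo = np.zeros((8, 8), dtype=np.uint8)
demo[2:5, 2:5] = 1
# A pixel far from the object carries a long 'hair'; one next to it, a short one.
print(hairy_ground_truth(demo)[7, 7], hairy_ground_truth(demo)[2, 5])
```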
ISBN: (Print) 9798350365474
A significant challenge in achieving ubiquitous Artificial Intelligence is the limited ability of models to rapidly learn new information in real-world scenarios where data follows long-tailed distributions, all while avoiding forgetting previously acquired knowledge. In this work, we study the under-explored problem of Long-Tailed Online Continual Learning (LTOCL), which aims to learn new tasks from sequentially arriving class-imbalanced data streams. Each sample is observed only once during training, without knowledge of the task data distribution. We present DELTA, a decoupled learning approach designed to enhance learning representations and address the substantial imbalance in LTOCL. We enhance the learning process by adapting supervised contrastive learning to attract similar samples and repel dissimilar (out-of-class) samples. Further, by balancing gradients during training using an equalization loss, DELTA significantly enhances learning outcomes and successfully mitigates catastrophic forgetting. Through extensive evaluation, we demonstrate that DELTA improves the capacity for incremental learning, surpassing existing OCL methods. Our results suggest considerable promise for applying OCL in real-world applications. Code is available online.(1)
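As a concrete reference point, here is a minimal sketch of the supervised contrastive term the abstract describes (same-class embeddings attract, others repel); it is not DELTA's full decoupled pipeline, and the temperature and batch shapes are assumptions.

```python
import torch
import torch.nn.functional as F

def sup_con_loss(feats: torch.Tensor, labels: torch.Tensor, tau: float = 0.1):
    """feats: (N, D) L2-normalized embeddings; labels: (N,) class ids."""
    sim = feats @ feats.T / tau                         # pairwise similarities
    self_mask = torch.eye(len(feats), dtype=torch.bool, device=feats.device)
    sim = sim.masked_fill(self_mask, float('-inf'))     # exclude self-pairs
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    pos = (labels[:, None] == labels[None, :]) & ~self_mask
    # Mean log-likelihood of same-class pairs, per anchor that has positives.
    per_anchor = torch.where(pos, log_prob, torch.zeros_like(log_prob)).sum(1)
    per_anchor = per_anchor / pos.sum(1).clamp(min=1)
    return -per_anchor[pos.any(1)].mean()

feats = F.normalize(torch.randn(16, 128), dim=1)
labels = torch.randint(0, 4, (16,))
print(sup_con_loss(feats, labels))
```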
ISBN: (Print) 9798350365474
Human faces encode a vast amount of information, including not only uniquely distinctive features of the individual but also demographic information such as a person's age, gender, and weight. Such information is referred to as soft biometrics: physical, behavioral, or adhered human characteristics, classifiable in pre-defined human-compliant categories. As the saying goes, 'one look is worth a thousand words'. Vision Transformers have emerged as a powerful deep learning architecture able to achieve accurate classifications for different computer vision tasks, but these models have not yet been applied to soft biometrics. In this work, we propose the Bidirectional Encoder Face representation from image Transformers (BEFiT), a model that leverages multi-attention mechanisms to capture local and global features and produce a multi-purpose face embedding. This unique embedding enables the estimation of different demographics without having to re-train the model for each soft-biometric trait, ensuring high efficiency without compromising accuracy. Our approach explores the use of visible and thermal images to achieve powerful face embeddings in different light spectra. We demonstrate that the BEFiT embeddings can capture essential information for gender, age, and weight estimation, surpassing the performance of dedicated deep learning structures for the estimation of a single soft-biometric trait. The code of the BEFiT implementation is publicly available.(1)
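The single-embedding, many-traits design can be pictured as one shared encoder feeding several lightweight heads, so a new trait needs only a new head rather than a retrained backbone. The sketch below uses a placeholder encoder rather than BEFiT's actual transformer, and the dimensions are assumptions.

```python
import torch
import torch.nn as nn

class MultiTraitHead(nn.Module):
    """Lightweight per-trait heads over one shared face embedding."""
    def __init__(self, dim: int = 768):
        super().__init__()
        self.gender = nn.Linear(dim, 2)      # classification logits
        self.age = nn.Linear(dim, 1)         # regression (years)
        self.weight = nn.Linear(dim, 1)      # regression (kg)

    def forward(self, emb: torch.Tensor) -> dict:
        return {"gender": self.gender(emb),
                "age": self.age(emb),
                "weight": self.weight(emb)}

# Placeholder encoder standing in for the BEFiT transformer backbone.
encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 224 * 224, 768))
heads = MultiTraitHead()
out = heads(encoder(torch.randn(4, 3, 224, 224)))
print({k: v.shape for k, v in out.items()})
```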
ISBN: (Print) 9798350365474
The inherent complexity and uncertainty of Machine Learning (ML) make it difficult for ML-based computer vision (CV) approaches to become prevalent in safety-critical domains like autonomous driving, despite their high performance. A crucial challenge in these domains is the safety assurance of ML-based systems. To address this, recent safety standardization in the automotive domain has introduced an ML safety lifecycle following an iterative development process. While this approach facilitates safety assurance, its iterative nature requires frequent adaptation and optimization of the ML function, which might include costly retraining of the ML model and is not guaranteed to converge to a safe AI solution. In this paper, we propose a modular ML approach which allows more efficient and targeted measures to be applied to each of the modules and process steps. Each module of the modular concept model represents one visual concept and is aggregated with the other modules' outputs into a task output. The design choices of a modular concept model can be categorized into the selection of the concept modules, the aggregation of their outputs, and the training of the concept modules. Using the example of traffic sign classification, we present each step of the involved design choices and the corresponding targeted measures to take in an iterative development process for engineering safe AI.
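The modular pattern can be made concrete with a toy example: one small network per visual concept, with a learned aggregation of the concept scores into the task output. The concept names and sizes below are invented for illustration; the point is that retraining or replacing a single module is a targeted measure that leaves the rest of the system untouched.

```python
import torch
import torch.nn as nn

class ConceptModule(nn.Module):
    """One small network per visual concept, e.g. 'red border' or 'digit 30'."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Conv2d(3, 8, 3, stride=2), nn.ReLU(),
                                 nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                 nn.Linear(8, 1))

    def forward(self, x):
        return self.net(x)

concepts = nn.ModuleDict({c: ConceptModule()
                          for c in ["red_border", "circular", "digit_30"]})
aggregator = nn.Linear(len(concepts), 10)   # 10 hypothetical sign classes

x = torch.randn(4, 3, 64, 64)
scores = torch.cat([m(x) for m in concepts.values()], dim=1)
print(aggregator(scores).shape)             # per-class logits from concept evidence
```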
ISBN: (Print) 9798350365474
Synthetic images can help alleviate much of the cost in the creation of training data for plant phenotyping-focused AI development. Synthetic-to-real style transfer is of particular interest to users of artificial data because of the domain shift problem created by training neural networks on images generated in a digital environment. In this paper we present a pipeline for synthetic plant creation and image-to-image style transfer, with a particular interest in synthetic-to-real domain adaptation targeting specific real datasets. Utilizing new advances in generative AI, we employ a combination of Stable Diffusion, Low-Rank Adapters (LoRA), and ControlNets to produce an advanced style transfer system. We focus our work on the core task of leaf instance segmentation, exploring both synthetic-to-real and inter-species style transfer, and find that our pipeline makes numerous improvements over CycleGAN for style transfer, and that the images we produce are comparable to real images when used as training data.
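As a rough illustration of how these components compose, the sketch below wires a ControlNet-conditioned Stable Diffusion img2img pass with LoRA weights via Hugging Face diffusers; the checkpoints, the LoRA path, and the prompt are placeholders, and the paper's actual pipeline may condition and schedule differently.

```python
import cv2
import numpy as np
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetImg2ImgPipeline

synthetic = Image.open("synthetic_plant.png").convert("RGB")     # placeholder input
gray = cv2.cvtColor(np.array(synthetic), cv2.COLOR_RGB2GRAY)
edges = cv2.Canny(gray, 100, 200)                                # leaf outlines to preserve
control = Image.fromarray(np.stack([edges] * 3, axis=-1))

controlnet = ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-canny")
pipe = StableDiffusionControlNetImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet)
pipe.load_lora_weights("style-loras/real-arabidopsis")           # hypothetical LoRA
styled = pipe(prompt="overhead photo of a real potted plant",
              image=synthetic, control_image=control,
              strength=0.6).images[0]                            # keep leaf layout
styled.save("stylized_plant.png")
```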
ISBN: (Print) 9798350365474
In this paper, we address the challenge of selecting an optimal dataset from a source pool with annotations to enhance performance on a target dataset derived from a different source. This is important in scenarios where it is hard to afford on-the-fly dataset annotation, and it is also the theme of the second Visual Data Understanding (VDU) Challenge. Our solution, the Classifier Guided Cluster Density Reduction (CCDR) framework, operates in two stages. Initially, we employ a filtering technique to identify images that align with the target dataset's distribution. Subsequently, we implement a graph-based cluster density reduction method, steered by a classifier that approximates the distance between the target and source distributions. This classifier is trained to distinguish between images that resemble the target dataset and those that do not, facilitating the pruning process shown in Figure 1. Our approach maintains a balance between selecting pertinent images that match the target distribution and eliminating redundant ones that do not contribute to the enhancement of the detection model. We demonstrate the superiority of our method over various baselines in object detection tasks, particularly in optimizing the training set distribution on the region100 dataset. We have released our code here: https://***/himsR/DataCVChallenge-2024/tree/main
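A schematic of the two stages, under the simplifying assumption that images are already embedded as feature vectors: a target-vs-source classifier scores how target-like each source image is, then clusters of the survivors are capped in size to cut redundancy. The classifier, cluster count, and cut-offs below are placeholder choices, not the paper's.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
src_feats = rng.normal(size=(1000, 64))            # source-pool image features
tgt_feats = rng.normal(loc=0.2, size=(200, 64))    # target-domain image features

# Stage 1: target-vs-source classifier scores how target-like each source image is.
X = np.vstack([src_feats, tgt_feats])
y = np.r_[np.zeros(len(src_feats)), np.ones(len(tgt_feats))]
scores = LogisticRegression(max_iter=1000).fit(X, y).predict_proba(src_feats)[:, 1]
kept_idx = np.argsort(-scores)[:400]               # keep the 400 most target-like

# Stage 2: cluster the survivors and cap each cluster's size to reduce density.
labels = KMeans(n_clusters=20, n_init=10).fit_predict(src_feats[kept_idx])
selected = np.concatenate([kept_idx[labels == c][:10] for c in range(20)])
print(f"{len(selected)} of {len(src_feats)} source images selected")
```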
ISBN: (Print) 9798350365474
Resource-constrained hardware, such as edge devices or cell phones, often relies on cloud servers to provide the required computational resources for inference in deep vision models. However, transferring image and video data from an edge or mobile device to a cloud server requires coding to deal with network constraints. The use of standardized codecs, such as JPEG or H.264, is prevalent and required to ensure interoperability. This paper aims to examine the implications of employing standardized codecs within deep vision pipelines. We find that using JPEG and H.264 coding significantly deteriorates the accuracy across a broad range of vision tasks and models. For instance, strong compression rates reduce semantic segmentation accuracy by more than 80% in mIoU. In contrast to previous findings, our analysis extends beyond image and action classification to localization and dense prediction tasks, thus providing a more comprehensive perspective.
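The effect is easy to probe: run the same image through a model before and after a JPEG round-trip and compare the predictions. The model, image path, and quality factor below are arbitrary stand-ins for the paper's broader benchmark.

```python
import io
import torch
from PIL import Image
from torchvision import transforms
from torchvision.models.segmentation import deeplabv3_resnet50

def jpeg_roundtrip(img: Image.Image, quality: int) -> Image.Image:
    """Re-encode an image through JPEG at the given quality factor."""
    buf = io.BytesIO()
    img.save(buf, format="JPEG", quality=quality)
    return Image.open(io.BytesIO(buf.getvalue())).convert("RGB")

model = deeplabv3_resnet50(weights="DEFAULT").eval()
tf = transforms.Compose([transforms.Resize((520, 520)), transforms.ToTensor(),
                         transforms.Normalize(mean=[0.485, 0.456, 0.406],
                                              std=[0.229, 0.224, 0.225])])

img = Image.open("street_scene.png").convert("RGB")   # placeholder image
with torch.no_grad():
    clean = model(tf(img).unsqueeze(0))["out"].argmax(1)
    coded = model(tf(jpeg_roundtrip(img, quality=10)).unsqueeze(0))["out"].argmax(1)
print("fraction of pixels that changed label:", (clean != coded).float().mean().item())
```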
ISBN: (Print) 9798350365474
Labels are the cornerstone of supervised machine learning algorithms. Most visual recognition methods are fully supervised, using bounding boxes or pixel-wise segmentations for object localization. Traditional labeling methods, such as crowd-sourcing, are prohibitive due to cost, data privacy, amount of time, and potential errors on large datasets. To address these issues, we propose a novel annotation framework, the Advanced Line Identification and Notation Algorithm (ALINA), which can be used for labeling taxiway datasets that consist of different camera perspectives and variable weather attributes (sunny and cloudy). Additionally, we propose the CIRCular threshoLd pixEl Discovery And Traversal (CIRCLEDAT) algorithm, an integral step in determining the pixels corresponding to taxiway line markings. Once the pixels are identified, ALINA generates corresponding pixel coordinate annotations on the frame. Using this approach, 60,249 frames from the taxiway dataset AssistTaxi have been labeled. To evaluate performance, a context-based edge map (CBEM) set was generated manually based on edge features and connectivity. The detection rate after testing the annotated labels against the CBEM set was recorded as 98.45%, attesting to the framework's dependability and effectiveness.
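The abstract does not spell out CIRCLEDAT's mechanics, so the sketch below only gestures at the stated idea: threshold candidate marking pixels, then traverse connected pixels and keep only groups large enough to be line segments. The color bounds, size threshold, and file name are all invented.

```python
import cv2
import numpy as np

def line_marking_pixels(frame_bgr: np.ndarray) -> np.ndarray:
    hsv = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2HSV)
    # Taxiway markings are typically yellow; these bounds are rough guesses.
    cand = cv2.inRange(hsv, (20, 80, 80), (35, 255, 255))
    # Traversal step: keep only connected components long enough to be lines.
    n, labels, stats, _ = cv2.connectedComponentsWithStats(cand)
    keep = np.zeros_like(cand)
    for i in range(1, n):
        if stats[i, cv2.CC_STAT_AREA] > 200:          # drop solitary blobs
            keep[labels == i] = 255
    return keep

frame = cv2.imread("taxiway_frame.png")               # placeholder frame
ys, xs = np.nonzero(line_marking_pixels(frame))
print(list(zip(ys[:5], xs[:5])))                      # pixel coordinate annotations
```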
ISBN: (Print) 9798350365474
Combining Convolutional Neural Networks (CNNs) or Vision Transformers (ViTs) with Recurrent Neural Networks (RNNs) for spatiotemporal forecasting has yielded unparalleled results in predicting temporal and spatial dynamics. However, modeling extensive global information remains a formidable challenge; CNNs are limited by their narrow receptive fields, and ViTs struggle with the intensive computational demands of their attention mechanisms. The emergence of recent Mamba-based architectures has been met with enthusiasm for their exceptional long-sequence modeling capabilities, surpassing established vision models in efficiency and accuracy, which motivates us to develop an innovative architecture tailored for spatiotemporal forecasting. In this paper, we propose the VMRNN cell, a new recurrent unit that integrates the strengths of Vision Mamba blocks with LSTM. We construct a network centered on VMRNN cells to tackle spatiotemporal prediction tasks effectively. Our extensive evaluations show that our proposed approach secures competitive results on a variety of tasks while maintaining a smaller model size. Our code is available at https://***/yyyujintang/VMRNN-PyTorch.
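The wiring of such a recurrent unit might look like the skeleton below, where a simple linear layer stands in for the Vision Mamba block and feeds standard LSTM-style gating; this illustrates the pattern, not the paper's actual VMRNN cell.

```python
import torch
import torch.nn as nn

class VMRNNCellSketch(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.mixer = nn.Linear(dim, dim)            # placeholder for a Mamba block
        self.gates = nn.Linear(2 * dim, 4 * dim)    # i, f, g, o gates

    def forward(self, x, state):
        h, c = state
        x = self.mixer(x)                           # sequence-mixing step
        i, f, g, o = self.gates(torch.cat([x, h], -1)).chunk(4, -1)
        c = torch.sigmoid(f) * c + torch.sigmoid(i) * torch.tanh(g)
        h = torch.sigmoid(o) * torch.tanh(c)
        return h, (h, c)

cell, d = VMRNNCellSketch(64), 64
h = c = torch.zeros(2, d)
for t in range(10):                                 # unroll over time steps
    out, (h, c) = cell(torch.randn(2, d), (h, c))
print(out.shape)
```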
ISBN: (Print) 9798350365474
In the field of Class Incremental Object Detection (CIOD), creating models that can continuously learn like humans is a major challenge. Pseudo-labeling methods, although initially powerful, struggle with multi-scenario incremental learning due to their tendency to forget past knowledge. To overcome this, we introduce a new approach called Vision-Language Model assisted Pseudo-Labeling (VLM-PL). This technique uses a Vision-Language Model (VLM) to verify the correctness of pseudo ground truths (GTs) without requiring additional model training. VLM-PL starts by deriving pseudo GTs from a pre-trained detector. Then, we generate custom queries for each pseudo GT using carefully designed prompt templates that combine image and text features. This allows the VLM to classify the correctness through its responses. Furthermore, VLM-PL integrates refined pseudo GTs and real GTs from upcoming training, effectively combining new and old knowledge. Extensive experiments conducted on the Pascal VOC and MS COCO datasets not only highlight VLM-PL's exceptional performance in multi-scenario settings but also illuminate its effectiveness in dual-scenario settings by achieving state-of-the-art results in both.
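As a stand-in for the prompt-based VLM query, the sketch below scores each pseudo-GT crop with CLIP against the pseudo label and a background prompt, keeping the box only when the label wins; the threshold, prompts, and file name are assumptions, and the paper's VLM and templates may differ.

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32").eval()
proc = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def verify_pseudo_gt(image: Image.Image, box, label: str) -> bool:
    """Keep a pseudo GT only if its label beats a background prompt on the crop."""
    crop = image.crop(box)
    prompts = [f"a photo of a {label}", "background with no object"]
    inputs = proc(text=prompts, images=crop, return_tensors="pt", padding=True)
    with torch.no_grad():
        probs = model(**inputs).logits_per_image.softmax(-1)[0]
    return probs[0].item() > 0.6          # confidence threshold is an assumption

img = Image.open("scene.jpg").convert("RGB")      # placeholder detection frame
print(verify_pseudo_gt(img, (30, 40, 200, 220), "dog"))
```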