检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

23,008 篇 会议
126 册 图书
94 篇 期刊文献

馆藏范围

23,227 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

13,631 篇 工学
- 11,116 篇 计算机科学与技术...
- 3,481 篇 软件工程
- 2,445 篇 机械工程
- 1,716 篇 光学工程
- 1,080 篇 电气工程
- 1,014 篇 控制科学与工程
- 788 篇 信息与通信工程
- 411 篇 仪器科学与技术
- 352 篇 生物工程
- 251 篇 生物医学工程（可授...
- 196 篇 电子科学与技术（可...
- 114 篇 化学工程与技术
- 109 篇 安全科学与工程
- 100 篇 测绘科学与技术
- 88 篇 建筑学
- 88 篇 交通运输工程
- 84 篇 土木工程
3,495 篇 医学
- 3,482 篇 临床医学
- 82 篇 基础医学(可授医学...
3,246 篇 理学
- 1,941 篇 物理学
- 1,643 篇 数学
- 563 篇 统计学（可授理学、...
- 500 篇 生物学
- 249 篇 系统科学
- 106 篇 化学
521 篇 管理学
- 311 篇 图书情报与档案管...
- 223 篇 管理科学与工程(可...
- 76 篇 工商管理
276 篇 艺术学
- 276 篇 设计学（可授艺术学...
66 篇 法学
- 63 篇 社会学
38 篇 农学
28 篇 教育学
22 篇 经济学
10 篇 军事学
3 篇 文学

主题

10,186 篇 computer vision
3,967 篇 pattern recognit...
3,005 篇 training
2,007 篇 computational mo...
1,818 篇 visualization
1,815 篇 cameras
1,515 篇 feature extracti...
1,481 篇 shape
1,455 篇 three-dimensiona...
1,438 篇 image segmentati...
1,287 篇 robustness
1,206 篇 computer archite...
1,155 篇 semantics
1,147 篇 conferences
1,107 篇 layout
1,092 篇 computer science
1,088 篇 object detection
1,025 篇 benchmark testin...
970 篇 codes
922 篇 face recognition

机构

136 篇 univ sci & techn...
121 篇 univ chinese aca...
118 篇 chinese univ hon...
105 篇 carnegie mellon ...
101 篇 tsinghua univers...
101 篇 microsoft resear...
95 篇 swiss fed inst t...
93 篇 zhejiang univ pe...
82 篇 university of sc...
81 篇 zhejiang univers...
79 篇 university of ch...
77 篇 shanghai ai lab ...
72 篇 shanghai jiao to...
69 篇 national laborat...
67 篇 microsoft res as...
67 篇 alibaba grp peop...
64 篇 adobe research
60 篇 peking univ peop...
60 篇 tsinghua univ pe...
59 篇 univ oxford oxfo...

作者

81 篇 van gool luc
72 篇 timofte radu
65 篇 zhang lei
47 篇 luc van gool
40 篇 yang yi
40 篇 li stan z.
37 篇 loy chen change
35 篇 chen chen
33 篇 xiaoou tang
32 篇 liu yang
32 篇 qi tian
31 篇 tian qi
31 篇 sun jian
30 篇 murino vittorio
29 篇 ling haibin
29 篇 darrell trevor
29 篇 pascal fua
29 篇 li fei-fei
28 篇 li xin
28 篇 ying shan

语言

22,989 篇 英文
210 篇 其他
22 篇 中文
5 篇 土耳其文
2 篇 日文

检索条件"任意字段=IEEE Conference on Computer Vision and Pattern Recognition Workshops"

共 23228 条记录，以下是821-830 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Uncurated Image-Text Datasets: Shedding Light on Demographic Bias

Uncurated Image-Text Datasets: Shedding Light on Demographic...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Garcia, Noa Hirota, Yusuke Wu, Yankun Nakashima, Yuta Osaka Univ Osaka Japan

ISBN: (纸本)9798350301298

The increasing tendency to collect large and uncurated datasets to train vision-and-language models has raised concerns about fair representations. It is known that even small but manually annotated datasets, such as MSCOCO, are affected by societal bias. This problem, far from being solved, may be getting worse with data crawled from the Internet without much control. In addition, the lack of tools to analyze societal bias in big collections of images makes addressing the problem extremely challenging. Our first contribution is to annotate part of the Google Conceptual Captions dataset, widely used for training vision-and-language models, with four demographic and two contextual attributes. Our second contribution is to conduct a comprehensive analysis of the annotations, focusing on how different demographic groups are represented. Our last contribution lies in evaluating three prevailing vision-and-language tasks: image captioning, text-image CLIP embeddings, and text-to-image generation, showing that societal bias is a persistent problem in all of them.

关键词： accountability ethics in vision fairness privacy Transparency

来源：评论

学校读者我要写书评

暂无评论

CNLL: A Semi-supervised Approach For Continual Noisy Label Learning

CNLL: A Semi-supervised Approach For Continual Noisy Label L...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Karim, Nazmul Khalid, Umar Esmaeili, Ashkan Rahnavard, Nazanin Univ Cent Florida Dept Elect & Comp Engn Orlando FL 32816 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

The task of continual learning requires careful design of algorithms that can tackle catastrophic forgetting. However, the noisy label, which is inevitable in a real-world scenario, seems to exacerbate the situation. While very few studies have addressed the issue of continual learning under noisy labels, long training time and complicated training schemes limit their applications in most cases. In contrast, we propose a simple purification technique to effectively cleanse the online data stream that is both cost-effective and more accurate. After purification, we perform fine-tuning in a semi-supervised fashion that ensures the participation of all available samples. Training in this fashion helps us learn a better representation that results in state-of-the-art (SOTA) performance. Through extensive experimentation on 3 benchmark datasets, MNIST, CIFAR10 and CIFAR100, we show the effectiveness of our proposed approach. We achieve a 24.8% performance gain for CIFAR10 with 20% noise over previous SOTA methods. Our code is publicly available.(1)

关键词： Training computer vision Codes Purification conferences Benchmark testing Performance gain

来源：评论

学校读者我要写书评

暂无评论

Graph Representation for Order-aware Visual Transformation

Graph Representation for Order-aware Visual Transformation

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Qiu, Yue Sun, Yanjun Matsuzawa, Fumiya Iwata, Kenji Kataoka, Hirokatsu Natl Inst Adv Ind Sci & Technol Tokyo Japan

ISBN: (纸本)9798350301298

This paper proposes a new visual reasoning formulation that aims at discovering changes between image pairs and their temporal orders. Recognizing scene dynamics and their chronological orders is a fundamental aspect of human cognition. The aforementioned abilities make it possible to follow step-by-step instructions, reason about and analyze events, recognize abnormal dynamics, and restore scenes to their previous states. However, it remains unclear how well current AI systems perform in these capabilities. Although a series of studies have focused on identifying and describing changes from image pairs, they mainly consider those changes that occur synchronously, thus neglecting potential orders within those changes. To address the above issue, we first propose a visual transformation graph structure for conveying order-aware changes. Then, we bench-marked previous methods on our newly generated dataset and identified the issues of existing methods for change order recognition. Finally, we show a significant improvement in order-aware change recognition by introducing a new model that explicitly associates different changes and then identifies changes and their orders in a graph representation.

关键词： and reasoning language vision

来源：评论

学校读者我要写书评

暂无评论

Adversarial Machine Learning Attacks Against Video Anomaly Detection Systems

Adversarial Machine Learning Attacks Against Video Anomaly D...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Mumcu, Furkan Doshi, Keval Yilmaz, Yasin Univ S Florida 4202 E Fowler Ave Tampa FL 33620 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Anomaly detection in videos is an important computer vision problem with various applications including automated video surveillance. Although adversarial attacks on image understanding models have been heavily investigated, there is not much work on adversarial machine learning targeting video understanding models and no previous work which focuses on video anomaly detection. To this end, we investigate an adversarial machine learning attack against video anomaly detection systems, that can be implemented via an easy-to-perform cyber-attack. Since surveillance cameras are usually connected to the server running the anomaly detection model through a wireless network, they are prone to cyber-attacks targeting the wireless connection. We demonstrate how Wi-Fi deauthentication attack, a notoriously easy-to-perform and effective denial-of-service (DoS) attack, can be utilized to generate adversarial data for video anomaly detection systems. Specifically, we apply several effects caused by the Wi-Fi deauthentication attack on video quality (e.g., slow down, freeze, fast forward, low resolution) to the popular benchmark datasets for video anomaly detection. Our experiments with several state-of-the-art anomaly detection models show that the attackers can significantly undermine the reliability of video anomaly detection systems by causing frequent false alarms and hiding physical anomalies from the surveillance system.

关键词： computer vision Computational modeling Wireless networks Video surveillance Adversarial machine learning Synchronization Servers

来源：评论

学校读者我要写书评

暂无评论

Once for Both: Single Stage of Importance and Sparsity Search for vision Transformer Compression

Once for Both: Single Stage of Importance and Sparsity Searc...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Ye, Hancheng Yu, Chong Ye, Peng Xia, Renqiu Tang, Yansong Lu, Jiwen Chen, Tao Zhang, Bo Fudan Univ Sch Informat Sci & Technol Shanghai Peoples R China Shanghai Artificial Intelligence Lab Shanghai Peoples R China Fudan Univ Acad Engn & Technol Shanghai Peoples R China Shanghai Jiao Tong Univ Shanghai Peoples R China Tsinghua Univ Beijing Peoples R China

ISBN: (纸本)9798350353013;9798350353006

Recent vision Transformer Compression (VTC) works mainly follow a two-stage scheme, where the importance score of each model unit is first evaluated or preset in each submodule, followed by the sparsity score evaluation according to the target sparsity constraint. Such a separate evaluation process induces the gap between importance and sparsity score distributions, thus causing high search costs for VTC. In this work, for the first time, we investigate how to integrate the evaluations of importance and sparsity scores into a single stage, searching the optimal subnets in an efficient manner. Specifically, we present OFB, a cost-efficient approach that simultaneously evaluates both importance and sparsity scores, termed Once for Both (OFB), for VTC. First, a bi-mask scheme is developed by entangling the importance score and the differentiable sparsity score to jointly determine the pruning potential (prunability) of each unit. Such a bi-mask search strategy is further used together with a proposed adaptive one-hot loss to realize the progressive-andefficient search for the most important subnet. Finally, Progressive Masked Image Modeling (PMIM) is proposed to regularize the feature space to be more representative during the search process, which may be degraded by the dimension reduction. Extensive experiments demonstrate that OFB can achieve superior compression performance over state-of-the-art searching-based and pruning-based methods under various vision Transformer architectures, meanwhile promoting search efficiency significantly, e.g., costing one GPU search day for the compression of DeiT-S on ImageNet-1K.

关键词： Image compression

来源：评论

学校读者我要写书评

暂无评论

PromptKD: Unsupervised Prompt Distillation for vision-Language Models

PromptKD: Unsupervised Prompt Distillation for Vision-Langua...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Li, Zheng Li, Xiang Fu, Xinyi Zhang, Xin Wang, Weiqiang Chen, Shuo Yang, Jian Nankai Univ Coll Comp Sci PCA Lab VCIP Tianjin Peoples R China NKIARI Shenzhen Futian Peoples R China Ant Grp Tiansuan Lab Hangzhou Peoples R China RIKEN Wako Saitama Japan

ISBN: (纸本)9798350353006

Prompt learning has emerged as a valuable technique in enhancing vision-language models (VLMs) such as CLIP for downstream tasks in specific domains. Existing work mainly focuses on designing various learning forms of prompts, neglecting the potential of prompts as effective distillers for learning from larger teacher models. In this paper, we introduce an unsupervised domain prompt distillation framework, which aims to transfer the knowledge of a larger teacher model to a lightweight target model through prompt-driven imitation using unlabeled domain images. Specifically, our framework consists of two distinct stages. In the initial stage, we pre-train a large CLIP teacher model using domain (few-shot) labels. After pre-training, we leverage the unique decoupled-modality characteristics of CLIP by pre-computing and storing the text features as class vectors only once through the teacher text encoder. In the subsequent stage, the stored class vectors are shared across teacher and student image encoders for calculating the predicted logits. Further, we align the logits of both the teacher and student models via KL divergence, encouraging the student image encoder to generate similar probability distributions to the teacher through the learnable prompts. The proposed prompt distillation process eliminates the reliance on labeled data, enabling the algorithm to leverage a vast amount of unlabeled images within the domain. Finally, the well-trained student image encoders and pre-stored text features (class vectors) are utilized for inference. To our best knowledge, we are the first to (1) perform unsupervised domain-specific prompt-driven knowledge distillation for CLIP, and (2) establish a practical pre-storing mechanism of text features as shared class vectors between teacher and student. Extensive experiments on 11 datasets demonstrate the effectiveness of our method. Code is publicly available at https://***/zhengli97/PromptKD.

关键词： knowledge distillation prompt learning vision-language models zero-shot learning

来源：评论

学校读者我要写书评

暂无评论

Semantic Segmentation for Thermal Images: A Comparative Survey

Semantic Segmentation for Thermal Images: A Comparative Surv...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Kutuk, Zulfiye Algan, Gorkem Aselsan Inc Dept Image Proc & Comp Vis Technol Ankara Turkey

ISBN: (纸本)9781665487399

Semantic segmentation is a challenging task since it requires excessively more low-level spatial information of the image compared to other computer vision problems. The accuracy of pixel-level classification can be affected by many factors, such as imaging limitations and the ambiguity of object boundaries in an image. Conventional methods exploit three-channel RGB images captured in the visible spectrum with deep neural networks (DNN). Thermal images can significantly contribute during the segmentation since thermal imaging cameras are capable of capturing details despite the weather and illumination conditions. Using infrared spectrum in semantic segmentation has many real-world use cases, such as autonomous driving, medical imaging, agriculture, defense industry, etc. Due to this wide range of use cases, designing accurate semantic segmentation algorithms with the help of infrared spectrum is an important challenge. One approach is to use both visible and infrared spectrum images as inputs. These methods can accomplish higher accuracy due to enriched input information, with the cost of extra effort for the alignment and processing of multiple inputs. Another approach is to use only thermal images, enabling less hardware cost for smaller use cases. Even though there are multiple surveys on semantic segmentation methods, the literature lacks a comprehensive survey centered explicitly around semantic segmentation using infrared spectrum. This work aims to fill this gap by presenting algorithms in the literature and categorizing them by their input images.

关键词： Image segmentation computer vision Costs Semantics Neural networks Lighting Robustness

来源：评论

学校读者我要写书评

暂无评论

Continual Learning with Transformers for Image Classification

Continual Learning with Transformers for Image Classificatio...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Ermis, Beyza Zappella, Giovanni Wistuba, Martin Rawal, Aditya Archambeau, Cedric AWS Berlin Germany AWS Santa Clara CA USA

ISBN: (纸本)9781665487399

In many real-world scenarios, data to train machine learning models become available over time. However, neural network models struggle to continually learn new concepts without forgetting what has been learnt in the past. This phenomenon is known as catastrophic forgetting and it is often difficult to prevent due to practical constraints, such as the amount of data that can be stored or the limited computation sources that can be used. Moreover, training large neural networks, such as Transformers, from scratch is very costly and requires a vast amount of training data, which might not be available in the application domain of interest. A recent trend indicates that dynamic architectures based on an expansion of the parameters can reduce catastrophic forgetting efficiently in continual learning, but this needs complex tuning to balance the growing number of parameters and barely share any information across tasks. As a result, they struggle to scale to a large number of tasks without significant overhead. In this paper, we validate in the computer vision domain a recent solution called Adaptive Distillation of Adapters (ADA), which is developed to perform continual learning using pre-trained Transformers and Adapters on text classification tasks. We empirically demonstrate on different classification tasks that this method maintains a good predictive performance without retraining the model or increasing the number of model parameters over the time. Besides it is significantly faster at inference time compared to the state-of-the-art methods.

关键词： Training computer vision Adaptation models Computational modeling Neural networks Training data Predictive models

来源：评论

学校读者我要写书评

暂无评论

Transformaly - Two (Feature Spaces) Are Better Than One

Transformaly - Two (Feature Spaces) Are Better Than One

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Cohen, Matan Jacob Avidan, Shai Tel Aviv Univ Blavatnik Sch Comp Sci Tel Aviv Israel Tel Aviv Univ Sch Elect Engn Tel Aviv Israel

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Anomaly detection is a well-established research area that seeks to identify samples outside of a predetermined distribution. An anomaly detection pipeline is comprised of two main stages: (1) feature extraction and (2) normality score assignment. Recent papers used pre-trained networks for feature extraction achieving state-of-the-art results. However, the use of pre-trained networks does not fully-utilize the normal samples that are available at train time. This paper suggests taking advantage of this information by using teacher-student training. In our setting, a pretrained teacher network is used to train a student network on the normal training samples. Since the student network is trained only on normal samples, it is expected to deviate from the teacher network in abnormal cases. This difference can serve as a complementary representation to the pre-trained feature vector. Our method - Transformaly - exploits a pre-trained vision Transformer (ViT) to extract both feature vectors: the pre-trained (agnostic) features and the teacher-student (fine-tuned) features. We report state-of-the-art AUROC results in both the common unimodal setting, where one class is considered normal and the rest are considered abnormal, and the multimodal setting, where all classes but one are considered normal, and just one class is considered abnormal(1).

关键词： Training Visualization computer vision conferences Pipelines computer architecture Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Remote Estimation of Continuous Blood Pressure by a Convolutional Neural Network Trained on Spatial patterns of Facial Pulse Waves

Remote Estimation of Continuous Blood Pressure by a Convolut...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Iuchi, Kaito Miyazaki, Ryogo Cardoso, George C. Ogawa-Ochiai, Keiko Tsumura, Norimichi Chiba Univ Grad Sch Sci & Engn Dept Imaging Sci Chiba Japan Univ Sao Paulo Phys Dept FFCLRP Sao Paulo Brazil Hiroshima Univ Hosp Dept Gen Med Hiroshima Japan

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

We propose a remote method to estimate continuous blood pressure based on spatial information of a pulse wave at a single point in time. By setting regions of interest to cover a face in a mutually exclusive and collectively exhaustive manner, RGB facial video is converted into a spatial pulse wave signal. The spatial pulse wave signal is converted into spatial signals of contours of each segmented pulse beat and relationships of each segmented pulse beat. The spatial signal is represented as a time-continuous value based on a representation of a pulse contour in a time axis and a phase axis and an interpolation along with the time axis. A relationship between the spatial signals and blood pressure is modeled by a convolutional neural network. A dataset was built to demonstrate the effectiveness of the proposed method. The dataset consists of continuous blood pressure and facial RGB videos of ten healthy volunteers. A comparison of conventional methods with the proposed method shows superior error for the latter. The results show an adequate estimation of the performance of the proposed method, when compared to the ground truth in mean blood pressure, in both the correlation coefficient (0.85) and mean absolute error (5.4 mmHg).

关键词： Correlation coefficient Interpolation computer vision Face recognition conferences Estimation Blood pressure

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 79 80 81 82 83 84 85 86 87 88 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：