ISBN (print): 9781665445092
Unsupervised Domain Adaptation (UDA) can tackle the challenge that convolutional neural network (CNN)-based approaches for semantic segmentation heavily rely on pixel-level annotated data, which is labor-intensive to obtain. However, existing UDA approaches inevitably require full access to the source datasets to reduce the gap between the source and target domains during model adaptation, which is impractical in real scenarios where the source datasets are private and thus cannot be released along with the well-trained source models. To cope with this issue, we propose a source-free domain adaptation framework for semantic segmentation, namely SFDA, in which only a well-trained source model and an unlabeled target domain dataset are available for adaptation. SFDA not only recovers and preserves the source-domain knowledge from the source model via knowledge transfer during model adaptation, but also distills valuable information from the target domain for self-supervised learning. Pixel- and patch-level optimization objectives tailored for semantic segmentation are seamlessly integrated into the framework. Extensive experimental results on numerous benchmark datasets highlight the effectiveness of our framework against existing UDA approaches that rely on source data.
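The abstract does not give the exact form of the pixel- and patch-level objectives; the following PyTorch sketch only illustrates one plausible combination of self-supervised target-domain losses under assumed definitions (a confidence-masked pixel-level pseudo-label loss plus a patch-level entropy term). The function name, confidence threshold, and patch size are all hypothetical, not from the paper.

```python
import torch
import torch.nn.functional as F

def pixel_and_patch_losses(logits, conf_thresh=0.9, patch_size=8):
    """Illustrative self-supervised objectives on unlabeled target images.

    logits: [B, C, H, W] segmentation logits from the adapted model.
    Pixel level: cross-entropy against confident pseudo-labels.
    Patch level: entropy minimization on patch-averaged predictions.
    """
    probs = torch.softmax(logits, dim=1)                      # [B, C, H, W]
    conf, pseudo = probs.max(dim=1)                           # [B, H, W]

    # Pixel-level: only confident pixels contribute to the pseudo-label loss.
    ce = F.cross_entropy(logits, pseudo, reduction="none")    # [B, H, W]
    mask = (conf > conf_thresh).float()
    pixel_loss = (ce * mask).sum() / mask.sum().clamp(min=1.0)

    # Patch-level: average predictions over patches, then minimize entropy
    # so that each patch commits to a coherent label distribution.
    patch_probs = F.avg_pool2d(probs, kernel_size=patch_size)  # [B, C, H/p, W/p]
    patch_entropy = -(patch_probs * (patch_probs + 1e-8).log()).sum(dim=1)
    patch_loss = patch_entropy.mean()

    return pixel_loss, patch_loss

# Example usage with random logits standing in for model output.
logits = torch.randn(2, 19, 64, 64)
pl, ql = pixel_and_patch_losses(logits)
print(pl.item(), ql.item())
```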
ISBN (print): 9781665445092
This paper studies the problem of semi-supervised video object segmentation (VOS). Multiple works have shown that memory-based approaches can be effective for video object segmentation. They are mostly based on pixel-level matching, both spatially and temporally. The main shortcoming of memory-based approaches is that they do not take into account the sequential order among frames and do not exploit object-level knowledge from the target. To address this limitation, we propose a framework that Learns position and target Consistency for Memory-based video object segmentation, termed LCM. It applies the memory mechanism to retrieve pixels globally, and meanwhile learns position consistency for more reliable segmentation. The learned location response promotes better discrimination between the target and distractors. Besides, LCM introduces an object-level relationship from the target to maintain target consistency, making LCM more robust to error drifting. Experiments show that our LCM achieves state-of-the-art performance on both the DAVIS and YouTube-VOS benchmarks. Moreover, we rank first in the DAVIS 2020 challenge semi-supervised VOS task.
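For context, the pixel-level matching that memory-based VOS methods (and hence LCM) build on is a space-time attention read between query and memory features. The sketch below shows that generic read operation only; it is not LCM's position- or target-consistency module, and the tensor layout is an assumption.

```python
import torch

def memory_read(query_key, mem_key, mem_value):
    """Generic pixel-level memory read used by matching-based VOS methods.

    query_key: [B, Ck, H, W]    keys of the current frame
    mem_key:   [B, Ck, T, H, W] keys of memorized frames
    mem_value: [B, Cv, T, H, W] values of memorized frames
    Returns:   [B, Cv, H, W]    value read for every query pixel.
    """
    B, Ck, H, W = query_key.shape
    q = query_key.flatten(2)                          # [B, Ck, HW]
    k = mem_key.flatten(2)                            # [B, Ck, THW]
    v = mem_value.flatten(2)                          # [B, Cv, THW]

    affinity = torch.einsum("bck,bcm->bkm", q, k)     # [B, HW, THW]
    affinity = torch.softmax(affinity / Ck ** 0.5, dim=-1)
    read = torch.einsum("bkm,bcm->bck", affinity, v)  # [B, Cv, HW]
    return read.view(B, -1, H, W)

# Toy example: 2 memory frames, 16x16 feature maps.
qk = torch.randn(1, 64, 16, 16)
mk = torch.randn(1, 64, 2, 16, 16)
mv = torch.randn(1, 128, 2, 16, 16)
print(memory_read(qk, mk, mv).shape)   # torch.Size([1, 128, 16, 16])
```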
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Face recognition systems are widely used in real-world scenarios but are susceptible to physical and digital attacks. Effective methods for the unified detection of both physical and digital face attacks are essential to ensure the reliability of face recognition systems. However, how to obtain a unified face attack detection model with adequate fine-grained perception and cross-domain generalization remains an open challenge. To address this issue, we first propose a two-stage training strategy that utilizes unlabeled face images with masked image modeling and unleashes the potential of vision transformers. Furthermore, we propose a novel method termed Micro Disturbance, which enriches the representation distribution of forged faces and increases the diversity of the training data, thereby addressing the issue of cross-domain generalization. Owing to the effectiveness of the proposed methods, our model wins third place in the 5th Face Anti-Spoofing Challenge@CVPR2024, with an impressive ACER score of 5.511.
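The abstract does not specify how Micro Disturbance is implemented. Purely as an assumed reading of "enriching the representation distribution of forged faces", the sketch below adds a small, bounded random perturbation to attack samples only during training; every name and the perturbation budget are hypothetical stand-ins, not the paper's method.

```python
import torch

def micro_disturbance(images, labels, spoof_label=1, eps=2.0 / 255.0):
    """Hypothetical sketch: add a small, bounded random perturbation to
    forged-face samples only, to diversify their training distribution.

    images: [B, 3, H, W] in [0, 1]; labels: [B] (1 = attack, 0 = bona fide).
    """
    noise = torch.empty_like(images).uniform_(-eps, eps)
    is_spoof = (labels == spoof_label).view(-1, 1, 1, 1).float()
    disturbed = images + noise * is_spoof
    return disturbed.clamp(0.0, 1.0)

# Usage inside a training loop (the detector and loss are assumed to exist elsewhere).
imgs = torch.rand(4, 3, 224, 224)
lbls = torch.tensor([0, 1, 1, 0])
aug = micro_disturbance(imgs, lbls)
print(aug.shape)
```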
ISBN (print): 9781665445092
In this paper, we propose a feature-embedding-based video object segmentation (VOS) method that is simple, fast, and effective. The current VOS task involves two main challenges: object instance differentiation and cross-frame instance alignment. Most state-of-the-art matching-based VOS methods simplify this task into a binary segmentation task and tackle each instance independently. In contrast, we decompose the VOS task into two subtasks: global embedding learning, which segments foreground objects of each frame in a pixel-to-pixel manner, and instance feature embedding learning, which separates instances. The outputs of these two subtasks are fused to obtain the final instance masks quickly and accurately. By using the relations among different instances within each frame as well as the temporal relations across frames, the proposed network learns to differentiate multiple instances and associate them properly in one feed-forward pass. Extensive experimental results on the challenging DAVIS [34] and YouTube-VOS [57] datasets show that our method achieves better performance than most counterparts in each case.
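The fusion of the two subtask outputs is not spelled out in the abstract. One plausible, assumed fusion is sketched below: every pixel predicted as foreground by the global branch is assigned to the reference instance whose embedding is nearest. The function name, threshold, and assignment rule are illustrative assumptions, not the paper's exact design.

```python
import torch

def fuse_masks(fg_prob, pixel_emb, inst_embs, fg_thresh=0.5):
    """Assumed fusion of the global-foreground and instance-embedding outputs.

    fg_prob:   [H, W]     foreground probability from the global branch
    pixel_emb: [D, H, W]  per-pixel instance embeddings
    inst_embs: [N, D]     one reference embedding per target instance
    Returns:   [H, W] int 0 = background, i = instance index (1-based)
    """
    D, H, W = pixel_emb.shape
    flat = pixel_emb.view(D, -1).t()                  # [HW, D]
    # Distance from every pixel embedding to every instance embedding.
    dists = torch.cdist(flat, inst_embs)              # [HW, N]
    nearest = dists.argmin(dim=1).view(H, W) + 1      # 1-based instance ids
    return torch.where(fg_prob > fg_thresh, nearest, torch.zeros_like(nearest))

fg = torch.rand(32, 32)
emb = torch.randn(16, 32, 32)
refs = torch.randn(3, 16)
print(fuse_masks(fg, emb, refs).unique())
```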
ISBN (print): 9781665448994
Image colourisation is an ill-posed problem, with multiple correct solutions that depend on the context and object instances present in the input datum. Previous approaches attacked the problem either by requiring intense user interaction or by exploiting the ability of convolutional neural networks (CNNs) to learn image-level (context) features. However, obtaining human hints is not always feasible, and CNNs alone are not able to learn entity-level semantics unless multiple models pre-trained with supervision are considered. In this work, we propose a single network, named UCapsNet, that takes into consideration the image-level features obtained through convolutions and the entity-level features captured by means of capsules. Then, by skip connections over different layers, we enforce collaboration between the convolutional and entity-level factors to produce high-quality and plausible image colourisation. We pose the problem as a classification task that can be addressed by a fully unsupervised approach, thus requiring no human effort. Experimental results on three benchmark datasets show that our approach outperforms existing methods on standard quality metrics and achieves state-of-the-art performance on image colourisation. A large-scale user study shows that our method is preferred over existing solutions. Code is available at https://***/Riretta/Image_Colourisation_WiCV_2021.
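Posing colourisation as classification is commonly done by quantizing the ab colour channels into bins and training with per-pixel cross-entropy; the sketch below shows that common formulation only, with an assumed 10-unit grid and 22x22 bins, and is not necessarily UCapsNet's exact parameterization.

```python
import torch
import torch.nn.functional as F

def quantize_ab(ab, grid=10, ab_range=110):
    """Map continuous ab values in [-ab_range, ab_range) to discrete bin ids."""
    bins_per_axis = (2 * ab_range) // grid                 # 22 bins per axis
    idx = ((ab + ab_range) / grid).floor().long().clamp(0, bins_per_axis - 1)
    return idx[:, 0] * bins_per_axis + idx[:, 1]           # [B, H, W] class ids

def colourisation_loss(logits, ab_target):
    """logits: [B, K, H, W] over K colour bins; ab_target: [B, 2, H, W]."""
    target = quantize_ab(ab_target)
    return F.cross_entropy(logits, target)

# Toy example: 484 = 22 * 22 bins.
logits = torch.randn(2, 484, 32, 32)
ab = torch.empty(2, 2, 32, 32).uniform_(-110, 110)
print(colourisation_loss(logits, ab).item())
```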
ISBN (print): 9783031048814; 9783031048807
This paper lies at the intersection of three research areas: human action recognition, egocentric vision, and visual event-based sensors. The main goal is the comparison of egocentric action recognition performance under either of two visual sources: conventional images or event-based visual data. In this work, the events, as triggered by asynchronous event sensors or their simulation, are spatio-temporally aggregated into event frames (a grid-like representation). This allows exactly the same neural model to be used for both visual sources, thus easing a fair comparison. Specifically, a hybrid neural architecture combining a convolutional neural network and a recurrent network is used. It is empirically found that this general architecture works for both conventional gray-level frames and event frames. This finding is relevant because it reveals that no modification or adaptation is strictly required to deal with event data for egocentric action classification. Interestingly, action recognition is found to perform better with event frames, suggesting that these data provide discriminative information that helps the neural model learn good features.
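A minimal sketch of the spatio-temporal aggregation step follows: asynchronous events (x, y, t, polarity) are binned into a fixed number of temporal windows and accumulated on a pixel grid, one channel per polarity. The event tuple layout and bin counts are assumptions; the paper's exact event-frame construction may differ.

```python
import numpy as np

def events_to_frames(events, height, width, num_frames, t_start, t_end):
    """Aggregate asynchronous events into a grid-like event-frame tensor.

    events: array of shape [N, 4] with columns (x, y, t, polarity in {-1, +1}).
    Returns frames of shape [num_frames, 2, height, width], one channel per
    polarity, each cell counting the events that fell into that bin.
    """
    frames = np.zeros((num_frames, 2, height, width), dtype=np.float32)
    x = events[:, 0].astype(int)
    y = events[:, 1].astype(int)
    t = events[:, 2]
    p = events[:, 3]
    # Temporal bin for each event.
    bin_idx = np.clip(((t - t_start) / (t_end - t_start) * num_frames).astype(int),
                      0, num_frames - 1)
    pol_idx = (p > 0).astype(int)                     # 0 = negative, 1 = positive
    np.add.at(frames, (bin_idx, pol_idx, y, x), 1.0)
    return frames

# Toy example: 1000 random events on a 64x64 sensor over 1 second.
ev = np.stack([np.random.randint(0, 64, 1000),
               np.random.randint(0, 64, 1000),
               np.random.uniform(0, 1, 1000),
               np.random.choice([-1, 1], 1000)], axis=1)
print(events_to_frames(ev, 64, 64, num_frames=8, t_start=0.0, t_end=1.0).shape)
```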
ISBN (print): 9781665445092
In this work we present SwiftNet for real-time semi-supervised video object segmentation (one-shot VOS), which reports 77.8% J&F and 70 FPS on the DAVIS 2017 validation dataset, leading all existing solutions in overall accuracy and speed. We achieve this by elaborately compressing spatiotemporal redundancy in matching-based VOS via Pixel-Adaptive Memory (PAM). Temporally, PAM adaptively triggers memory updates on frames where objects display noteworthy inter-frame variation. Spatially, PAM selectively performs memory update and matching on dynamic pixels while ignoring static ones, significantly reducing redundant computation wasted on segmentation-irrelevant pixels. To promote efficient reference encoding, a light-aggregation encoder is also introduced in SwiftNet, deploying reversed sub-pixel convolution. We hope SwiftNet can set a strong and efficient baseline for real-time VOS and facilitate its application in mobile vision.
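To make the two gating decisions concrete, here is a heavily simplified sketch of frame-level and pixel-level selection using plain frame differencing. PAM's actual criteria operate on learned features and are more elaborate; the thresholds and differencing rule here are assumptions for illustration only.

```python
import torch

def should_update_memory(prev_frame, cur_frame, frame_thresh=0.05):
    """Trigger a memory update only when inter-frame variation is noteworthy.
    Frames: [3, H, W] in [0, 1]; the differencing criterion is illustrative."""
    variation = (cur_frame - prev_frame).abs().mean()
    return variation.item() > frame_thresh

def dynamic_pixel_mask(prev_frame, cur_frame, pixel_thresh=0.1):
    """Select dynamic pixels; static ones are skipped during memory update/match."""
    diff = (cur_frame - prev_frame).abs().mean(dim=0)   # [H, W]
    return diff > pixel_thresh

prev = torch.rand(3, 64, 64)
cur = prev.clone()
cur[:, 20:40, 20:40] += 0.5                             # simulate a moving object
print(should_update_memory(prev, cur), dynamic_pixel_mask(prev, cur).sum().item())
```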
ISBN (digital): 9798350353006
ISBN (print): 9798350353013
Despite significant progress in the field, it is still challenging to create personalized visual representations that align closely with the desires and preferences of individual users. This process requires users to articulate their ideas in words that are both comprehensible to the models and accurately capture their vision, posing difficulties for many users. In this paper, we tackle this challenge by leveraging historical user interactions with the system to enhance user prompts. We propose a novel approach that rewrites user prompts based on a newly collected large-scale text-to-image dataset with over 300k prompts from 3,115 users. Our rewriting model enhances the expressiveness and alignment of user prompts with their intended visual outputs. Experimental results demonstrate the superiority of our methods over baseline approaches, as evidenced by our new offline evaluation method and online tests. Our code and dataset are available at https://***/zzjchen/Tailored-visions
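The rewriting model itself is learned from the collected dataset; as a hedged illustration of "leveraging historical interactions", the sketch below merely retrieves a user's most similar past prompts (using crude token overlap as a stand-in for any learned retrieval) and packs them as context for a rewriter. All names and the similarity measure are assumptions, not the paper's pipeline.

```python
def jaccard(a, b):
    """Token-overlap similarity between two prompts (a crude stand-in)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    return len(sa & sb) / max(len(sa | sb), 1)

def build_rewrite_input(current_prompt, user_history, top_k=3):
    """Retrieve the user's most similar past prompts and pack them as context
    for a rewriting model (the model itself is assumed to exist elsewhere)."""
    ranked = sorted(user_history, key=lambda p: jaccard(current_prompt, p), reverse=True)
    return {"prompt": current_prompt, "history": ranked[:top_k]}

history = ["a cat in watercolor style", "watercolor landscape at dusk", "pixel art dog"]
print(build_rewrite_input("a dog in watercolor", history))
```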
ISBN (print): 9781665445092
We define the concept of CompositeTasking as the fusion of multiple, spatially distributed tasks for various aspects of image understanding. Learning to perform spatially distributed tasks is motivated by the frequent availability of only sparse labels across tasks and by the desire for a compact multi-tasking network. To facilitate CompositeTasking, we introduce a novel task-conditioning model: a single encoder-decoder network that performs multiple, spatially varying tasks at once. The proposed network takes an image and a set of pixel-wise dense task requests as inputs, and performs the requested prediction task for each pixel. Moreover, we also learn the composition of tasks to be performed according to CompositeTasking rules, which include deciding where to apply which task. This not only gives us a compact network for multi-tasking, but also allows for task editing. Another strength of the proposed method is that it only requires sparse supervision per task. The obtained results are on par with our baselines that use dense supervision and a multi-headed multi-tasking design. The source code will be made publicly available at ***/nikola3794/composite-tasking.
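To illustrate pixel-wise task conditioning in isolation, the sketch below maps each pixel's requested task id to an embedding that modulates the feature at that pixel (a FiLM-style scale-and-shift). This is an assumed mechanism for exposition; the paper's actual conditioning model may differ.

```python
import torch
import torch.nn as nn

class PixelwiseTaskConditioning(nn.Module):
    """Modulate decoder features per pixel according to the requested task."""
    def __init__(self, num_tasks, channels):
        super().__init__()
        self.scale = nn.Embedding(num_tasks, channels)
        self.shift = nn.Embedding(num_tasks, channels)

    def forward(self, feats, task_map):
        # feats: [B, C, H, W]; task_map: [B, H, W] with integer task ids.
        gamma = self.scale(task_map).permute(0, 3, 1, 2)   # [B, C, H, W]
        beta = self.shift(task_map).permute(0, 3, 1, 2)
        return feats * gamma + beta

cond = PixelwiseTaskConditioning(num_tasks=4, channels=32)
feats = torch.randn(1, 32, 16, 16)
tasks = torch.randint(0, 4, (1, 16, 16))
print(cond(feats, tasks).shape)                            # torch.Size([1, 32, 16, 16])
```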
ISBN (print): 9781665445092
The effectiveness of learning-based point cloud upsampling pipelines heavily relies on the upsampling modules and feature extractors used therein. For the point upsampling module, we propose a novel model called NodeShuffle, which uses a Graph Convolutional Network (GCN) to better encode local point information from point neighborhoods. NodeShuffle is versatile and can be incorporated into any point cloud upsampling pipeline. Extensive experiments show that NodeShuffle consistently improves state-of-the-art upsampling methods. For feature extraction, we also propose a new multi-scale point feature extractor, called Inception DenseGCN. By aggregating features at multiple scales, this feature extractor enables further performance gains in the final upsampled point clouds. We combine Inception DenseGCN with NodeShuffle into a new point upsampling pipeline called PU-GCN. PU-GCN sets new state-of-the-art performance with far fewer parameters and more efficient inference. Our code is publicly available at https://***/guochengqian/PU-GCN.
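The shuffle step can be read as the point-cloud analogue of pixel shuffle: expand per-point features r-fold, then fold the extra channels into r-times more points. The sketch below shows only that reshaping idea; NodeShuffle itself performs the expansion with a GCN over point neighborhoods, whereas a shared point-wise MLP (1x1 convolution) stands in here as an assumption for brevity.

```python
import torch
import torch.nn as nn

class NodeShuffleLike(nn.Module):
    """Expand per-point features, then shuffle channels into r-times more points."""
    def __init__(self, in_channels, out_channels, r):
        super().__init__()
        self.r = r
        # Stand-in for the GCN expansion used by the actual NodeShuffle.
        self.expand = nn.Conv1d(in_channels, out_channels * r, kernel_size=1)

    def forward(self, feats):
        # feats: [B, C_in, N] per-point features.
        x = self.expand(feats)                       # [B, C_out * r, N]
        B, _, N = x.shape
        x = x.view(B, -1, self.r, N)                 # [B, C_out, r, N]
        return x.reshape(B, -1, self.r * N)          # [B, C_out, r * N]

up = NodeShuffleLike(in_channels=64, out_channels=64, r=4)
pts_feat = torch.randn(2, 64, 256)
print(up(pts_feat).shape)                            # torch.Size([2, 64, 1024])
```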