检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

23,136 篇 会议
90 篇 期刊文献
15 册 图书

馆藏范围

23,240 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

13,632 篇 工学
- 11,162 篇 计算机科学与技术...
- 3,338 篇 软件工程
- 2,413 篇 机械工程
- 1,664 篇 光学工程
- 1,204 篇 电气工程
- 973 篇 控制科学与工程
- 738 篇 信息与通信工程
- 381 篇 仪器科学与技术
- 322 篇 生物工程
- 239 篇 生物医学工程（可授...
- 188 篇 电子科学与技术（可...
- 109 篇 化学工程与技术
- 104 篇 安全科学与工程
- 99 篇 测绘科学与技术
- 85 篇 建筑学
- 83 篇 交通运输工程
- 82 篇 土木工程
- 56 篇 力学（可授工学、理...
3,695 篇 医学
- 3,683 篇 临床医学
- 76 篇 基础医学(可授医学...
3,138 篇 理学
- 1,880 篇 物理学
- 1,605 篇 数学
- 547 篇 统计学（可授理学、...
- 466 篇 生物学
- 243 篇 系统科学
- 107 篇 化学
491 篇 管理学
- 290 篇 图书情报与档案管...
- 212 篇 管理科学与工程(可...
- 74 篇 工商管理
252 篇 艺术学
- 251 篇 设计学（可授艺术学...
58 篇 法学
38 篇 农学
25 篇 教育学
19 篇 经济学
10 篇 军事学
3 篇 文学

主题

10,396 篇 computer vision
3,893 篇 pattern recognit...
3,101 篇 training
2,104 篇 computational mo...
1,898 篇 visualization
1,800 篇 cameras
1,487 篇 feature extracti...
1,475 篇 three-dimensiona...
1,464 篇 shape
1,447 篇 image segmentati...
1,287 篇 robustness
1,234 篇 computer archite...
1,213 篇 semantics
1,112 篇 benchmark testin...
1,111 篇 conferences
1,104 篇 layout
1,093 篇 object detection
1,085 篇 computer science
1,026 篇 codes
907 篇 face recognition

机构

137 篇 univ sci & techn...
124 篇 univ chinese aca...
121 篇 chinese univ hon...
108 篇 tsinghua univers...
108 篇 carnegie mellon ...
105 篇 microsoft resear...
97 篇 zhejiang univ pe...
91 篇 swiss fed inst t...
85 篇 university of sc...
84 篇 zhejiang univers...
81 篇 shanghai ai lab ...
79 篇 university of ch...
75 篇 shanghai jiao to...
69 篇 microsoft res as...
68 篇 alibaba grp peop...
66 篇 adobe research
65 篇 national laborat...
64 篇 peking univ peop...
61 篇 univ oxford oxfo...
59 篇 peng cheng labor...

作者

80 篇 van gool luc
71 篇 timofte radu
65 篇 zhang lei
43 篇 luc van gool
40 篇 yang yi
37 篇 loy chen change
34 篇 li stan z.
33 篇 liu yang
33 篇 xiaoou tang
33 篇 murino vittorio
33 篇 chen chen
33 篇 qi tian
33 篇 li fei-fei
32 篇 tian qi
32 篇 sun jian
30 篇 ying shan
30 篇 pascal fua
29 篇 darrell trevor
28 篇 li xin
28 篇 hanqing lu

语言

23,043 篇 英文
171 篇 其他
20 篇 中文
5 篇 土耳其文
2 篇 日文

检索条件"任意字段=IEEE/CVF Conference on Computer Vision and Pattern Recognition"

共 23241 条记录，以下是4861-4870 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Stacked Deep Multi-Scale Hierarchical Network for Fast Bokeh Effect Rendering from a Single Image

Stacked Deep Multi-Scale Hierarchical Network for Fast Bokeh...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Dutta, Saikat Das, Sourya Dipta Shah, Nisarg A. Tiwari, Anil Kumar IIT Madras Chennai Tamil Nadu India Jadavpur Univ Kolkata India IIT Jodhpur Jodhpur Rajasthan India

ISBN: (纸本)9781665448994

The Bokeh Effect is one of the most desirable effects in photography for rendering artistic and aesthetic photos. Usually, it requires a DSLR camera with different aperture and shutter settings and certain photography skills to generate this effect. In smartphones, computational methods and additional sensors are used to overcome the physical lens and sensor limitations to achieve such effect. Most of the existing methods utilized additional sensor's data or pre-trained network for fine depth estimation of the scene and sometimes use portrait segmentation pretrained network module to segment salient objects in the image. Because of these reasons, networks have many parameters, become runtime intensive and unable to run in mid-range devices. In this paper, we used an end-to-end Deep Multi-Scale Hierarchical Network (DMSHN) model for direct Bokeh effect rendering of images captured from the monocular camera. To further improve the perceptual quality of such effect, a stacked model consisting of two DMSHN modules is also proposed. Our model does not rely on any pretrained network module for Monocular Depth Estimation or Saliency Detection, thus significantly reducing the size of model and run time. Stacked DMSHN achieves state-of-the-art results on a large scale EBB! dataset with around 6x less runtime compared to the current state-of-the-art model in processing HD quality images.

关键词： Photography Image segmentation Runtime Estimation Rendering (computer graphics) Cameras Real-time systems

来源：评论

学校读者我要写书评

暂无评论

M3DSSD: Monocular 3D Single Stage Object Detector

M3DSSD: Monocular 3D Single Stage Object Detector

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Luo, Shujie Dai, Hang Shao, Ling Ding, Yong Zhejiang Univ Coll Informat Sci & Elect Engn Hangzhou Peoples R China Zhejiang Univ Sch Micronano Elect Hangzhou Peoples R China Mohamed Bin Zayed Univ Artificial Intelligence Abu Dhabi U Arab Emirates Incept Inst Artificial Intelligence Abu Dhabi U Arab Emirates

ISBN: (纸本)9781665445092

In this paper, we propose a Monocular 3D Single Stage object Detector (M3DSSD) with feature alignment and asymmetric non-local attention. Current anchor-based monocular 3D object detection methods suffer from feature mismatching. To overcome this, we propose a two-step feature alignment approach. In the first step, the shape alignment is performed to enable the receptive field of the feature map to focus on the pre-defined anchors with high confidence scores. In the second step, the center alignment is used to align the features at 2D/3D centers. Further, it is often difficult to learn global information and capture long-range relationships, which are important for the depth prediction of objects. Therefore, we propose a novel asymmetric non-local attention block with multi-scale sampling to extract depth-wise features. The proposed M3DSSD achieves significantly better performance than the monocular 3D object detection methods on the KITH dataset, in both 3D object detection and bird's eye view tasks.

关键词： Solid modeling computer vision Three-dimensional displays Shape Computational modeling Object detection Detectors

来源：评论

学校读者我要写书评

暂无评论

Towards Extremely Compact RNNs for Video recognition with Fully Decomposed Hierarchical Tucker Structure

Towards Extremely Compact RNNs for Video Recognition with Fu...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Yin, Miao Liao, Siyu Liu, Xiao-Yang Wang, Xiaodong Yuan, Bo Rutgers State Univ Newark NJ 07101 USA Amazon Seattle WA USA Columbia Univ New York NY 10027 USA

ISBN: (纸本)9781665445092

Recurrent Neural Networks (RNNs) have been widely used in sequence analysis and modeling. However, when processing high-dimensional data, RNNs typically require very large model sizes, thereby bringing a series of deployment challenges. Although various prior works have been proposed to reduce the RNN model sizes, executing RNN models in the resource-restricted environments is still a very challenging problem. In this paper, we propose to develop extremely compact RNN models with fully decomposed hierarchical Tucker (FDHT) structure. The HT decomposition does not only provide much higher storage cost reduction than the other tensor decomposition approaches, but also brings better accuracy performance improvement for the compact RNN models. Meanwhile, unlike the existing tensor decomposition-based methods that can only decompose the input-to-hidden layer of RNNs, our proposed fully decomposition approach enables the comprehensive compression for the entire RNN models with maintaining very high accuracy. Our experimental results on several popular video recognition datasets show that, our proposed fully decomposed hierarchical tucker-based LSTM (FDHT-LSTM) is extremely compact and highly efficient. To the best of our knowledge, FDHT-LSTM, for the first time, consistently achieves very high accuracy with only few thousand parameters (3,132 to 8,808) on different datasets. Compared with the state-of-the-art compressed RNN models, such as TT-LSTM, TR-LSTM and BT-LSTM, our FDHT-LSTM simultaneously enjoys both order-of-magnitude (3,985x to 10,711x) fewer parameters and significant accuracy improvement (0.6% to 12.7%).

关键词： computer vision Analytical models Tensors Recurrent neural networks Sequences Costs Computational modeling

来源：评论

学校读者我要写书评

暂无评论

A Closer Look at Self-training for Zero-Label Semantic Segmentation

A Closer Look at Self-training for Zero-Label Semantic Segme...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Pastore, Giuseppe Cermelli, Fabio Xian, Yongqin Mancini, Massimiliano Akata, Zeynep Caputo, Barbara Politecn Torino Turin Italy Italian Inst Technol Genoa Italy MPI Informat Saarbrucken Germany Univ Tubingen Tubingen Germany MPI Intelligent Syst Saarbrucken Germany

ISBN: (纸本)9781665448994

Being able to segment unseen classes not observed during training is an important technical challenge in deep learning, because of its potential to reduce the expensive annotation required for semantic segmentation. Prior zero-label semantic segmentation works approach this task by learning visual-semantic embeddings or generative models. However, they are prone to overfitting on the seen classes because there is no training signal for them. In this paper, we study the challenging generalized zero-label semantic segmentation task where the model has to segment both seen and unseen classes at test time. We assume that pixels of unseen classes could be present in the training images but without being annotated. Our idea is to capture the latent information on unseen classes by supervising the model with self-produced pseudo-labels for unlabeled pixels. We propose a consistency regularizer to filter out noisy pseudolabels by taking the intersections of the pseudo-labels generated from different augmentations of the same image. Our framework generates pseudo-labels and then retrain the model with human-annotated and pseudo-labelled data. This procedure is repeated for several iterations. As a result, our approach achieves the new state-of-the-art on PascalVOC12 and COCO-stuff datasets in the challenging generalized zero-label semantic segmentation setting, surpassing other existing methods addressing this task with more complex strategies. Code can be found at https: //***/giuseppepastore10/STRICT.

关键词： Training Image segmentation Semantics Pipelines Predictive models Information filters pattern recognition

来源：评论

学校读者我要写书评

暂无评论

LFNAT 2023 Challenge on Light Field Depth Estimation: Methods and Results

LFNAT 2023 Challenge on Light Field Depth Estimation: Method...

引用

2023 ieee/cvf conference on computer vision and pattern recognition Workshops, CVPRW 2023

作者： Sheng, Hao Liu, Yebin Yu, Jingyi Wu, Gaochang Xiong, Wei Guo, Longzhao Xie, Yanlin Zhang, Shuo Chang, Song Lin, Youfang Chao, Wentao Wang, Xuechun Wang, Guanghui Duan, Fuqing Wang, Tun Yang, Da Cui, Zhenglong Wang, Sizhe Zhao, Mingyuan Wang, Qiong Chen, Qianyu Liang, Zhengyu Wang, Yingqian Yang, Jungang Yang, Xueting Deng, Junli Cong, Ruixuan Chen, Rongshan State Key Laboratory of Virtual Reality Technology and Systems School of Computer Science and Engineering Beihang University China Beihang Hangzhou Innovation Institute Yuhang China Faculty of Applied Sciences Macao Polytechnic University China Tsinghua University China Shanghaitech University China State Key Laboratory of Synthetical Automation for Process Industries Northeastern University China Beijing Meet Yuan Co. Ltd China Beijing Key Laboratory of Traffic Data Analysis and Mining School of Computer and Information Technology Beijing Jiaotong University China Beijing Normal University China Toronto Metropolitan University Canada College of Computer Science and Technology Zhejiang University of Technology China National University of Defense Technology China School of Information and Communication Engineering Communication University of China China

ISBN: (纸本)9798350302493

This paper reviews the 1st LFNAT challenge on light field depth estimation, which aims at predicting disparity information of central view image in a light field (i.e., pixel offset between central view image and adjacent view image). Compared to multi-view stereo matching, light field depth estimation emphasizes efficient utilization of the 2D angular information from multiple regularly varying views. This challenge specifies UrbanLF [20] light field dataset as the sole data source. There are two phases in total: submission phase and final evaluation phase, in which 75 registered participants successfully submit their predicted results in the first phase and 7 eligible teams compete in the second phase. The performance of all submissions is carefully reviewed and shown in this paper as a new standard for the current state-of-the-art in light field depth estimation. Moreover, the implementation details of these methods are also provided to stimulate related advanced research. © 2023 ieee.

关键词： Stereo image processing

来源：评论

学校读者我要写书评

暂无评论

LAFEAT: Piercing Through Adversarial Defenses with Latent Features

LAFEAT: Piercing Through Adversarial Defenses with Latent Fe...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Yu, Yunrui Gao, Xitong Xu, Cheng-Zhong Univ Macau Macau Sar Peoples R China Chinese Acad Sci Shenzhen Inst Adv Technol Shenzhen Peoples R China

ISBN: (纸本)9781665445092

Deep convolutional neural networks are susceptible to adversarial attacks. They can be easily deceived to give an incorrect output by adding a tiny perturbation to the input. This presents a great challenge in making CNNs robust against such attacks. An influx of new defense techniques have been proposed to this end. In this paper, we show that latent features in certain "robust" models are surprisingly susceptible to adversarial attacks. On top of this, we introduce a unified l(infinity)-norm white-box attack algorithm which harnesses latent features in its gradient descent steps, namely LAFEAT. We show that not only is it computationally much more efficient for successful attacks, but it is also a stronger adversary than the current state-of-the-art across a wide range of defense mechanisms. This suggests that model robustness could be contingent on the effective use of the defender's hidden components, and it should no longer be viewed from a holistic perspective.

关键词： Degradation Schedules computer vision Computational modeling Perturbation methods Lattices Robustness

来源：评论

学校读者我要写书评

暂无评论

Difficulty Estimation with Action Scores for computer vision Tasks

Difficulty Estimation with Action Scores for Computer Vision...

引用

ieee computer Society conference on computer vision and pattern recognition Workshops (CVPRW)

作者： Octavio Arriaga Sebastian Palacio Matias Valdenegro-Toro University of Bremen German Research Center for Artificial Intelligence University of Groningen

As more machine learning models are now being applied in real world scenarios it has become crucial to evaluate their difficulties and biases. In this paper we present an unsupervised method for calculating a difficulty score based on the accumulated loss per epoch. Our proposed method does not require any modification to the model, neither any external supervision, and it can be easily applied to a wide range of machine learning tasks. We provide results for the tasks of image classification, image segmentation, and object detection. We compare our score against similar metrics and provide theoretical and empirical evidence of their difference. Furthermore, we show applications of our proposed score for detecting incorrect labels, and test for possible biases.

关键词：

来源：评论

学校读者我要写书评

暂无评论

ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-shot Learning

ECKPN: Explicit Class Knowledge Propagation Network for Tran...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Chen, Chaofan Yang, Xiaoshan Xu, Changsheng Huang, Xuhui Ma, Zhe Univ Sci & Technol China USTC Sch Informat Sci & Technol Hefei Anhui Peoples R China Chinese Acad Sci CASIA Inst Automat Natl Lab Pattern Recognit NLPR Beijing Peoples R China Univ Chinese Acad Sci UCAS Sch Artificial Intelligence Beijing Peoples R China Second Acad CASIC X Lab Beijing Peoples R China

ISBN: (数字)9781665445092

ISBN: (纸本)9781665445092

Recently, the transductive graph-based methods have achieved great success in the few-shot classification task. However, most existing methods ignore exploring the class-level knowledge that can be easily learned by humans from just a handful of samples. In this paper, we propose an Explicit Class Knowledge Propagation Network (ECKPN), which is composed of the comparison, squeeze and calibration modules, to address this problem. Specifically, we first employ the comparison module to explore the pairwise sample relations to learn rich sample representations in the instance-level graph. Then, we squeeze the instance-level graph to generate the class-level graph, which can help obtain the class-level visual knowledge and facilitate modeling the relations of different classes. Next, the calibration module is adopted to characterize the relations of the classes explicitly to obtain the more discriminative class-level knowledge representations. Finally, we combine the class-level knowledge with the instance-level sample representations to guide the inference of the query samples. We conduct extensive experiments on four few-shot classification benchmarks, and the experimental results show that the proposed ECKPN significantly outperforms the state-of-the art methods.

关键词： Visualization computer vision Art Computational modeling Knowledge representation Benchmark testing Calibration

来源：评论

学校读者我要写书评

暂无评论

Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings

Combining Semantic Guidance and Deep Reinforcement Learning ...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Singh, Jaskirat Zheng, Liang Australian Natl Univ Canberra ACT Australia

ISBN: (纸本)9781665445092

Generation of stroke-based non-photorealistic imagery, is an important problem in the computer vision community. As an endeavor in this direction, substantial recent research efforts have been focused on teaching machines "how to paint", in a manner similar to a human painter. However, the applicability of previous methods has been limited to datasets with little variation in position, scale and saliency of the foreground object. As a consequence, we find that these methods struggle to cover the granularity and diversity possessed by real world images. To this end, we propose a Semantic Guidance pipeline with 1) a bi-level painting procedure for learning the distinction between foreground and background brush strokes at training time. 2) We also introduce invariance to the position and scale of the foreground object through a neural alignment model, which combines object localization and spatial transformer networks in an end to end manner, to zoom into a particular semantic instance. 3) The distinguishing features of the in-focus object are then amplified by maximizing a novel guided backpropagation based focus reward. The proposed agent does not require any supervision on human stroke-data and successfully handles variations in foreground object attributes, thus, producing much higher quality canvases for the CUB-200 Birds [29] and Stanford Cars-196 [17] datasets. Finally, we demonstrate the further efficacy of our method on complex datasets with multiple foreground object instances by evaluating an extension of our method on the challenging Virtual-KITTI [2] dataset. Source code and models are available at https://***/1jsingh/semantic-guidance.

关键词： Backpropagation Training computer vision Semantics Pipelines Reinforcement learning Stroke (medical condition)

来源：评论

学校读者我要写书评

暂无评论

Generalized Foggy-Scene Semantic Segmentation by Frequency Decoupling

Generalized Foggy-Scene Semantic Segmentation by Frequency D...

引用

ieee computer Society conference on computer vision and pattern recognition Workshops (CVPRW)

作者： Qi Bi Shaodi You Theo Gevers Computer Vision Research Group University of Amsterdam Amsterdam The Netherlands

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Foggy-scene semantic segmentation (FSSS) is highly challenging due to the diverse effects of fog on scene properties and the limited training data. Existing research has mainly focused on domain adaptation for FSSS, which has practical limitations when dealing with new scenes. In our paper, we introduce domain-generalized FSSS, which can work effectively on unknown distributions without extensive training. To address domain gaps, we propose a frequency decoupling (FreD) approach that separates fog-related effects (amplitude) from scene semantics (phase) in feature representations. Our method is compatible with both CNN and vision Transformer backbones and outperforms existing approaches in various scenarios.

关键词： Training computer vision Frequency-domain analysis Semantic segmentation conferences Semantics Training data

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 483 484 485 486 487 488 489 490 491 492 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：