Search Results - Inner Mongolia University Library

Document type

19,438 papers: Conference
46 papers: Journal articles
5 volumes: Books

Holdings

19,488 papers: Electronic documents
1 title: Print holdings

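Subject classification

12,440 papers: Engineering
- 10,282 papers: Computer Science and Technology...
- 2,395 papers: Mechanical Engineering
- 2,007 papers: Software Engineering
- 813 papers: Optical Engineering
- 531 papers: Electrical Engineering
- 419 papers: Control Science and Engineering
- 322 papers: Information and Communication Engineering
- 210 papers: Surveying and Mapping Science and Technology
- 80 papers: Biomedical Engineering (degree awardable in...
- 73 papers: Electronic Science and Technology (degree awardable in...
- 70 papers: Bioengineering
- 60 papers: Instrument Science and Technology
- 38 papers: Architecture
- 36 papers: Civil Engineering
- 33 papers: Mechanics (degree awardable in Engineering, Science...
- 31 papers: Aeronautical and Astronautical Science and Techn...
- 26 papers: Safety Science and Engineering
- 20 papers: Materials Science and Engineering (degree awardable in...
- 20 papers: Transportation Engineering
3,409 papers: Medicine
- 3,408 papers: Clinical Medicine
1,980 papers: Science
- 1,006 papers: Mathematics
- 973 papers: Physics
- 359 papers: Statistics (degree awardable in Science, ...
- 336 papers: Biology
- 231 papers: Systems Science
- 24 papers: Chemistry
258 papers: Management
- 138 papers: Management Science and Engineering (degree awardable in...
- 122 papers: Library, Information and Archives Manag...
- 27 papers: Business Administration
19 papers: Law
- 19 papers: Sociology
14 papers: Agriculture
8 papers: Education
7 papers: Economics
3 papers: Military Science
3 papers: Art Studies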

Topic

7,893 papers: computer vision
2,727 papers: training
2,680 papers: pattern recognit...
1,760 papers: computational mo...
1,644 papers: visualization
1,410 papers: cameras
1,372 papers: three-dimensiona...
1,327 papers: shape
1,213 papers: face recognition
1,207 papers: image segmentati...
1,164 papers: feature extracti...
1,109 papers: robustness
1,087 papers: semantics
983 papers: layout
959 papers: object detection
949 papers: computer archite...
942 papers: benchmark testin...
931 papers: codes
902 papers: computer science
859 papers: deep learning

Institution

174 papers: univ sci & techn...
161 papers: carnegie mellon ...
148 papers: univ chinese aca...
144 papers: chinese univ hon...
110 papers: microsoft resear...
106 papers: tsinghua univ pe...
103 papers: zhejiang univ pe...
99 papers: swiss fed inst t...
92 papers: tsinghua univers...
89 papers: microsoft res as...
88 papers: shanghai ai lab ...
81 papers: zhejiang univers...
76 papers: alibaba grp peop...
73 papers: university of sc...
73 papers: hong kong univ s...
72 papers: peking univ peop...
72 papers: university of ch...
68 papers: shanghai jiao to...
66 papers: univ oxford oxfo...
66 papers: shanghai jiao to...

Author

79 papers: van gool luc
70 papers: zhang lei
59 papers: timofte radu
48 papers: yang yi
47 papers: xiaoou tang
45 papers: luc van gool
43 papers: darrell trevor
43 papers: tian qi
42 papers: loy chen change
42 papers: sun jian
42 papers: li fei-fei
40 papers: qi tian
38 papers: li stan z.
36 papers: chen xilin
36 papers: torralba antonio
35 papers: vasconcelos nuno
35 papers: shan shiguang
35 papers: liu yang
34 papers: liu xiaoming
34 papers: tao dacheng

Language

19,483 papers: English
2 papers: Japanese
2 papers: Other
2 papers: Chinese

Search query: "Any field = IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2000"

19,489 records in total; showing records 4791-4800

Delving into Data: Effectively Substitute Training for Black-box Attack

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wang, Wenxuan; Yin, Bangjie; Yao, Taiping; Zhang, Li; Fu, Yanwei; Ding, Shouhong; Li, Jilin; Huang, Feiyue; Xue, Xiangyang

Affiliations: Fudan Univ Sch Comp Sci Shanghai Peoples R China; Fudan Univ Shanghai Key Lab Intelligent Informat Proc Shanghai Peoples R China; Fudan Univ MOE Frontiers Ctr Brain Sci Sch Data Sci Shanghai Peoples R China; Tencent Youtu Lab Shenzhen Guangdong Peoples R China

ISBN: 9781665445092 (print)

Deep models have shown their vulnerability when processing adversarial samples. As for the black-box attack, without access to the architecture and weights of the attacked model, training a substitute model for adversarial attacks has attracted wide attention. Previous substitute training approaches focus on stealing the knowledge of the target model based on real training data or synthetic data, without exploring what kind of data can further improve the transferability between the substitute and target models. In this paper, we propose a novel perspective on substitute training that focuses on designing the distribution of data used in the knowledge stealing process. More specifically, a diverse data generation module is proposed to synthesize large-scale data with a wide distribution. An adversarial substitute training strategy is also introduced to focus on the data distributed near the decision boundary. The combination of these two modules can further boost the consistency of the substitute model and target model, which greatly improves the effectiveness of adversarial attacks. Extensive experiments demonstrate the efficacy of our method against state-of-the-art competitors under non-target and target attack settings. Detailed visualization and analysis are also provided to help understand the advantage of our method.

Keywords: Training; Computer vision; Computational modeling; Training data; Distributed databases; Data visualization; Data models

Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Paschalidou, Despoina; Katharopoulos, Angelos; Geiger, Andreas; Fidler, Sanja

Affiliations: Max Planck Inst Intelligent Syst Tubingen Tubingen Germany; Univ Tubingen Tubingen Germany; Idiap Res Inst Martigny Switzerland; Ecole Polytech Fed Lausanne EPFL Lausanne Switzerland; Max Planck ETH Ctr Learning Syst Zurich Switzerland; NVIDIA Santa Clara CA 95051 USA; Univ Toronto Toronto ON Canada; Vector Inst Toronto ON Canada

ISBN: 9781665445092 (print)

Impressive progress in 3D shape extraction has led to representations that can capture object geometries with high fidelity. In parallel, primitive-based methods seek to represent objects as semantically consistent part arrangements. However, due to the simplicity of existing primitive representations, these methods fail to accurately reconstruct 3D shapes using a small number of primitives/parts. We address the trade-off between reconstruction quality and number of parts with Neural Parts, a novel 3D primitive representation that defines primitives using an Invertible Neural Network (INN) which implements homeomorphic mappings between a sphere and the target object. The INN allows us to compute the inverse mapping of the homeomorphism, which, in turn, enables the efficient computation of both the implicit surface function of a primitive and its mesh, without any additional post-processing. Our model learns to parse 3D objects into semantically consistent part arrangements without any part-level supervision. Evaluations on ShapeNet, D-FAUST and FreiHAND demonstrate that our primitives can capture complex geometries and thus simultaneously achieve geometrically accurate as well as interpretable reconstructions using an order of magnitude fewer primitives than state-of-the-art shape abstraction methods.

Keywords: Geometry; Solid modeling; Computer vision; Three-dimensional displays; Shape; Computational modeling; Neural networks

Delving into Localization Errors for Monocular 3D Object Detection

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ma, Xinzhu; Zhang, Yinmin; Xu, Dan; Zhou, Dongzhan; Yi, Shuai; Li, Haojie; Ouyang, Wanli

Affiliations: Univ Sydney Sydney NSW Australia; Hong Kong Univ Sci & Technol Hong Kong Peoples R China; SenseTime Res Hong Kong Peoples R China; Dalian Univ Technol Dalian Peoples R China

ISBN: 9781665445092 (print)

Estimating 3D bounding boxes from monocular images is an essential component in autonomous driving, while accurate 3D object detection from this kind of data is very challenging. In this work, through intensive diagnosis experiments, we quantify the impact introduced by each sub-task and find that the 'localization error' is the vital factor restricting monocular 3D detection. We also investigate the underlying reasons behind localization errors, analyze the issues they might bring, and propose three strategies. First, we revisit the misalignment between the center of the 2D bounding box and the projected center of the 3D object, which is a vital factor leading to low localization accuracy. Second, we observe that accurately localizing distant objects with existing technologies is almost impossible, while those samples will mislead the learned network. To this end, we propose to remove such samples from the training set to improve the overall performance of the detector. Lastly, we propose a novel 3D IoU oriented loss for the size estimation of the object, which is not affected by 'localization error'. We conduct extensive experiments on the KITTI dataset, where the proposed method achieves real-time detection and outperforms previous methods by a large margin. The code will be made available at: https://***/xinzhuma/monodle.

Keywords: Location awareness; Training; Computer vision; Three-dimensional displays; Estimation; Object detection; Detectors

Color Me Good: Branding in the Coloring Style of Movie Posters

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Agrawal, Rishabh; Sivaprasad, Sarath; Pedanekar, Niranjan

Affiliations: TCS Res Tata Consultancy Serv Pune 411013 Maharashtra India

ISBN: 9781665448994 (print)

Brand logos are often rendered in a different style based on a context such as an event promotion. For example, Warner Bros. uses a different variety of their brand logo for different movies for promotion and aesthetic appeal. In this paper, we propose an automated method to render brand logos in the coloring style of branding material such as movie posters. For this, we adopt a photo-realistic neural style transfer method using movie posters as the style source. We propose a color-based image segmentation and matching method to assign style segments to logo segments. Using these, we render the well-known Warner Bros. logo in the coloring style of 141 movie posters. We also present survey results where 287 participants rate the machine-stylized logos for their representativeness and visual appeal.

Keywords: Image segmentation; Visualization; Computer vision; Image color analysis; Conferences; Semantics; Brightness

3DInAction: Understanding Human Actions in 3D Point Clouds

Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Yizhak Ben-Shabat; Oren Shrout; Stephen Gould

Affiliations: Australian National University; Technion Israel Institute of Technology

ISBN: 9798350353006 (digital)

ISBN: 9798350353013 (print)

We propose a novel method for 3D point cloud action recognition. Understanding human actions in RGB videos has been widely studied in recent years; however, its 3D point cloud counterpart remains under-explored despite the clear value that 3D information may bring. This is mostly due to the inherent limitations of the point cloud data modality (lack of structure, permutation invariance, and varying number of points), which make it difficult to learn a spatiotemporal representation. To address these limitations, we propose the 3DInAction pipeline that first estimates patches moving in time (t-patches) as a key building block, alongside a hierarchical architecture that learns an informative spatiotemporal representation. We show that our method achieves improved performance on existing datasets, including DFAUST and IKEA ASM. Code is publicly available at https://***/sitzikbs/3dincaction.

Keywords: Point cloud compression; Computer vision; Three-dimensional displays; Codes; Architecture; Pipelines; Computer architecture

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Zheng, Sixiao; Lu, Jiachen; Zhao, Hengshuang; Zhu, Xiatian; Luo, Zekun; Wang, Yabiao; Fu, Yanwei; Feng, Jianfeng; Xiang, Tao; Torr, Philip H. S.; Zhang, Li

Affiliations: Fudan Univ Shanghai Peoples R China; Univ Oxford Oxford England; Univ Surrey Guildford Surrey England; Tencent Youtu Lab Shanghai Peoples R China; Facebook AI Menlo Pk CA USA

ISBN: 9781665445092 (print)

Most recent semantic segmentation methods adopt a fully-convolutional network (FCN) with an encoder-decoder architecture. The encoder progressively reduces the spatial resolution and learns more abstract/semantic visual concepts with larger receptive fields. Since context modeling is critical for segmentation, the latest efforts have been focused on increasing the receptive field, through either dilated/atrous convolutions or inserting attention modules. However, the encoder-decoder based FCN architecture remains unchanged. In this paper, we aim to provide an alternative perspective by treating semantic segmentation as a sequence-to-sequence prediction task. Specifically, we deploy a pure transformer (i.e., without convolution and resolution reduction) to encode an image as a sequence of patches. With the global context modeled in every layer of the transformer, this encoder can be combined with a simple decoder to provide a powerful segmentation model, termed SEgmentation TRansformer (SETR). Extensive experiments show that SETR achieves a new state of the art on ADE20K (50.28% mIoU) and Pascal Context (55.83% mIoU), and competitive results on Cityscapes. In particular, we achieve the first position in the highly competitive ADE20K test server leaderboard on the day of submission.

Keywords: Image segmentation; Visualization; Semantics; Transformers; Decoding; Pattern recognition; Servers

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Yang, Zongxin; Yu, Xin; Yang, Yi

Affiliations: Baidu Res Beijing Peoples R China; Univ Technol Sydney ReLER Sydney NSW Australia

ISBN: 9781665445092 (print)

Compared to 2D object bounding-box labeling, it is very difficult for humans to annotate 3D object poses, especially when depth images of scenes are unavailable. This paper investigates whether we can estimate the object poses effectively when only RGB images and 2D object annotations are given. To this end, we present a two-step pose estimation framework to attain 6DoF object poses from 2D object bounding-boxes. In the first step, the framework learns to segment objects from real and synthetic data in a weakly-supervised fashion, and the segmentation masks act as a prior for pose estimation. In the second step, we design a dual-scale pose estimation network, namely DSC-PoseNet, to predict object poses by employing a differentiable renderer. Specifically, our DSC-PoseNet first predicts object poses at the original image scale by comparing the segmentation masks and the rendered visible object masks. Then, we resize object regions to a fixed scale to estimate poses once again. In this fashion, we eliminate large scale variations and focus on rotation estimation, thus facilitating pose estimation. Moreover, we exploit the initial pose estimation to generate pseudo ground-truth to train our DSC-PoseNet in a self-supervised manner. The estimation results at these two scales are ensembled as our final pose estimation. Extensive experiments on widely-used benchmarks demonstrate that our method outperforms state-of-the-art models trained on synthetic data by a large margin and is even on par with several fully-supervised methods.

Keywords: Training; Image segmentation; Computer vision; Three-dimensional displays; Annotations; Pose estimation; Benchmark testing

Quasi-Dense Similarity Learning for Multiple Object Tracking

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Pang, Jiangmiao; Qiu, Linlu; Li, Xia; Chen, Haofeng; Li, Qi; Darrell, Trevor; Yu, Fisher

Affiliations: Zhejiang Univ Hangzhou Zhejiang Peoples R China; Georgia Inst Technol Atlanta GA 30332 USA; Swiss Fed Inst Technol Zurich Switzerland; Stanford Univ Stanford CA 94305 USA; Univ Calif Berkeley Berkeley CA USA

ISBN: 9781665445092 (print)

Similarity learning has been recognized as a crucial step for object tracking. However, existing multiple object tracking methods only use sparse ground truth matching as the training objective, while ignoring the majority of the informative regions on the images. In this paper, we present Quasi-Dense Similarity Learning, which densely samples hundreds of region proposals on a pair of images for contrastive learning. We can directly combine this similarity learning with existing detection methods to build Quasi-Dense Tracking (QDTrack) without turning to displacement regression or motion priors. We also find that the resulting distinctive feature space admits a simple nearest neighbor search at inference time. Despite its simplicity, QDTrack outperforms all existing methods on MOT, BDD100K, Waymo, and TAO tracking benchmarks. It achieves 68.7 MOTA at 20.3 FPS on MOT17 without using external training data. Compared to methods with similar detectors, it improves MOTA by almost 10 points and significantly decreases the number of ID switches on the BDD100K and Waymo datasets.

Keywords: Training; Image segmentation; Computer vision; Codes; Training data; Detectors; Nearest neighbor methods

Combinatorial Learning of Graph Edit Distance via Dynamic Embedding

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wang, Runzhong; Zhang, Tianqi; Yu, Tianshu; Yan, Junchi; Yang, Xiaokang

Affiliations: Shanghai Jiao Tong Univ Dept Comp Sci & Engn Shanghai Peoples R China; Shanghai Jiao Tong Univ MoE Key Lab Artificial Intelligence Shanghai Peoples R China; Arizona State Univ Tempe AZ 85287 USA

ISBN: 9781665445092 (print)

Graph Edit Distance (GED) is a popular similarity measurement for pairwise graphs, and it also refers to the recovery of the edit path from the source graph to the target graph. The traditional A* algorithm suffers from scalability issues due to its exhaustive nature, and its search heuristics rely heavily on human prior knowledge. This paper presents a hybrid approach that combines the interpretability of traditional search-based techniques for producing the edit path with the efficiency and adaptivity of deep embedding models to achieve a cost-effective GED solver. Inspired by dynamic programming, node-level embedding is designated in a dynamic reuse fashion and suboptimal branches are encouraged to be pruned. To this end, our method can be readily integrated into the A* procedure in a dynamic fashion and significantly reduces the computational burden with a learned heuristic. Experimental results on different graph datasets show that our approach can remarkably ease the search process of A* without sacrificing much accuracy. To the best of our knowledge, this work is also the first deep learning-based GED method for recovering the edit path.

Keywords: Computer vision; Adaptation models; Costs; Scalability; Heuristic algorithms; Computational modeling; Search problems

Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Hu, Qingyong; Yang, Bo; Khalid, Sheikh; Xiao, Wen; Trigoni, Niki; Markham, Andrew

Affiliations: Univ Oxford Oxford England; Hong Kong Polytech Univ Hong Kong Peoples R China; Sensat Ltd London England; Newcastle Univ Newcastle Upon Tyne Tyne & Wear England

ISBN: 9781665445092 (print)

An essential prerequisite for unleashing the potential of supervised deep learning algorithms in the area of 3D scene understanding is the availability of large-scale and richly annotated datasets. However, publicly available datasets either cover relatively small spatial scales or have limited semantic annotations due to the high cost of data acquisition and data annotation, which severely limits the development of fine-grained semantic understanding in the context of 3D point clouds. In this paper, we present an urban-scale photogrammetric point cloud dataset with nearly three billion richly annotated points, which is three times the number of labeled points in the largest existing photogrammetric point cloud dataset. Our dataset consists of large areas from three UK cities, covering about 7.6 km² of the city landscape. In the dataset, each 3D point is labeled as one of 13 semantic classes. We extensively evaluate the performance of state-of-the-art algorithms on our dataset and provide a comprehensive analysis of the results. In particular, we identify several key challenges towards urban-scale point cloud understanding. The dataset is available at https://***/QingyongHu/SensatUrban.

Keywords: Deep learning; Computer vision; Three-dimensional displays; Costs; Annotations; Semantics; Urban areas

Address: 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021