检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

29,393 篇 会议
1,358 册 图书
225 篇 期刊文献

馆藏范围

30,974 篇 电子文献
2 种 纸本馆藏

日期分布

学科分类号

17,280 篇 工学
- 13,630 篇 计算机科学与技术...
- 5,193 篇 软件工程
- 2,983 篇 机械工程
- 2,643 篇 光学工程
- 1,421 篇 控制科学与工程
- 1,411 篇 电气工程
- 1,344 篇 信息与通信工程
- 656 篇 生物工程
- 577 篇 仪器科学与技术
- 513 篇 生物医学工程（可授...
- 468 篇 电子科学与技术（可...
- 251 篇 化学工程与技术
- 212 篇 安全科学与工程
- 140 篇 交通运输工程
- 132 篇 建筑学
- 123 篇 材料科学与工程（可...
- 119 篇 土木工程
5,054 篇 理学
- 3,127 篇 物理学
- 2,406 篇 数学
- 824 篇 生物学
- 802 篇 统计学（可授理学、...
- 299 篇 系统科学
- 228 篇 化学
3,831 篇 医学
- 3,799 篇 临床医学
- 185 篇 基础医学(可授医学...
- 140 篇 药学(可授医学、理...
1,059 篇 管理学
- 617 篇 图书情报与档案管...
- 467 篇 管理科学与工程(可...
- 145 篇 工商管理
373 篇 艺术学
- 373 篇 设计学（可授艺术学...
116 篇 法学
81 篇 农学
48 篇 教育学
43 篇 经济学
18 篇 军事学
8 篇 文学

主题

12,602 篇 computer vision
5,697 篇 pattern recognit...
3,180 篇 training
2,263 篇 cameras
2,178 篇 computational mo...
2,116 篇 feature extracti...
2,048 篇 image segmentati...
1,970 篇 visualization
1,967 篇 shape
1,642 篇 robustness
1,493 篇 layout
1,476 篇 three-dimensiona...
1,445 篇 computer science
1,338 篇 computer archite...
1,296 篇 object detection
1,220 篇 semantics
1,142 篇 face recognition
1,107 篇 conferences
1,077 篇 benchmark testin...
1,056 篇 humans

机构

137 篇 univ sci & techn...
134 篇 tsinghua univers...
134 篇 univ chinese aca...
118 篇 chinese univ hon...
101 篇 microsoft resear...
97 篇 zhejiang univers...
94 篇 national laborat...
93 篇 shanghai jiao to...
93 篇 zhejiang univ pe...
85 篇 university of sc...
79 篇 shanghai ai lab ...
78 篇 swiss fed inst t...
65 篇 microsoft res as...
62 篇 adobe research
62 篇 computer vision ...
61 篇 peking univ peop...
58 篇 univ oxford oxfo...
57 篇 google mountain ...
57 篇 hong kong univ s...
56 篇 google res mount...

作者

107 篇 umapada pal
81 篇 van gool luc
68 篇 zhang lei
59 篇 timofte radu
41 篇 yang yi
37 篇 loy chen change
37 篇 hanqing lu
33 篇 liu yang
33 篇 xiaoou tang
32 篇 nassir navab
32 篇 wang liang
30 篇 tian qi
29 篇 h. bischof
29 篇 jan-michael frah...
29 篇 vittorio murino
29 篇 darrell trevor
27 篇 li xin
27 篇 vasconcelos nuno
27 篇 murino vittorio
27 篇 chen chen

语言

30,833 篇 英文
92 篇 中文
73 篇 其他
6 篇 土耳其文
2 篇 日文
2 篇 俄文
1 篇 西班牙文

检索条件"任意字段=Conference on Computer Vision and Pattern Recognition"

共 30976 条记录，以下是4991-5000 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Humble Teachers Teach Better Students for Semi-Supervised Object Detection

Humble Teachers Teach Better Students for Semi-Supervised Ob...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Tang, Yihe Chen, Weifeng Luo, Yijun Zhang, Yuting Carnegie Mellon Univ Pittsburgh PA 15213 USA Amazon Web Serv Seattle WA USA

ISBN: (纸本)9781665445092

We propose a semi-supervised approach for contemporary object detectors following the teacher-student dual model framework. Our method I is featured with 1) the exponential moving averaging strategy to update the teacher from the student online, 2) using plenty of region proposals and soft pseudo-labels as the student's training targets, and 3) a light-weighted detection-specific data ensemble for the teacher to generate more reliable pseudo-labels. Compared to the recent state-of-the-art - STAC, which uses hard labels on sparsely selected hard pseudo samples, the teacher in our model exposes richer information to the student with soft-labels on many proposals. Our model achieves COCO-style AP of 53.04% on VOC07 val set, 8.4% better than STAC, when using VOC12 as unlabeled data. On MSCOCO, it outperforms prior work when only a small percentage of data is taken as labeled. It also reaches 53.8% AP on MS-COCO test-dev with 3.1% gain over the fully supervised ResNet-152 Cascaded R-CNN, by tapping into unlabeled data of a similar size to the labeled data.

关键词： Training computer vision Object detection Detectors Benchmark testing Feature extraction Data models

来源：评论

学校读者我要写书评

暂无评论

HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences

HumanGPS: Geodesic PreServing Feature for Dense Human Corres...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Tan, Feitong Tang, Danhang Dou, Mingsong Guo, Kaiwen Pandey, Rohit Keskin, Cem Du, Ruofei Sun, Deqing Bouaziz, Sofien Fanello, Sean Tan, Ping Zhang, Yinda Google Mountain View CA 94043 USA Simon Fraser Univ Burnaby BC Canada

ISBN: (纸本)9781665445092

In this paper, we address the problem of building dense correspondences between human images under arbitrary camera viewpoints and body poses. Prior art either assumes small motion between frames or relies on local descriptors, which cannot handle large motion or visually ambiguous body parts, e.g., left vs. right hand. In contrast, we propose a deep learning framework that maps each pixel to a feature space, where the feature distances reflect the geodesic distances among pixels as if they were projected onto the surface of a 3D human scan. To this end, we introduce novel loss functions to push features apart according to their geodesic distances on the surface. Without any semantic annotation, the proposed embeddings automatically learn to differentiate visually similar parts and align different subjects into an unified feature space. Extensive experiments show that the learned embeddings can produce accurate correspondences between images with remarkable generalization capabilities on both intra and inter subjects.(1)

关键词： Deep learning Image segmentation computer vision Three-dimensional displays Shape Semantics Buildings

来源：评论

学校读者我要写书评

暂无评论

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation 31

GeoNet: Geometric Neural Network for Joint Depth and Surface...

引用

31st IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Qi, Xiaojuan Liao, Renjie Liu, Zhengzhe Urtasun, Raquel Jia, Jiaya Chinese Univ Hong Kong Hong Kong Peoples R China Univ Toronto Toronto ON Canada Uber Adv Technol Grp Pittsburgh PA 15201 USA Tencent YouTu Lab Shenzhen Peoples R China

ISBN: (纸本)9781538664209

In this paper, we propose Geometric Neural Network (GeoNet) to jointly predict depth and surface normal maps from a single image. Building on top of two-stream CNNs, our GeoNet incorporates geometric relation between depth and surface normal via the new depth-to-normal and normal to -depth networks. Depth-to-normal network exploits the least square solution of surface normal from depth and improves its quality with a residual module. Normal-to-depth network, contrarily, refines the depth map based on the constraints from the surface normal through a kernel regression module, which has no parameter to learn. These two networks enforce the underlying model to efficiently predict depth and surface normal for high consistency and corresponding accuracy. Our experiments on NYU v2 dataset verify that our GeoNet is able to predict geometrically consistent depth and normal maps. It achieves top performance on surface normal estimation and is on par with state-of-theart depth estimation methods.

关键词： Neural networks Estimation Three-dimensional displays computer architecture Rough surfaces Surface roughness Kernel

来源：评论

学校读者我要写书评

暂无评论

Bidirectional Retrieval Made Simple 31

Bidirectional Retrieval Made Simple

引用

31st IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Wehrmann, Jonatas Barros, Rodrigo C. Pontiffcia Univ Catolica Rio Grande do Sul Sch Technol Porto Alegre RS Brazil

ISBN: (纸本)9781538664209

This paper provides a very simple yet effective character-level architecture for learning bidirectional retrieval models. Aligning multimodal content is particularly challenging considering the difficulty in finding semantic correspondence between images and descriptions. We introduce an efficient character-level inception module, designed to learn textual semantic embeddings by convolving raw characters in distinct granularity levels. Our approach is capable of explicitly encoding hierarchical information from distinct base-level representations (e.g., characters, words, and sentences) into a shared multimodal space, where it maps the semantic correspondence between images and descriptions via a contrastive pairwise loss function that minimizes order-violations. Models generated by our approach are far more robust to input noise than state-of-the-art strategies based on word-embeddings. Despite being conceptually much simpler and requiring fewer parameters, our models outperform the state-of-the-art approaches by 4.8% in the task of description retrieval and 2.7% (absolute R@ 1 values) in the task of image retrieval in the popular MS COCO retrieval dataset. We also show that our models present solid performance for text classification, specially in multilingual and noisy domains.

关键词： computer architecture Semantics Training Feature extraction Task analysis Complexity theory Image coding

来源：评论

学校读者我要写书评

暂无评论

Picture: A Probabilistic Programming Language for Scene Perception

<i>Picture</i>: A Probabilistic Programming Language for Sce...

引用

IEEE conference on computer vision and pattern recognition (CVPR)

作者： Kulkarni, Tejas D. Kohli, Pushmeet Tenenbaum, Joshua B. Mansinghka, Vikash MIT Cambridge MA 02139 USA Microsoft Res Redmond WA USA

ISBN: (纸本)9781467369640

Recent progress on probabilistic modeling and statistical learning, coupled with the availability of large training datasets, has led to remarkable progress in computer vision. Generative probabilistic models, or "analysis-by-synthesis" approaches, can capture rich scene structure but have been less widely applied than their discriminative counterparts, as they often require considerable problem-specific engineering in modeling and inference, and inference is typically seen as requiring slow, hypothesize-and-test Monte Carlo methods. Here we present Picture, a probabilistic programming language for scene understanding that allows researchers to express complex generative vision models, while automatically solving them using fast general-purpose inference machinery. Picture provides a stochastic scene language that can express generative models for arbitrary 2D/3D scenes, as well as a hierarchy of representation layers for comparing scene hypotheses with observed images by matching not simply pixels, but also more abstract features (e.g., contours, deep neural network activations). Inference can flexibly integrate advanced Monte Carlo strategies with fast bottom-up data-driven methods. Thus both representations and inference strategies can build directly on progress in discriminatively trained systems to make generative vision more robust and efficient. We use Picture to write programs for 3D face analysis, 3D human pose estimation, and 3D object reconstruction - each competitive with specially engineered baselines.

关键词： picture computer Programming Languages Probabilistic Model Statistical learning Programming image matching Monte Carlo technique Social Hierarchy

来源：评论

学校读者我要写书评

暂无评论

Towards Robust and Reproducible Active Learning using Neural Networks

Towards Robust and Reproducible Active Learning using Neural...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Munjal, Prateek Hayat, Nasir Hayat, Munawar Sourati, Jamshid Khan, Shadab G42 Healthcare Abu Dhabi U Arab Emirates NYUAD Abu Dhabi U Arab Emirates Monash Univ Clayton Vic Australia Univ Chicago Chicago IL 60637 USA

ISBN: (数字)9781665469463

ISBN: (纸本)9781665469463

Active learning (AL) is a promising ML paradigm that has the potential to parse through large unlabeled data and help reduce annotation cost in domains where labeling data can be prohibitive. Recently proposed neural network based AL methods use different heuristics to accomplish this goal. In this study, we demonstrate that under identical experimental settings, different types of AL algorithms (uncertainty based, diversity based, and committee based) produce an inconsistent gain over random sampling baseline. Through a variety of experiments, controlling for sources of stochasticity, we show that variance in performance metrics achieved by AL algorithms can lead to results that are not consistent with the previously reported results. We also found that under strong regularization, AL methods show marginal or no advantage over the random sampling baseline under a variety of experimental conditions. Finally, we conclude with a set of recommendations on how to assess the results using a new AL algorithm to ensure results are reproducible and robust under changes in experimental conditions. We share our codes to facilitate AL evaluations. We believe our findings and recommendations will help advance reproducible research in AL using neural networks.

关键词： Measurement computer vision Uncertainty Costs Codes Annotations Neural networks

来源：评论

学校读者我要写书评

暂无评论

Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in Videos

Cannot See the Forest for the Trees: Aggregating Multiple Vi...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Hwang, Sukjun Heo, Miran Oh, Seoung Wug Kim, Seon Joo Yonsei Univ Seoul South Korea Adobe Res San Jose CA USA

ISBN: (数字)9781665469463

ISBN: (纸本)9781665469463

Recently, both long-tailed recognition and object tracking have made great advances individually. MO benchmark presented a mixture of the two, long-tailed object tracking, in order to further reflect the aspect of the real-world. To date, existing solutions have adopted detectors showing robustness in long-tailed distributions, which derive per-frame results. Then, they used tracking algorithms that combine the temporally independent detections to finalize tracklets. However, as the approaches did not take temporal changes in scenes into account, inconsistent classification results in videos led to low overall performance. In this paper, we present a set classifier that improves accuracy of classifying tracklets by aggregating information from multiple viewpoints contained in a tracklet. To cope with sparse annotations in videos, we further propose augmentation of tracklets that can maximize data efficiency. The set classifier is plug-and-playable to existing object trackers, and highly improves the performance of long-tailed object tracking. By simply attaching our method to QDTrack on top of ResNet-101, we achieve the new state-of-the-art, 19.9% and 15.7% TrackAP(50) on TAO validation and test sets, respectively. Our code is available at this link(1).

关键词： Vocabulary computer vision Codes Annotations Detectors Benchmark testing Robustness

来源：评论

学校读者我要写书评

暂无评论

Heterogeneous Grid Convolution for Adaptive, Efficient, and Controllable Computation

Heterogeneous Grid Convolution for Adaptive, Efficient, and ...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Hamaguchi, Ryuhei Furukawa, Yasutaka Onishi, Masaki Sakurada, Ken Natl Inst Adv Ind Sci & Technol Tokyo Japan Simon Fraser Univ Burnaby BC Canada

ISBN: (纸本)9781665445092

This paper proposes a novel heterogeneous grid convolution that builds a graph-based image representation by exploiting heterogeneity in the image content, enabling adaptive, efficient, and controllable computations in a convolutional architecture. More concretely, the approach builds a data-adaptive graph structure from a convolutional layer by a differentiable clustering method, pools features to the graph, performs a novel direction-aware graph convolution, and unpool features back to the convolutional layer. By using the developed module, the paper proposes heterogeneous grid convolutional networks, highly efficient yet strong extension of existing architectures. We have evaluated the proposed approach on four image understanding tasks, semantic segmentation, object localization, road extraction, and salient object detection. The proposed method is effective on three of the four tasks. Especially, the method outperforms a strong baseline with more than 90% reduction in floating-point operations for semantic segmentation, and achieves the state-of-the-art result for road extraction. We will share our code, model, and data.

关键词： Convolutional codes Location awareness Image segmentation Convolution Roads Semantics computer architecture

来源：评论

学校读者我要写书评

暂无评论

Extracting Triangular 3D Models, Materials, and Lighting From Images

Extracting Triangular 3D Models, Materials, and Lighting Fro...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Munkberg, Jacob Chen, Wenzheng Hasselgren, Jon Evans, Alex Shen, Tianchang Muller, Thomas Gao, Jun Fidler, Sanja NVIDIA Santa Clara CA 95051 USA Univ Toronto Toronto ON Canada Vector Inst Toronto ON Canada

ISBN: (数字)9781665469463

ISBN: (纸本)9781665469463

We present an efficient method for joint optimization of topology, materials and lighting from multi-view image observations. Unlike recent multi-view reconstruction approaches, which typically produce entangled 3D representations encoded in neural networks, we output triangle meshes with spatially-varying materials and environment lighting that can be deployed in any traditional graphics engine unmodified. We leverage recent work in differentiable rendering, coordinate-based networks to compactly represent volumetric texturing, alongside differentiable marching tetrahedrons to enable gradient-based optimization directly on the surface mesh. Finally, we introduce a differentiable formulation of the split sum approximation of environment lighting to efficiently recover all-frequency lighting. Experiments show our extracted models used in advanced scene editing, material decomposition, and high quality view interpolation, all running at interactive rates in triangle-based renderers (rasterizers and path tracers).

关键词： Graphics Solid modeling Three-dimensional displays Computational modeling Lighting Rendering (computer graphics) Topology

来源：评论

学校读者我要写书评

暂无评论

Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization

Ray3D: ray-based 3D human pose estimation for monocular abso...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Zhan, Yu Li, Fenghai Weng, Renliang Choi, Wongun Aibee Inc Beijing Peoples R China Beijing Technol & Business Univ Beijing Peoples R China

ISBN: (数字)9781665469463

ISBN: (纸本)9781665469463

In this paper, we propose a novel monocular ray-based 3D (Ray3D) absolute human pose estimation with calibrated camera. Accurate and generalizable absolute 3D human pose estimation from monocular 2D pose input is an ill-posed problem. To address this challenge, we convert the input from pixel space to 3D normalized rays. This conversion makes our approach robust to camera intrinsic parameter changes. To deal with the in-the-wild camera extrinsic parameter variations, Ray3D explicitly takes the camera extrinsic parameters as an input and jointly models the distribution between the 3D pose rays and camera extrinsic parameters. This novel network design is the key to the outstanding generalizability of Ray3D approach. To have a comprehensive understanding of how the camera intrinsic and extrinsic parameter variations affect the accuracy of absolute 3D key-point localization, we conduct in-depth systematic experiments on three single person 3D benchmarks as well as one synthetic benchmark. These experiments demonstrate that our method significantly outperforms existing state-of-the-art models.

关键词： Location awareness Solid modeling computer vision Three-dimensional displays Systematics Image analysis Pose estimation

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 491 492 493 494 495 496 497 498 499 500 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：