检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

12,844 篇 会议
13 篇 期刊文献
2 册 图书

馆藏范围

12,859 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

7,573 篇 工学
- 6,863 篇 计算机科学与技术...
- 880 篇 机械工程
- 814 篇 软件工程
- 435 篇 控制科学与工程
- 360 篇 光学工程
- 306 篇 电气工程
- 209 篇 仪器科学与技术
- 124 篇 信息与通信工程
- 91 篇 生物工程
- 62 篇 生物医学工程（可授...
- 39 篇 电子科学与技术（可...
- 34 篇 安全科学与工程
- 26 篇 化学工程与技术
- 21 篇 交通运输工程
- 20 篇 建筑学
- 18 篇 土木工程
2,957 篇 医学
- 2,956 篇 临床医学
- 15 篇 基础医学(可授医学...
- 12 篇 药学(可授医学、理...
700 篇 理学
- 359 篇 物理学
- 225 篇 数学
- 175 篇 系统科学
- 95 篇 统计学（可授理学、...
- 93 篇 生物学
- 22 篇 化学
201 篇 艺术学
- 201 篇 设计学（可授艺术学...
84 篇 管理学
- 59 篇 图书情报与档案管...
- 25 篇 管理科学与工程(可...
- 14 篇 工商管理
23 篇 法学
- 21 篇 社会学
5 篇 农学
4 篇 教育学
2 篇 经济学
1 篇 军事学

主题

6,464 篇 computer vision
2,688 篇 training
2,437 篇 pattern recognit...
1,780 篇 computational mo...
1,522 篇 visualization
1,348 篇 three-dimensiona...
1,091 篇 computer archite...
1,063 篇 semantics
997 篇 benchmark testin...
976 篇 codes
970 篇 conferences
854 篇 feature extracti...
830 篇 cameras
771 篇 task analysis
707 篇 deep learning
646 篇 image segmentati...
611 篇 object detection
595 篇 shape
554 篇 transformers
538 篇 neural networks

机构

132 篇 univ sci & techn...
122 篇 carnegie mellon ...
120 篇 tsinghua univ pe...
114 篇 univ chinese aca...
113 篇 chinese univ hon...
94 篇 tsinghua univers...
91 篇 zhejiang univ pe...
91 篇 swiss fed inst t...
85 篇 peng cheng lab p...
81 篇 university of ch...
80 篇 zhejiang univers...
77 篇 shanghai ai lab ...
77 篇 peng cheng labor...
75 篇 university of sc...
69 篇 shanghai jiao to...
68 篇 shanghai jiao to...
67 篇 alibaba grp peop...
67 篇 stanford univ st...
66 篇 univ hong kong p...
64 篇 sensetime res pe...

作者

77 篇 timofte radu
63 篇 van gool luc
45 篇 zhang lei
36 篇 yang yi
36 篇 luc van gool
34 篇 tao dacheng
31 篇 loy chen change
29 篇 chen chen
28 篇 sun jian
28 篇 qi tian
25 篇 li xin
24 篇 liu yang
24 篇 tian qi
24 篇 ying shan
23 篇 wang xinchao
23 篇 zha zheng-jun
23 篇 boxin shi
21 篇 zhou jie
21 篇 vasconcelos nuno
20 篇 luo ping

语言

12,849 篇 英文
9 篇 其他
1 篇 中文

检索条件"任意字段=IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"

共 12859 条记录，以下是4511-4520 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Spatially Consistent Representation Learning

Spatially Consistent Representation Learning

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Roh, Byungseok Shin, Wuhyun Kim, Ildoo Kim, Sungwoong Kakao Brain Seoul South Korea

ISBN: (纸本)9781665445092

Self-supervised learning has been widely used to obtain transferrable representations from unlabeled images. Especially, recent contrastive learning methods have shown impressive performances on downstream image classification tasks. While these contrastive methods mainly focus on generating invariant global representations at the image-level under semantic-preserving transformations, they are prone to overlook spatial consistency of local representations and therefore have a limitation in pretraining for localization tasks such as object detection and instance segmentation. Moreover, aggressively cropped views used in existing contrastive methods can minimize representation distances between the semantically different regions of a single image. In this paper, we propose a spatially consistent representation learning algorithm (SCRL) for multi-object and location-specific tasks. In particular, we devise a novel self-supervised objective that tries to produce coherent spatial representations of a randomly cropped local region according to geometric translations and zooming operations. On various downstream localization tasks with benchmark datasets, the proposed SCRL shows significant performance improvements over the image-level supervised pretraining as well as the state-of-the-art self-supervised learning methods.

关键词： Location awareness Learning systems Image segmentation computer vision Codes Object detection Benchmark testing

来源：评论

学校读者我要写书评

暂无评论

Differentiable SLAM-net: Learning Particle SLAM for Visual Navigation

Differentiable SLAM-net: Learning Particle SLAM for Visual N...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Karkus, Peter Cai, Shaojun Hsu, David Natl Univ Singapore Singapore Singapore

ISBN: (纸本)9781665445092

Simultaneous localization and mapping (SLAM) remains challenging for a number of downstream applications, such as visual robot navigation, because of rapid turns, featureless walls, and poor camera quality. We introduce the Differentiable SLAM Network (SLAM-net) along with a navigation architecture to enable planar robot navigation in previously unseen indoor environments. SLAM-net encodes a particle filter based SLAM algorithm in a differentiable computation graph, and learns task-oriented neural network components by backpropagating through the SLAM algorithm. Because it can optimize all model components jointly for the end-objective, SLAM-net learns to be robust in challenging conditions. We run experiments in the Habitat platform with different real-world RGB and RGB-D datasets. SLAM-net significantly outperforms the widely adapted ORB-SLAM in noisy conditions. Our navigation architecture with SLAMnet improves the state-of-the-art for the Habitat Challenge 2020 PointNav task by a large margin (37% to 64% success).

关键词： Visualization Simultaneous localization and mapping Navigation Robot vision systems Neural networks computer architecture Particle filters

来源：评论

学校读者我要写书评

暂无评论

Intrinsic Image Harmonization

Intrinsic Image Harmonization

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Guo, Zonghui Zheng, Haiyong Jiang, Yufeng Gu, Zhaorui Zheng, Bing Ocean Univ China Underwater Vis Lab Qingdao Shandong Peoples R China Ocean Univ China Sanya Oceanog Inst Qingdao Shandong Peoples R China

ISBN: (纸本)9781665445092

Compositing an image usually inevitably suffers from inharmony problem that is mainly caused by incompatibility of foreground and background from two different images with distinct surfaces and lights, corresponding to material-dependent and light-dependent characteristics, namely, reflectance and illumination intrinsic images, respectively. Therefore, we seek to solve image harmonization via separable harmonization of reflectance and illumination, i.e., intrinsic image harmonization. Our method is based on an autoencoder that disentangles composite image into reflectance and illumination for further separate harmonization. Specifically, we harmonize reflectance through material-consistency penalty, while harmonize illumination by learning and transferring light from background to foreground, moreover, we model patch relations between foreground and background of composite images in an inharmony-free learning way, to adaptively guide our intrinsic image harmonization. Both extensive experiments and ablation studies demonstrate the power of our method as well as the efficacy of each component. We also contribute a new challenging dataset for benchmarking illumination harmonization. Code and dataset are at https://***/zhenglab/IntrinsicHarmony.

关键词： Reflectivity computer vision Adaptation models Codes Lighting Benchmark testing pattern recognition

来源：评论

学校读者我要写书评

暂无评论

End-to-End Object Detection with Fully Convolutional Network

End-to-End Object Detection with Fully Convolutional Network

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Wang, Jianfeng Song, Lin Li, Zeming Sun, Hongbin Sun, Jian Zheng, Nanning Megvii Technol Beijing Peoples R China Xi An Jiao Tong Univ Coll Artificial Intelligence Xian Peoples R China

ISBN: (纸本)9781665445092

Mainstream object detectors based on the fully convolutional network has achieved impressive performance. While most of them still need a hand-designed non-maximum suppression (NMS) post-processing, which impedes fully endto-end training. In this paper, we give the analysis of discarding NMS, where the results reveal that a proper label assignment plays a crucial role. To this end, for fully convolutional detectors, we introduce a Prediction-aware One-To-One (POTO) label assignment for classification to enable end-to-end detection, which obtains comparable performance with NMS. Besides, a simple 3D Max Filtering (3DMF) is proposed to utilize the multi-scale features and improve the discriminability of convolutions in the local region. With these techniques, our end-to-end framework achieves competitive performance against many state-of-the-art detectors with NMS on COCO and CrowdHuman datasets.

关键词： Convolutional codes Training computer vision Three-dimensional displays Filtering Detectors Object detection

来源：评论

学校读者我要写书评

暂无评论

Blind Deblurring for Saturated Images

Blind Deblurring for Saturated Images

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Chen, Liang Zhang, Jiawei Lin, Songnan Fang, Faming Ren, Jimmy S. East China Normal Univ Proc Sch Comp Sci & Technol Shanghai Key Lab Multidimens Informat Shanghai Peoples R China SenseTime Res Shanghai Peoples R China Shanghai Jiao Tong Univ Qing Yuan Res Inst Shanghai Peoples R China

ISBN: (纸本)9781665445092

Blind deblurring has received considerable attention in recent years. However, state-of-the-art methods often fail to process saturated blurry images. The main reason is that pixels around saturated regions are not conforming to the commonly used linear blur model. Pioneer arts suggest excluding these pixels during the deblurring process, which sometimes simultaneously removes the informative edges around saturated regions and results in insufficient information for kernel estimation when large saturated regions exist. To address this problem, we introduce a new blur model to fit both saturated and unsaturated pixels, and all informative pixels can be considered during the deblurring process. Based on our model, we develop an effective maximum a posterior (MAP)-based optimization framework. Quantitative and qualitative evaluations on benchmark datasets and challenging real-world examples show that the proposed method performs favorably against existing methods.

关键词： computer vision Art Image edge detection Estimation Benchmark testing pattern recognition Image restoration

来源：评论

学校读者我要写书评

暂无评论

Sparse Multi-Path Corrections in Fringe Projection Profilometry

Sparse Multi-Path Corrections in Fringe Projection Profilome...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Zhang, Yu Lau, Daniel Wipf, David Nanjing Univ Nanjing Peoples R China Univ Kentucky Lexington KY 40506 USA Amazon Seattle WA USA

ISBN: (纸本)9781665445092

Three-dimensional scanning by means of structured light illumination is an active imaging technique involving projecting and capturing a series of striped patterns and then using the observed warping of stripes to reconstruct the target object's surface through triangulating each pixel in the camera to a unique projector coordinate corresponding to a particular feature in the projected patterns. The undesirable phenomenon of multi-path occurs when a camera pixel simultaneously sees features from multiple projector coordinates. Bimodal multi-path is a particularly common situation found along step edges, where the camera pixel sees both a foreground and background surface. Generalized from bimodal multi-path, this paper examines the phenomenon of sparse or N-modal multi-path as a more general case, where the camera pixel sees no fewer than two reflective surfaces, resulting in decoding errors. Using fringe projection profilometry, our proposed solution is to treat each camera pixel as an underdetermined linear system of equations and to find the sparsest (least number of paths) solution by taking an application-specific Bayesian learning approach. We validate this algorithm with both simulations and a number of challenging real-world scenarios, demonstrating that it outperforms state-of-the-art techniques.

关键词： Linear systems Surface reconstruction computer vision Lighting Cameras Mathematical models pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Practical Wide-Angle Portraits Correction with Deep Structured Models

Practical Wide-Angle Portraits Correction with Deep Structur...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Tan, Jing Zhao, Shan Xiong, Pengfei Liu, Jiangyu Fan, Haoqiang Liu, Shuaicheng Megvii Res Chengdu Peoples R China Tencent Chengdu Peoples R China Univ Elect Sci & Technol China Chengdu Peoples R China

ISBN: (纸本)9781665445092

Wide-angle portraits often enjoy expanded views. However, they contain perspective distortions, especially noticeable when capturing group portrait photos, where the background is skewed and faces are stretched. This paper introduces the first deep learning based approach to remove such artifacts from freely-shot photos. Specifically, given a wide-angle portrait as input, we build a cascaded network consisting of a LineNet, a ShapeNet, and a transition module (TM), which corrects perspective distortions on the background, adapts to the stereographic projection on facial regions, and achieves smooth transitions between these two projections, accordingly. To train our network, we build the first perspective portrait dataset with a large diversity in identities, scenes and camera modules. For the quantitative evaluation, we introduce two novel metrics, line consistency and face congruence. Compared to the previous state-of-the-art approach, our method does not require camera distortion parameters. We demonstrate that our approach significantly outperforms the previous state-of-the-art approach both qualitatively and quantitatively.

关键词： Measurement Deep learning computer vision Distortion Cameras Robustness pattern recognition

来源：评论

学校读者我要写书评

暂无评论

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions

NExT-QA: Next Phase of Question-Answering to Explaining Temp...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Xiao, Junbin Shang, Xindi Yao, Angela Chua, Tat-Seng Natl Univ Singapore Dept Comp Sci Singapore Singapore

ISBN: (纸本)9781665445092

We introduce NExT-QA, a rigorously designed video question answering (VideoQA) benchmark to advance video understanding from describing to explaining the temporal actions. Based on the dataset, we set up multi-choice and open-ended QA tasks targeting causal action reasoning, temporal action reasoning, and common scene comprehension. Through extensive analysis of baselines and established VideoQA techniques, we find that top-performing methods excel at shallow scene descriptions but are weak in causal and temporal action reasoning. Furthermore, the models that are effective on multi-choice QA, when adapted to open-ended QA, still struggle in generalizing the answers. This raises doubt on the ability of these models to reason and highlights possibilities for improvement. With detailed results for different question types and heuristic observations for future works, we hope NExT-QA will guide the next generation of VQA research to go beyond superficial description towards a deeper understanding of videos. (The dataset and related resources are available at https://***/doc-doc/***)

关键词： Adaptation models computer vision Benchmark testing Knowledge discovery Cognition pattern recognition Task analysis

来源：评论

学校读者我要写书评

暂无评论

FixBi: Bridging Domain Spaces for Unsupervised Domain Adaptation

FixBi: Bridging Domain Spaces for Unsupervised Domain Adapta...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Na, Jaemin Jung, Heechul Chang, Hyung Jin Hwang, Wonjun Ajou Univ Suwon South Korea Kyungpook Natl Univ Seoul South Korea Univ Birmingham Birmingham W Midlands England

ISBN: (纸本)9781665445092

Unsupervised domain adaptation (UDA) methods for learning domain invariant representations have achieved remarkable progress. However, most of the studies were based on direct adaptation from the source domain to the target domain and have suffered from large domain discrepancies. In this paper, we propose a UDA method that effectively handles such large domain discrepancies. We introduce a fixed ratio-based mixup to augment multiple intermediate domains between the source and target domain. From the augmented-domains, we train the source-dominant model and the target-dominant model that have complementary characteristics. Using our confidence-based learning methodologies, e.g., bidirectional matching with high-confidence predictions and self-penalization using low-confidence predictions, the models can learn from each other or from its own results. Through our proposed methods, the models gradually transfer domain knowledge from the source to the target domain. Extensive experiments demonstrate the superiority of our proposed method on three public benchmarks: Office-31, Office-Home, and VisDA-2017.(1)

关键词： computer vision Adaptation models Benchmark testing Predictive models pattern recognition Standards

来源：评论

学校读者我要写书评

暂无评论

3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data

3D Kinematics Estimation from Video with a Biomechanical Mod...

引用

ieee computer Society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Zhi-Yi Lin Bofan Lyu Judith Cueto Fernandez Eline van der Kruk Ajay Seth Xucong Zhang Department of Intelligent Systems Computer Vision Lab Delft University of Technology Department of BioMechanical Engineering Faculty of Mechanical Engineering Delft University of Technology

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Accurate 3D kinematics estimation of human body is crucial in various applications for human health and mobility, such as rehabilitation, injury prevention, and diagnosis, as it helps to understand the biomechanical loading experienced during movement. Conventional marker-based motion capture is expensive in terms of financial investment, time, and the expertise required. Moreover, due to the scarcity of datasets with accurate annotations, existing markerless motion capture methods suffer from challenges including unreliable 2D keypoint detection, limited anatomic accuracy, and low generalization capability. In this work, we propose a novel biomechanics-aware network that directly outputs 3D kinematics from two input views with consideration of biomechanical prior and spatio-temporal information. To train the model, we create synthetic dataset ODAH with accurate kinematics annotations generated by aligning the body mesh from the SMPL-X model and a full-body OpenSim skeletal model. Our extensive experiments demonstrate that the proposed approach, only trained on synthetic data, outperforms previous state-of-the-art methods when evaluated across multiple datasets, revealing a promising direction for enhancing video-based human motion capture.

关键词： Biomechanics Visualization computer vision Accuracy Three-dimensional displays Biological system modeling Estimation

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 448 449 450 451 452 453 454 455 456 457 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：