ISBN (Print): 9781665445092
In this paper, we address the problem of predicting the future motion of a dynamic agent (called a target agent) given its current and past states as well as information on its environment. It is paramount to develop a prediction model that can exploit the contextual information in both the static and dynamic environments surrounding the target agent and generate diverse trajectory samples that are meaningful in a traffic context. We propose a novel prediction model, referred to as the lane-aware prediction (LaPred) network, which uses instance-level lane entities extracted from a semantic map to predict multi-modal future trajectories. For each lane candidate found in the neighborhood of the target agent, LaPred extracts joint features relating the lane and the trajectories of the neighboring agents. The features for all lane candidates are then fused with attention weights learned through a self-supervised learning task that identifies the lane candidate most likely to be followed by the target agent. Using this instance-level lane information, LaPred produces trajectories that comply with the surroundings better than 2D raster image-based methods and generates diverse future trajectories from the multiple lane candidates. Experiments on the public nuScenes and Argoverse datasets demonstrate that the proposed LaPred method significantly outperforms existing prediction models, achieving state-of-the-art performance on both benchmarks.
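The attention-weighted fusion over per-lane features described above can be pictured with a minimal sketch like the one below; the module name, feature dimension, and two-layer scorer are illustrative assumptions, not the authors' exact architecture. The attention scores can double as the lane-selection classifier that the self-supervised task supervises.

```python
import torch
import torch.nn as nn

class LaneAttentionFusion(nn.Module):
    """Fuse per-lane-candidate features with learned attention weights.

    The attention scores can also be read as a classifier over lane
    candidates and supervised with the lane the target agent follows.
    """
    def __init__(self, feat_dim: int = 128):
        super().__init__()
        self.scorer = nn.Sequential(
            nn.Linear(feat_dim, 64), nn.ReLU(), nn.Linear(64, 1)
        )

    def forward(self, lane_feats: torch.Tensor):
        # lane_feats: (batch, num_lanes, feat_dim), one feature per lane candidate
        scores = self.scorer(lane_feats).squeeze(-1)          # (batch, num_lanes)
        weights = torch.softmax(scores, dim=-1)               # attention over lanes
        fused = (weights.unsqueeze(-1) * lane_feats).sum(1)   # (batch, feat_dim)
        return fused, weights  # weights feed the lane-selection auxiliary loss

# usage: 4 agents, 6 lane candidates each
fusion = LaneAttentionFusion(feat_dim=128)
fused, w = fusion(torch.randn(4, 6, 128))
```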
ISBN (Print): 9781665445092
While deep learning (DL)-based video deraining methods have achieved significant success in recent years, they still have two major drawbacks. First, most of them are insufficient to model the characteristics of the rain layers contained in rainy videos. In fact, rain layers exhibit strong visual properties (e.g., direction, scale, and thickness) in the spatial dimension and causal properties (e.g., velocity and acceleration) in the temporal dimension, and thus can be modeled by a spatial-temporal process in statistics. Second, current DL-based methods rely heavily on labeled training data whose rain layers are synthetic, leading to a deviation from real data. Such a gap between synthetic and real data results in poor performance when these methods are applied to real scenarios. To address these issues, this paper proposes a new semi-supervised video deraining method in which a dynamical rain generator is employed to fit the rain layer, so as to better depict its intrinsic characteristics. Specifically, the dynamical generator consists of an emission model and a transition model that encode the spatial appearance and temporal dynamics of rain streaks, respectively, both parameterized by deep neural networks (DNNs). Furthermore, different prior formats are designed for the labeled synthetic and unlabeled real data so as to fully exploit their underlying common knowledge. Last but not least, we design a Monte Carlo-based EM algorithm to learn the model. Extensive experiments verify the superiority of the proposed semi-supervised deraining model.
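A toy state-space view of such an emission/transition pair is sketched below, assuming small convolutional networks and a residual temporal update; the latent channel count and layer choices are assumptions for illustration, not the paper's parameterization.

```python
import torch
import torch.nn as nn

class DynamicalRainGenerator(nn.Module):
    """Toy dynamical rain generator: a transition model evolves a latent
    state over time; an emission model decodes each state into a rain layer."""
    def __init__(self, latent_ch: int = 16):
        super().__init__()
        # transition: predicts the next latent state from the current one
        self.transition = nn.Sequential(
            nn.Conv2d(latent_ch, latent_ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(latent_ch, latent_ch, 3, padding=1),
        )
        # emission: decodes a latent state into a single-channel rain layer
        self.emission = nn.Sequential(
            nn.Conv2d(latent_ch, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 1, 3, padding=1),
        )

    def forward(self, z0: torch.Tensor, num_frames: int) -> torch.Tensor:
        z, rain_layers = z0, []
        for _ in range(num_frames):
            z = z + self.transition(z)            # residual temporal update
            rain_layers.append(self.emission(z))  # spatial appearance at frame t
        return torch.stack(rain_layers, dim=1)    # (batch, T, 1, H, W)

gen = DynamicalRainGenerator()
rain = gen(torch.randn(2, 16, 64, 64), num_frames=5)
```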
ISBN (Print): 9781665445092
Learning-based 3D shape segmentation is usually formulated as a semantic labeling problem, assuming that all parts of the training shapes are annotated with a given set of tags. This assumption, however, is impractical for learning fine-grained segmentation. Although most off-the-shelf CAD models are, by construction, composed of fine-grained parts, they usually lack semantic tags, and labeling those fine-grained parts is extremely tedious. We approach the problem with deep clustering, where the key idea is to learn part priors from a shape dataset with fine-grained segmentation but no part labels. Given point-sampled 3D shapes, we model the clustering priors of points with a similarity matrix and achieve part segmentation by minimizing a novel low-rank loss. To handle densely sampled point sets, we adopt a divide-and-conquer strategy: we partition the large point set into a number of blocks, and each block is segmented using a deep-clustering-based part prior network trained in a category-agnostic manner. We then train a graph convolution network to merge the segments of all blocks into the final segmentation result. Our method is evaluated on a challenging benchmark of fine-grained segmentation, showing state-of-the-art performance.
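One common way to make a "low-rank loss" on a similarity matrix concrete is the nuclear norm (sum of singular values), a convex surrogate for rank; the sketch below uses that surrogate as an assumption, since the abstract does not specify the exact relaxation.

```python
import torch
import torch.nn.functional as F

def low_rank_similarity_loss(point_feats: torch.Tensor) -> torch.Tensor:
    """Encourage the pairwise similarity matrix of point embeddings to be
    low-rank, i.e. points group into a small number of parts.

    point_feats: (num_points, feat_dim) point embeddings.
    Uses the nuclear norm (sum of singular values) as a rank surrogate.
    """
    feats = F.normalize(point_feats, dim=-1)
    sim = feats @ feats.t()                  # (N, N) cosine-similarity matrix
    return torch.linalg.svdvals(sim).sum()   # nuclear norm of the similarity matrix

# usage: 256 sampled points with 64-d embeddings
loss = low_rank_similarity_loss(torch.randn(256, 64, requires_grad=True))
loss.backward()
```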
ISBN (Print): 9781665445092
Convolutional neural networks (CNNs) have achieved significant success in the single image dehazing task. Unfortunately, most existing deep dehazing models have high computational complexity, which hinders their application to high-resolution images, especially UHD (ultra-high-definition) or 4K images. To address this problem, we propose a novel network, consisting of three deep CNNs, that is capable of real-time dehazing of 4K images on a single GPU. The first CNN extracts haze-relevant features at a reduced resolution of the hazy input and then fits locally-affine models in the bilateral space. A second CNN learns multiple full-resolution guidance maps corresponding to the learned bilateral model, so that high-frequency feature maps can be reconstructed by multi-guided bilateral upsampling. Finally, the third CNN fuses the high-quality feature maps into a dehazed image. In addition, we create a large-scale 4K image dehazing dataset to support the training and testing of the compared models. Experimental results demonstrate that the proposed algorithm performs favorably against state-of-the-art dehazing approaches on various benchmarks.
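The core idea of applying coefficients learned at low resolution to a full-resolution guidance map can be sketched as below; this is a deliberately simplified stand-in (plain bilinear upsampling of per-pixel scale/offset coefficients rather than true bilateral-grid slicing), and the coefficient layout is an assumption.

```python
import torch
import torch.nn.functional as F

def guided_affine_upsample(coeffs_lr: torch.Tensor, guide_hr: torch.Tensor) -> torch.Tensor:
    """Apply low-res, locally-affine coefficients to a full-res guidance map.

    coeffs_lr: (B, 2, h, w)  per-location scale and offset predicted at low resolution
    guide_hr:  (B, 1, H, W)  full-resolution guidance map
    returns:   (B, 1, H, W)  reconstructed map = scale * guide + offset
    """
    coeffs_hr = F.interpolate(coeffs_lr, size=guide_hr.shape[-2:],
                              mode='bilinear', align_corners=False)
    scale, offset = coeffs_hr[:, :1], coeffs_hr[:, 1:]
    return scale * guide_hr + offset

# usage: 32x32 coefficient grid applied to a 512x512 guidance map
out = guided_affine_upsample(torch.randn(1, 2, 32, 32), torch.randn(1, 1, 512, 512))
```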
ISBN (Print): 9781665448994
Efficiently deploying learning-based systems on embedded hardware is challenging for various reasons, two of which are considered in this paper: the model's size and its robustness against attacks. Both need to be addressed even-handedly. We combine adversarial training and model pruning in a joint formulation of the fundamental learning objective during training. Unlike existing post-training pruning approaches, our method does not rely on heuristics and eliminates the need for a pre-trained model. This yields a classifier that is robust against attacks and enables better compression of the model, reducing its computational effort. Compared to prior work, our approach yields 6.21 pp higher accuracy at an 85% reduction in parameters for ResNet20 on the CIFAR-10 dataset.
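A minimal sketch of such a joint objective follows, combining an adversarial classification loss with a sparsity penalty on learnable per-channel gates that can later be thresholded for pruning. The single-step FGSM attack, the gate parameterization, and the L1 penalty are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedConv(nn.Module):
    """Conv layer with learnable per-channel gates; an L1 penalty on the gates
    drives channels toward zero so they can be pruned after training."""
    def __init__(self, cin: int, cout: int):
        super().__init__()
        self.conv = nn.Conv2d(cin, cout, 3, padding=1)
        self.gate = nn.Parameter(torch.ones(cout))

    def forward(self, x):
        return self.conv(x) * self.gate.view(1, -1, 1, 1)

def joint_loss(model, x, y, eps=8 / 255, sparsity_weight=1e-3):
    # single-step FGSM adversarial example (used here for brevity)
    x_adv = x.clone().detach().requires_grad_(True)
    loss_clean = F.cross_entropy(model(x_adv), y)
    grad = torch.autograd.grad(loss_clean, x_adv)[0]
    x_adv = (x_adv + eps * grad.sign()).clamp(0, 1).detach()

    adv_loss = F.cross_entropy(model(x_adv), y)           # robustness term
    gate_penalty = sum(m.gate.abs().sum() for m in model.modules()
                       if isinstance(m, GatedConv))        # compression term
    return adv_loss + sparsity_weight * gate_penalty

# usage with a toy classifier
model = nn.Sequential(GatedConv(3, 16), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
                      nn.Flatten(), nn.Linear(16, 10))
loss = joint_loss(model, torch.rand(4, 3, 32, 32), torch.randint(0, 10, (4,)))
loss.backward()
```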
ISBN (Print): 9781665445092
Given an untrimmed video and a query sentence, cross-modal video moment retrieval aims to rank the video moment from pre-segmented video moment candidates that best matches the query sentence. Pioneering work typically learns the representations of the textual and visual content separately and then obtains the interactions or alignments between the modalities. However, the task is not yet thoroughly addressed, as it requires identifying the fine-grained differences among video moment candidates with high repeatability and similarity. Moreover, the relations among objects in both the video and the sentence are intuitive and efficient cues for understanding semantics but are rarely considered. Toward this end, we contribute a multi-modal relational graph that captures the interactions among objects in the visual and textual content to identify the differences among similar video moment candidates. Specifically, we first introduce a visual relational graph and a textual relational graph to form relation-aware representations via message propagation. Thereafter, a multi-task pre-training step is designed to capture domain-specific knowledge about objects and relations, enhancing the structured visual representations with explicitly defined relations. Finally, graph matching and boundary regression are employed to perform the cross-modal retrieval. We conduct extensive experiments on two datasets covering daily activities and cooking activities, demonstrating significant improvements over state-of-the-art solutions.
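One step of message propagation over such an object relation graph can be written as below; the mean-over-neighbors aggregation, the linear message/update functions, and the feature dimension are assumptions used only to make the idea concrete.

```python
import torch
import torch.nn as nn

class RelationalGraphLayer(nn.Module):
    """One message-passing step: each object node aggregates messages from
    its related neighbors and updates its representation."""
    def __init__(self, dim: int = 256):
        super().__init__()
        self.message = nn.Linear(dim, dim)
        self.update = nn.Linear(2 * dim, dim)

    def forward(self, node_feats: torch.Tensor, adj: torch.Tensor) -> torch.Tensor:
        # node_feats: (num_nodes, dim); adj: (num_nodes, num_nodes) relation mask
        msgs = self.message(node_feats)                  # per-node messages
        deg = adj.sum(-1, keepdim=True).clamp(min=1)
        agg = (adj @ msgs) / deg                         # mean over related neighbors
        return torch.relu(self.update(torch.cat([node_feats, agg], dim=-1)))

# usage: 8 object nodes with a random relation mask
layer = RelationalGraphLayer(dim=256)
updated = layer(torch.randn(8, 256), (torch.rand(8, 8) > 0.5).float())
```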
ISBN (Digital): 9798350353006
ISBN (Print): 9798350353013
Few-shot segmentation performance declines substantially when facing images from a domain different from the training domain, effectively limiting real-world use cases. To alleviate this, cross-domain few-shot segmentation (CD-FSS) has recently emerged. Works addressing this task have mainly attempted to learn segmentation on a source domain in a manner that generalizes across domains. Surprisingly, we can outperform these approaches while eliminating the training stage and removing their main segmentation network. We show that test-time task adaptation is instead the key to successful CD-FSS. Task adaptation is achieved by appending small networks to the feature pyramid of a conventionally classification-pretrained backbone. To avoid overfitting to the few labeled samples during supervised fine-tuning, consistency across augmented views of the input images serves as guidance while learning the parameters of the attached layers. Despite our self-imposed restriction not to use any images other than the few labeled samples at test time, we achieve new state-of-the-art performance in CD-FSS, evidencing the need to rethink approaches for the task. Code is available at https://***/vision-Kek/ABCDFSS.
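A minimal sketch of consistency-guided test-time fine-tuning of the small attached layers is given below; the choice of augmentation (a horizontal flip), the MSE consistency loss, and the optimizer settings are assumptions, not the authors' exact recipe.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def consistency_finetune(attached_head: nn.Module, frozen_backbone: nn.Module,
                         support_images: torch.Tensor,
                         steps: int = 50, lr: float = 1e-3) -> None:
    """Fine-tune only the small attached head so that its predictions agree
    across two augmented views of the few labeled support images."""
    opt = torch.optim.Adam(attached_head.parameters(), lr=lr)
    for _ in range(steps):
        view_a = support_images
        view_b = torch.flip(support_images, dims=[-1])     # second augmented view
        with torch.no_grad():                               # backbone stays frozen
            feat_a = frozen_backbone(view_a)
            feat_b = frozen_backbone(view_b)
        out_a = attached_head(feat_a)
        out_b = torch.flip(attached_head(feat_b), dims=[-1])  # undo flip to align
        loss = F.mse_loss(out_a, out_b)                     # consistency guidance
        opt.zero_grad()
        loss.backward()
        opt.step()
```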
ISBN (Print): 9781665445092
Semantic segmentation models gain robustness against poor lighting conditions by virtue of the complementary information in visible (RGB) and thermal images. Despite its importance, most existing RGB-T semantic segmentation models rely on primitive fusion strategies, such as concatenation, element-wise summation, and weighted summation, to fuse features from different modalities. These strategies, unfortunately, overlook the modality differences caused by different imaging mechanisms, and thus suffer from reduced discriminability of the fused features. To address this issue, we propose, for the first time, a bridging-then-fusing strategy, whose innovation lies in a novel Adaptive-weighted Bi-directional Modality Difference Reduction Network (ABMDRNet). Concretely, a Modality Difference Reduction and Fusion (MDRF) subnetwork is designed, which first employs a bi-directional image-to-image translation based method to reduce the modality differences between RGB and thermal features, and then adaptively selects discriminative multi-modality features for RGB-T semantic segmentation through channel-wise weighted fusion. Furthermore, considering the importance of contextual information in semantic segmentation, a Multi-Scale Spatial Context (MSC) module and a Multi-Scale Channel Context (MCC) module are proposed to exploit the interactions among multi-scale contextual information of cross-modality features, together with their long-range dependencies along the spatial and channel dimensions, respectively. Comprehensive experiments on the MFNet dataset demonstrate that our method achieves new state-of-the-art results.
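Channel-wise weighted fusion of RGB and thermal features can be sketched as below, using a squeeze-and-excitation-style gate as an illustrative choice; the gating architecture and channel count are assumptions rather than the ABMDRNet design.

```python
import torch
import torch.nn as nn

class ChannelWeightedFusion(nn.Module):
    """Fuse RGB and thermal feature maps with learned per-channel weights."""
    def __init__(self, channels: int = 256):
        super().__init__()
        self.gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * channels, channels, 1), nn.ReLU(),
            nn.Conv2d(channels, channels, 1), nn.Sigmoid(),
        )

    def forward(self, rgb_feat: torch.Tensor, thermal_feat: torch.Tensor) -> torch.Tensor:
        # per-channel weight in [0, 1] computed from both modalities
        w = self.gate(torch.cat([rgb_feat, thermal_feat], dim=1))  # (B, C, 1, 1)
        return w * rgb_feat + (1 - w) * thermal_feat               # channel-wise blend

# usage on a pair of 256-channel feature maps
fuse = ChannelWeightedFusion(channels=256)
out = fuse(torch.randn(1, 256, 60, 80), torch.randn(1, 256, 60, 80))
```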
ISBN (Print): 9781665445092
Current face forgery detection methods achieve high accuracy under the within-database scenario, where training and testing forgeries are synthesized by the same algorithm. However, few of them achieve satisfying performance under the cross-database scenario, where training and testing forgeries are synthesized by different algorithms. In this paper, we find that current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize. Observing that image noise removes color textures and exposes discrepancies between authentic and tampered regions, we propose to utilize high-frequency noise for face forgery detection. We carefully devise three functional modules to take full advantage of the high-frequency features. The first is a multi-scale high-frequency feature extraction module that extracts high-frequency noise at multiple scales and composes a novel modality. The second is a residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective. The last is a cross-modality attention module that leverages the correlation between the two complementary modalities to promote feature learning for each other. Comprehensive evaluations on several benchmark databases corroborate the superior generalization performance of our proposed method.
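Multi-scale extraction of high-frequency noise residuals can be sketched as below; a fixed Laplacian high-pass kernel stands in for whatever filters the paper actually uses (e.g., SRM-style kernels), and the luminance proxy and scale set are assumptions.

```python
import torch
import torch.nn.functional as F

# fixed 3x3 Laplacian high-pass kernel (stand-in for SRM-style noise filters)
_HIGH_PASS = torch.tensor([[0., -1., 0.],
                           [-1., 4., -1.],
                           [0., -1., 0.]]).view(1, 1, 3, 3)

def multiscale_high_freq(image: torch.Tensor, scales=(1, 2, 4)) -> list:
    """Extract high-frequency noise residuals at several scales.

    image: (B, 3, H, W) in [0, 1]; returns one residual map per scale."""
    gray = image.mean(dim=1, keepdim=True)              # simple luminance proxy
    residuals = []
    for s in scales:
        x = F.avg_pool2d(gray, s) if s > 1 else gray    # downscale by factor s
        residuals.append(F.conv2d(x, _HIGH_PASS, padding=1))
    return residuals

# usage: residuals at full, half, and quarter resolution
maps = multiscale_high_freq(torch.rand(2, 3, 256, 256))
```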
ISBN (Print): 9781665445092
Data-driven approaches, despite great success in many tasks, generalize poorly when applied to unseen image domains and require expensive annotation, especially for dense pixel prediction tasks such as semantic segmentation. Recently, both unsupervised domain adaptation (UDA) from large amounts of synthetic data and semi-supervised learning (SSL) with a small set of labeled data have been studied to alleviate this issue. However, there is still a large performance gap compared to their supervised counterparts. We focus on the more practical setting of semi-supervised domain adaptation (SSDA), where both a small set of labeled target data and large amounts of labeled source data are available. To address the SSDA task, a novel framework based on dual-level domain mixing is proposed. The framework consists of three stages. First, two data mixing methods are proposed to reduce the domain gap at the region level and the sample level, respectively, yielding two complementary domain-mixed teachers trained on dual-level mixed data from holistic and partial views. Then, a student model is learned by distilling knowledge from these two teachers. Finally, pseudo labels of the unlabeled data are generated in a self-training manner for another few rounds of teacher training. Extensive experimental results demonstrate the effectiveness of our proposed framework on synthetic-to-real semantic segmentation benchmarks.
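Region-level domain mixing can be illustrated with a CutMix-style operation that pastes a random rectangle from a target-domain image (and its label map) into a source-domain image; the fixed rectangle ratio and the rectangular region shape are assumptions, and the sample-level counterpart would instead mix whole images across domains within a batch.

```python
import torch

def region_level_mix(src_img, src_lbl, tgt_img, tgt_lbl, ratio: float = 0.5):
    """Paste a random rectangle from the target-domain image (and its label map)
    into the source-domain image, producing a region-level domain-mixed sample.

    images: (C, H, W) float tensors; labels: (H, W) integer class maps."""
    _, H, W = src_img.shape
    h, w = int(H * ratio), int(W * ratio)
    top = torch.randint(0, H - h + 1, (1,)).item()
    left = torch.randint(0, W - w + 1, (1,)).item()
    mixed_img, mixed_lbl = src_img.clone(), src_lbl.clone()
    mixed_img[:, top:top + h, left:left + w] = tgt_img[:, top:top + h, left:left + w]
    mixed_lbl[top:top + h, left:left + w] = tgt_lbl[top:top + h, left:left + w]
    return mixed_img, mixed_lbl

# usage with dummy source/target images and label maps
img, lbl = region_level_mix(torch.rand(3, 512, 1024),
                            torch.zeros(512, 1024, dtype=torch.long),
                            torch.rand(3, 512, 1024),
                            torch.ones(512, 1024, dtype=torch.long))
```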