Search Results - Inner Mongolia University Library

Search query: "Any field = IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"

12,859 records in total; showing records 4961-4970

Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Zhou, Tianfei; Wang, Wenguan; Liu, Si; Yang, Yi; Van Gool, Luc
Affiliations: Swiss Fed Inst Technol Comp Vis Lab Zurich Switzerland; Beihang Univ Inst Artificial Intelligence Beijing Peoples R China; Univ Technol Sydney Sydney NSW Australia

ISBN: (print) 9781665445092

To address the challenging task of instance-aware human part parsing, a new bottom-up regime is proposed to learn category-level human semantic segmentation as well as multi-person pose estimation in a joint and end-to-end manner. It is a compact, efficient and powerful framework that exploits structural information over different human granularities and eases the difficulty of person partitioning. Specifically, a dense-to-sparse projection field, which allows explicitly associating dense human semantics with sparse keypoints, is learnt and progressively improved over the network feature pyramid for robustness. Then, the difficult pixel grouping problem is cast as an easier, multi-person joint assembling task. By formulating joint association as maximum-weight bipartite matching, a differentiable solution is developed to exploit projected gradient descent and Dykstra's cyclic projection algorithm. This makes our method end-to-end trainable and allows back-propagating the grouping error to directly supervise multi-granularity human representation learning. This is distinguished from current bottom-up human parsers or pose estimators which require sophisticated post-processing or heuristic greedy algorithms. Experiments on three instance-aware human parsing datasets show that our model outperforms other bottom-up alternatives with much more efficient inference.

Keywords: Greedy algorithms; computer vision; Semantics; Pose estimation; Robustness; pattern recognition; Task analysis
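
The abstract above formulates joint association as maximum-weight bipartite matching. As a minimal illustration of that formulation only, and not of the paper's differentiable solver (which relies on projected gradient descent and Dykstra's cyclic projection), the sketch below finds the best one-to-one assignment by exhaustive search. The function name `max_weight_matching` and the affinity scores are hypothetical and chosen purely for illustration.

```python
from itertools import permutations


def max_weight_matching(weights):
    """Return (best_total, assignment) for a square score matrix.

    weights[i][j] is a hypothetical affinity between left node i and
    right node j; assignment[i] is the right node matched to left node i.
    Exhaustive search over permutations: only suitable for tiny inputs.
    """
    n = len(weights)
    best_total, best_assignment = float("-inf"), None
    for perm in permutations(range(n)):
        total = sum(weights[i][perm[i]] for i in range(n))
        if total > best_total:
            best_total, best_assignment = total, perm
    return best_total, list(best_assignment)


# Illustrative affinities between three left nodes and three right nodes.
scores = [
    [0.9, 0.1, 0.2],
    [0.3, 0.8, 0.1],
    [0.2, 0.4, 0.7],
]
total, assignment = max_weight_matching(scores)
print(assignment, round(total, 3))
```

Exhaustive search is not differentiable and scales factorially, which is one way to see why a relaxed, gradient-friendly solver is attractive when the matching has to sit inside an end-to-end trainable network.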


Adaptive Aggregation Networks for Class-Incremental Learning


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liu, Yaoyao; Schiele, Bernt; Sun, Qianru
Affiliations: Saarland Informat Campus Max Planck Inst Informat Saarbrucken Germany; Singapore Management Univ Sch Comp & Informat Syst Singapore Singapore

ISBN: (print) 9781665445092

Class-Incremental Learning (CIL) aims to learn a classification model with the number of classes increasing phase-by-phase. An inherent problem in CIL is the stability-plasticity dilemma between the learning of old and new classes, i.e., high-plasticity models easily forget old classes, but high-stability models are weak to learn new classes. We alleviate this issue by proposing a novel network architecture called Adaptive Aggregation Networks (AANets) in which we explicitly build two types of residual blocks at each residual level (taking ResNet as the baseline architecture): a stable block and a plastic block. We aggregate the output feature maps from these two blocks and then feed the results to the next-level blocks. We adapt the aggregation weights in order to balance these two types of blocks, i.e., to balance stability and plasticity, dynamically. We conduct extensive experiments on three CIL benchmarks: CIFAR-100, ImageNet-Subset, and ImageNet, and show that many existing CIL methods can be straightforwardly incorporated into the architecture of AANets to boost their performances.

Keywords: Adaptation models; computer vision; Adaptive systems; computer architecture; Network architecture; Benchmark testing; Stability analysis
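
The abstract above describes aggregating the feature maps of a stable block and a plastic block with adaptable weights. A minimal sketch of that idea follows, assuming the aggregation is a simple weighted sum of the two outputs; the function name `aggregate`, the flat-list feature representation, and the weight values are hypothetical, and the procedure for adapting the weights is not shown.

```python
def aggregate(stable_features, plastic_features, alpha_stable, alpha_plastic):
    """Weighted sum of two equally sized feature vectors.

    Assumption: aggregation is element-wise
    alpha_stable * stable + alpha_plastic * plastic.
    Features are plain lists of floats here for illustration.
    """
    if len(stable_features) != len(plastic_features):
        raise ValueError("feature vectors must have the same length")
    return [
        alpha_stable * s + alpha_plastic * p
        for s, p in zip(stable_features, plastic_features)
    ]


# Hypothetical outputs of the two blocks at one residual level.
stable_out = [1.0, 2.0]
plastic_out = [3.0, 4.0]

# A larger plastic weight leans toward learning new classes;
# a larger stable weight leans toward retaining old ones.
mixed = aggregate(stable_out, plastic_out, alpha_stable=0.25, alpha_plastic=0.75)
print(mixed)
```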


Joint Negative and Positive Learning for Noisy Labels


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Kim, Youngdong; Yun, Juseung; Shon, Hyounguk; Kim, Junmo
Affiliations: Korea Adv Inst Sci & Technol Sch Elect Engn Daejeon South Korea

ISBN: (print) 9781665445092

Training of Convolutional Neural Networks (CNNs) with data with noisy labels is known to be a challenge. Based on the fact that directly providing the label to the data (Positive Learning; PL) has a risk of allowing CNNs to memorize the contaminated labels for the case of noisy data, the indirect learning approach that uses complementary labels (Negative Learning for Noisy Labels; NLNL) has proven to be highly effective in preventing overfitting to noisy data as it reduces the risk of providing faulty target. NLNL further employs a three-stage pipeline to improve convergence. As a result, filtering noisy data through the NLNL pipeline is cumbersome, increasing the training cost. In this study, we propose a novel improvement of NLNL, named Joint Negative and Positive Learning (JNPL), that unifies the filtering pipeline into a single stage. JNPL trains CNN via two losses, NL+ and PL+, which are improved upon NL and PL loss functions, respectively. We analyze the fundamental issue of NL loss function and develop new NL+ loss function producing gradient that enhances the convergence of noisy data. Furthermore, PL+ loss function is designed to enable faster convergence to expected-to-be-clean data. We show that the NL+ and PL+ train CNN simultaneously, significantly simplifying the pipeline, allowing greater ease of practical use compared to NLNL. With a simple semi-supervised training technique, our method achieves state-of-the-art accuracy for noisy data classification based on the superior filtering ability.

Keywords: Training; computer vision; Costs; Filtering; Pipelines; Training data; pattern recognition
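
The abstract above contrasts positive learning (the input has the given label) with negative learning (the input does not have a complementary label). A minimal sketch of the two baseline per-sample losses follows, assuming the usual cross-entropy form for PL and the complementary-label form `-log(1 - p)` for NL. The improved NL+ and PL+ losses proposed in the paper are not reproduced here, and the function names and probabilities are hypothetical.

```python
import math


def positive_learning_loss(probs, label):
    """Cross-entropy on the given label: -log p[label]."""
    return -math.log(probs[label])


def negative_learning_loss(probs, complementary_label):
    """Loss for 'the input is NOT this class': -log(1 - p[complementary_label])."""
    return -math.log(1.0 - probs[complementary_label])


# Hypothetical softmax output over three classes.
probs = [0.7, 0.2, 0.1]

pl = positive_learning_loss(probs, label=0)                # trusts label 0
nl = negative_learning_loss(probs, complementary_label=1)  # asserts "not class 1"
print(round(pl, 4), round(nl, 4))
```

With a noisy dataset, a randomly drawn complementary label is far more likely to be correct ("not class 1") than a given label is to be clean, which is the intuition behind using the negative form to limit memorization of wrong labels.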


BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Hou, Ruibing; Chang, Hong; Ma, Bingpeng; Huang, Rui; Shan, Shiguang
Affiliations: Chinese Acad Sci Inst Comp Technol CAS Key Lab Intelligent Informat Proc Beijing 100190 Peoples R China; Univ Chinese Acad Sci Beijing 100049 Peoples R China; Chinese Univ Hong Kong Shenzhen Inst Artificial Intelligence & Robot Soc Shenzhen 518172 Guangdong Peoples R China; CAS Ctr Excellence Brain Sci & Intelligence Techn Shanghai 200031 Peoples R China

ISBN: (print) 9781665445092

In this paper, we present an efficient spatial-temporal representation for video person re-identification (reID). Firstly, we propose a Bilateral Complementary Network (BiCnet) for spatial complementarity modeling. Specifically, BiCnet contains two branches. Detail Branch processes frames at original resolution to preserve the detailed visual clues, and Context Branch with a down-sampling strategy is employed to capture long-range contexts. On each branch, BiCnet appends multiple parallel and diverse attention modules to discover divergent body parts for consecutive frames, so as to obtain an integral characteristic of target identity. Furthermore, a Temporal Kernel Selection (TKS) block is designed to capture short-term as well as long-term temporal relations by an adaptive mode. TKS can be inserted into BiCnet at any depth to construct BiCnet-TKS for spatial-temporal modeling. Experimental results on multiple benchmarks show that BiCnet-TKS outperforms state-of-the-arts with about 50% less computations.

Keywords: Visualization; computer vision; Codes; Computational modeling; Benchmark testing; pattern recognition; Computational efficiency


Accurate Few-shot Object Detection with Support-Query Mutual Guidance and Hybrid Loss


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Zhang, Lu; Zhou, Shuigeng; Guan, Jihong; Zhang, Ji
Affiliations: Fudan Univ Shanghai Key Lab Intelligent Informat Proc Shanghai Peoples R China; Fudan Univ Sch Comp Sci Shanghai Peoples R China; Tongji Univ Dept Comp Sci & Technol Shanghai Peoples R China; Zhejiang Lab Hangzhou Peoples R China

ISBN: (print) 9781665445092

Most object detection methods require huge amounts of annotated data and can detect only the categories that appear in the training set. However, in reality acquiring massive annotated training data is both expensive and time-consuming. In this paper, we propose a novel two-stage detector for accurate few-shot object detection. In the first stage, we employ a support-query mutual guidance mechanism to generate more support-relevant proposals. Concretely, on the one hand, a query-guided support weighting module is developed for aggregating different supports to generate the support feature. On the other hand, a support-guided query enhancement module is designed by dynamic kernels. In the second stage, we score and filter proposals via multi-level feature comparison between each proposal and the aggregated support feature based on a distance metric learnt by an effective hybrid loss, which makes the embedding space of distance metric more discriminative. Extensive experiments on benchmark datasets show that our method substantially outperforms the existing methods and lifts the SOTA of FSOD task to a higher level.

Keywords: Measurement; Training; computer vision; Training data; Object detection; Detectors; pattern recognition


FVC: A New Framework towards Deep Video Compression in Feature Space


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Hu, Zhihao; Lu, Guo; Xu, Dong
Affiliations: Beihang Univ Beijing Peoples R China; Beijing Inst Technol Beijing Peoples R China; Univ Sydney Sydney NSW Australia

ISBN: (print) 9781665445092

Learning based video compression attracts increasing attention in the past few years. The previous hybrid coding approaches rely on pixel space operations to reduce spatial and temporal redundancy, which may suffer from inaccurate motion estimation or less effective motion compensation. In this work, we propose a feature-space video coding network (FVC) by performing all major operations (i.e., motion estimation, motion compression, motion compensation and residual compression) in the feature space. Specifically, in the proposed deformable compensation module, we first apply motion estimation in the feature space to produce motion information (i.e., the offset maps), which will be compressed by using the auto-encoder style network. Then we perform motion compensation by using deformable convolution and generate the predicted feature. After that, we compress the residual feature between the feature from the current frame and the predicted feature from our deformable compensation module. For better frame reconstruction, the reference features from multiple previous reconstructed frames are also fused by using the nonlocal attention mechanism in the multi-frame feature fusion module. Comprehensive experimental results demonstrate that the proposed framework achieves the state-of-the-art performance on four benchmark datasets including HEVC, UVG, VTL and MCL-JCV.

Keywords: Video coding; computer vision; Convolution; Motion estimation; Redundancy; Video compression; Benchmark testing


Modeling Multi-Label Action Dependencies for Temporal Action Localization


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Tirupattur, Praveen; Duarte, Kevin; Rawat, Yogesh S.; Shah, Mubarak
Affiliations: Univ Cent Florida Ctr Res Comp Vis Orlando FL 32816 USA

ISBN: (print) 9781665445092

Real-world videos contain many complex actions with inherent relationships between action classes. In this work, we propose an attention-based architecture that models these action relationships for the task of temporal action localization in untrimmed videos. As opposed to previous works that leverage video-level co-occurrence of actions, we distinguish the relationships between actions that occur at the same time-step and actions that occur at different time-steps (i.e. those which precede or follow each other). We define these distinct relationships as action dependencies. We propose to improve action localization performance by modeling these action dependencies in a novel attention-based Multi-Label Action Dependency (MLAD) layer. The MLAD layer consists of two branches: a Co-occurrence Dependency Branch and a Temporal Dependency Branch to model co-occurrence action dependencies and temporal action dependencies, respectively. We observe that existing metrics used for multi-label classification do not explicitly measure how well action dependencies are modeled, therefore, we propose novel metrics that consider both co-occurrence and temporal dependencies between action classes. Through empirical evaluation and extensive analysis, we show improved performance over state-of-the-art methods on multi-label action localization benchmarks (MultiTHUMOS and Charades) in terms of fmAP and our proposed metric.

Keywords: Measurement; Location awareness; computer vision; Codes; Network architecture; Benchmark testing; pattern recognition


DOTS: Decoupling Operation and Topology in Differentiable Architecture Search


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Gu, Yu-Chao; Wang, Li-Juan; Liu, Yun; Yang, Yi; Wu, Yu-Huan; Lu, Shao-Ping; Cheng, Ming-Ming
Affiliations: Nankai Univ CS TKLNDST Tianjin Peoples R China; Zhejiang Univ Hangzhou Peoples R China

ISBN: (print) 9781665445092

Differentiable Architecture Search (DARTS) has attracted extensive attention due to its efficiency in searching for cell structures. DARTS mainly focuses on the operation search and derives the cell topology from the operation weights. However, the operation weights can not indicate the importance of cell topology and result in poor topology rating correctness. To tackle this, we propose to Decouple the Operation and Topology Search (DOTS), which decouples the topology representation from operation weights and makes an explicit topology search. DOTS is achieved by introducing a topology search space that contains combinations of candidate edges. The proposed search space directly reflects the search objective and can be easily extended to support a flexible number of edges in the searched cell. Existing gradient-based NAS methods can be incorporated into DOTS for further improvement by the topology search. Considering that some operations (e.g., Skip-Connection) can affect the topology, we propose a group operation search scheme to preserve topology-related operations for a better topology search. The experiments on CIFAR10/100 and ImageNet demonstrate that DOTS is an effective solution for differentiable NAS.

Keywords: computer vision; Codes; Microprocessors; Image edge detection; computer architecture; US Department of Transportation; Search problems


Graph Attention Tracking


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Guo, Dongyan; Shao, Yanyan; Cui, Ying; Wang, Zhenhua; Zhang, Liyan; Shen, Chunhua
Affiliations: Zhejiang Univ Technol Hangzhou Peoples R China; Nanjing Univ Aeronaut & Astronaut Nanjing Peoples R China; Monash Univ Melbourne Vic Australia

ISBN: (print) 9781665445092

Siamese network based trackers formulate the visual tracking task as a similarity matching problem. Almost all popular Siamese trackers realize the similarity learning via convolutional feature cross-correlation between a target branch and a search branch. However, since the size of target feature region needs to be pre-fixed, these cross-correlation base methods suffer from either reserving much adverse background information or missing a great deal of foreground information. Moreover, the global matching between the target and search region also largely neglects the target structure and part-level information. In this paper, to solve the above issues, we propose a simple target-aware Siamese graph attention network for general object tracking. We propose to establish part-to-part correspondence between the target and the search region with a complete bipartite graph, and apply the graph attention mechanism to propagate target information from the template feature to the search feature. Further, instead of using the pre-fixed region cropping for template-feature-area selection, we investigate a target-aware area selection mechanism to fit the size and aspect ratio variations of different objects. Experiments on challenging benchmarks including GOT-10k, UAV123, OTB-100 and LaSOT demonstrate that the proposed SiamGAT outperforms many state-of-the-art trackers and achieves leading performance.

Keywords: Convolutional codes; Visualization; computer vision; Target tracking; Benchmark testing; Search problems; pattern recognition


Structured Multi-Level Interaction Network for Video Moment Localization via Language Query


IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wang, Hao; Zha, Zheng-Jun; Li, Liang; Liu, Dong; Luo, Jiebo
Affiliations: Univ Sci & Technol China Hefei Peoples R China; Chinese Acad Sci Inst Comp Technol Beijing Peoples R China; Univ Rochester Rochester NY 14627 USA

ISBN: (print) 9781665445092

We address the problem of localizing a specific moment described by a natural language query. Existing works interact the query with either video frame or moment proposal, and neglect the inherent structure of moment construction for both cross-modal understanding and video content comprehension, which are the two crucial challenges for this task. In this paper, we disentangle the activity moment into boundary and content. Based on the explored moment structure, we propose a novel Structured Multi-level Interaction Network (SMIN) to tackle this problem through multi-levels of cross-modal interaction coupled with content-boundary-moment interaction. In particular, for cross-modal interaction, we interact the sentence-level query with the whole moment while interacting the word-level query with content and boundary, as in a coarse-to-fine manner. For content-boundary-moment interaction, we capture the insightful relations between boundary, content, and the whole moment proposal. Through multi-level interactions, the model obtains robust cross-modal representation for accurate moment localization. Extensive experiments conducted on three benchmarks (i.e., Charades-STA, ActivityNet-Captions, and TACoS) demonstrate the proposed approach outperforms the state-of-the-art methods.

Keywords: Location awareness; computer vision; Natural languages; Benchmark testing; pattern recognition; Proposals; Task analysis

