ISBN (print): 9781665445092
Existing person search methods integrate the person detection and re-identification (re-ID) modules into a unified system. Although promising results have been achieved, the misalignment problem, which commonly occurs in person search, limits the discriminative feature representation for re-ID. To overcome this limitation, we introduce a novel framework that learns discriminative representations by utilizing the prototype in the OIM loss. Unlike conventional methods that use the prototype only as a representation of person identity, we utilize it as guidance that allows the attention network to consistently highlight multiple instances across different poses. Moreover, we propose a new prototype update scheme with adaptive momentum to increase the discriminative ability across different instances. Extensive ablation experiments demonstrate that our method significantly enhances the discriminative power of the features, outperforming state-of-the-art results on two person search benchmarks, CUHK-SYSU and PRW.
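As a rough illustration of a prototype update with adaptive momentum, the sketch below scales the momentum by the cosine similarity between the incoming instance feature and the stored prototype, so that dissimilar instances (e.g., new poses) shift the prototype more strongly; the scaling rule and the function name are assumptions, not the paper's exact scheme.

```python
import torch
import torch.nn.functional as F

def update_prototype(prototype, feature, base_momentum=0.5):
    """EMA-style update of an OIM-like prototype (hypothetical sketch).

    The momentum is scaled by the similarity between the stored prototype
    and the new instance feature (an assumed rule): similar instances
    barely move the prototype, dissimilar ones move it more.
    """
    feature = F.normalize(feature, dim=0)               # unit-length instance feature
    sim = torch.dot(prototype, feature).clamp(min=0.0)  # cosine similarity in [0, 1]
    m = base_momentum * sim                             # adaptive momentum
    new_prototype = m * prototype + (1.0 - m) * feature
    return F.normalize(new_prototype, dim=0)            # keep prototype unit-length
```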
ISBN (print): 9781665445092
In this paper, we introduce the new task of reconstructing 3D human pose from a single image in which both the person and the person's reflection in a mirror are visible. Compared to general scenarios of 3D pose estimation from a single view, the mirror reflection provides an additional view that resolves the depth ambiguity. We develop an optimization-based approach that exploits mirror symmetry constraints for accurate 3D pose reconstruction. We also provide a method to estimate the surface normal of the mirror from vanishing points in the single image. To validate the proposed approach, we collect a large-scale dataset named Mirrored-Human, which covers a large variety of human subjects, poses, and backgrounds. The experiments demonstrate that, when trained on Mirrored-Human with our reconstructed 3D poses as pseudo ground truth, the accuracy and generalizability of existing single-view 3D pose estimators can be largely improved. The code and dataset are available at https://***/Mirrored-Human/.
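The geometry behind a mirror symmetry constraint is standard: reflecting a point across the plane n·x + d = 0 with unit normal n. A minimal sketch (the paper's exact energy terms may differ):

```python
import numpy as np

def reflect_across_mirror(points, n, d):
    """Reflect 3D points (N, 3) across the mirror plane n.x + d = 0 (unit normal n)."""
    n = n / np.linalg.norm(n)
    dist = points @ n + d                    # signed distance of each point to the plane
    return points - 2.0 * dist[:, None] * n

# A symmetry residual could then compare the reflected real-person joints J
# against the joints J_m estimated for the mirrored person:
#   residual = reflect_across_mirror(J, n, d) - J_m
```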
ISBN (print): 9781665445092
Convolutional neural networks have made remarkable progress in the face recognition field. As face recognition technology advances, increasingly discriminative features are encoded into a face template. However, this increases the threat to user privacy in case the template is exposed. In this paper, we present a modular architecture for face template protection, called IronMask, that can be combined with any face recognition system using an angular distance metric. We circumvent the need for binarization, which is the main cause of performance degradation in most existing face template protections, by proposing a new real-valued error-correcting code that is compatible with real-valued templates and can therefore minimize performance degradation. We evaluate the efficacy of IronMask by extensive experiments on two face recognition systems, ArcFace and CosFace, with three datasets, CMU Multi-PIE, FEI, and Color FERET. According to our experimental results, IronMask achieves a true accept rate (TAR) of 99.79% at a false accept rate (FAR) of 0.0005% when combined with ArcFace, and a 95.78% TAR at 0% FAR with CosFace, while providing at least 115-bit security against known attacks.
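For context, the angular distance metric that IronMask assumes from the underlying recognizer (e.g., ArcFace, CosFace) is simply the angle between unit-normalized templates; a minimal sketch with an assumed acceptance threshold:

```python
import numpy as np

def angular_distance(a, b):
    """Angle (radians) between two face templates compared by direction only."""
    a = a / np.linalg.norm(a)
    b = b / np.linalg.norm(b)
    cos = np.clip(np.dot(a, b), -1.0, 1.0)  # guard against rounding outside [-1, 1]
    return np.arccos(cos)

# verification: accept if angular_distance(probe, enrolled) < theta,
# where theta is the system's operating threshold (assumption).
```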
ISBN (print): 9781665445092
Many computer vision tasks address the problem of scene understanding and are naturally interrelated, e.g., object classification, detection, scene segmentation, and depth estimation. We show that we can leverage the inherent relationships among collections of tasks, as they are trained jointly, supervising each other through their known relationships via consistency losses. Furthermore, explicitly utilizing the relationships between tasks improves their performance while dramatically reducing the need for labeled data, and allows training with additional unsupervised or simulated data. We demonstrate a distributed joint training algorithm with task-level parallelism, which affords a high degree of asynchronicity and robustness. This allows learning across multiple tasks, or with large amounts of input data, at scale. We demonstrate our framework on subsets of the following collection of tasks: depth and normal prediction, semantic segmentation, 3D motion and egomotion estimation, and object tracking and 3D detection in point clouds. We observe improved performance across these tasks, especially in the low-label regime.
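As one concrete example of a cross-task consistency loss, depth and surface normals constrain each other: normals derived from depth gradients should agree with the normal prediction head. The sketch below uses a simplified orthographic model (an assumption; the actual losses depend on the task pair and camera intrinsics):

```python
import torch
import torch.nn.functional as F

def depth_normal_consistency(depth, pred_normals):
    """Cosine disagreement between depth-derived and predicted normals.

    depth: (B, 1, H, W); pred_normals: (B, 3, H, W), unit-length.
    """
    dzdx = F.pad(depth[:, :, :, 1:] - depth[:, :, :, :-1], (0, 1, 0, 0))  # x gradient
    dzdy = F.pad(depth[:, :, 1:, :] - depth[:, :, :-1, :], (0, 0, 0, 1))  # y gradient
    ones = torch.ones_like(depth)
    n_from_depth = F.normalize(torch.cat([-dzdx, -dzdy, ones], dim=1), dim=1)
    return (1.0 - (n_from_depth * pred_normals).sum(dim=1)).mean()
```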
ISBN (print): 9781665445092
This paper presents a novel, simple yet robust self-representation method, i.e., Double Low-Rank Representation with Projection Distance penalty (DLRRPD), for clustering. With the learned optimal projected representations, DLRRPD is capable of obtaining an effective similarity graph that captures the multi-subspace structure. Besides the global low-rank constraint, the local geometrical structure is additionally exploited via a projection distance penalty in our DLRRPD, thus facilitating a more favorable graph. Moreover, to improve the robustness of DLRRPD to noise, we introduce a Laplacian rank constraint, which further encourages the learned graph to be discriminative for clustering tasks. Meanwhile, the Frobenius norm (instead of the popularly used nuclear norm) is employed to enforce the graph to be more block-diagonal with lower complexity. Extensive experiments have been conducted on synthetic, real, and noisy data to show that the proposed method outperforms currently available alternatives by margins of 1.0% to 10.1%.
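A schematic objective assembled from the ingredients the abstract names (illustrative only; the paper's exact formulation, variable names, and constraints may differ):

```latex
\min_{P,\,Z}\;
  \underbrace{\operatorname{rank}(Z) + \operatorname{rank}(PX)}_{\text{double low-rank}}
  \;+\; \lambda_1 \underbrace{\lVert PX - PXZ \rVert_F^2}_{\text{projection distance penalty}}
  \;+\; \lambda_2 \underbrace{\lVert Z \rVert_F^2}_{\text{Frobenius-norm graph regularizer}}
  \quad \text{s.t.}\; \operatorname{rank}(L_Z) = n - c
```

Here $X$ is the data, $P$ the learned projection, $Z$ the self-representation graph, $L_Z$ its Laplacian, $n$ the number of samples, and $c$ the number of clusters; the constraint $\operatorname{rank}(L_Z) = n - c$ forces the learned graph into exactly $c$ connected components.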
ISBN (print): 9781665445092
Significant performance improvements have been achieved for fully-supervised video salient object detection with pixel-wise labeled training datasets, which are time-consuming and expensive to obtain. To relieve the burden of data annotation, we present the first weakly supervised video salient object detection model, based on relabeled "fixation guided scribble annotations". Specifically, an "appearance-motion fusion module" and a bidirectional ConvLSTM based framework are proposed to achieve effective multi-modal learning and long-term temporal context modeling based on our new weak annotations. Further, we design a novel foreground-background similarity loss to further explore the labeling similarity across frames. A weak annotation boosting strategy is also introduced to boost model performance with a new pseudo-label generation technique. Extensive experimental results on six benchmark video saliency detection datasets illustrate the effectiveness of our solution.
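A minimal sketch of what a foreground-background similarity loss can look like: features are pooled under (pseudo-)saliency masks, and foreground prototypes are pulled together across frames while being pushed away from the background (all names and the exact formulation are assumptions, not the paper's loss):

```python
import torch
import torch.nn.functional as F

def fg_bg_similarity_loss(feat_t, feat_t1, mask_t, mask_t1, eps=1e-6):
    """feat_*: (B, C, H, W) frame features; mask_*: (B, 1, H, W) soft saliency masks."""
    def pool(feat, mask):
        w = mask / (mask.sum(dim=(2, 3), keepdim=True) + eps)  # normalized weights
        return F.normalize((feat * w).sum(dim=(2, 3)), dim=1)  # (B, C) prototype

    fg_t, fg_t1 = pool(feat_t, mask_t), pool(feat_t1, mask_t1)
    bg_t = pool(feat_t, 1.0 - mask_t)
    pull = 1.0 - (fg_t * fg_t1).sum(dim=1)        # foreground agrees across frames
    push = (fg_t * bg_t).sum(dim=1).clamp(min=0)  # foreground differs from background
    return (pull + push).mean()
```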
ISBN (print): 9781665445092
Vision-and-language pre-training has achieved impressive success in learning multimodal representations between vision and language. To generalize this success to non-English languages, we introduce UC2, the first machine-translation-augmented framework for cross-lingual cross-modal representation learning. To tackle the scarcity of multilingual captions for image datasets, we first augment existing English-only datasets with other languages via machine translation (MT). Then we extend the standard Masked Language Modeling and Image-Text Matching training objectives to the multilingual setting, where alignment between different languages is captured through shared visual context (i.e., using the image as a pivot). To facilitate the learning of a joint embedding space of images and all languages of interest, we further propose two novel pre-training tasks, namely Masked Region-to-Token Modeling (MRTM) and Visual Translation Language Modeling (VTLM), leveraging MT-enhanced translated data. Evaluation on multilingual image-text retrieval and multilingual visual question answering benchmarks demonstrates that our proposed framework achieves new state-of-the-art results on diverse non-English benchmarks while maintaining performance comparable to monolingual pre-trained models on English tasks.
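To make the image-as-pivot idea concrete, a much-simplified sketch: captions in every language are pulled toward the shared image embedding, which implicitly aligns the languages with one another. This is an in-batch contrastive illustration, not the paper's token-level MRTM/VTLM objectives:

```python
import torch
import torch.nn.functional as F

def pivot_alignment_loss(img_emb, cap_embs, temperature=0.07):
    """img_emb: (B, D); cap_embs: dict {lang: (B, D)} of caption embeddings."""
    img = F.normalize(img_emb, dim=1)
    labels = torch.arange(img.size(0), device=img.device)  # matched pairs on the diagonal
    loss = 0.0
    for emb in cap_embs.values():
        cap = F.normalize(emb, dim=1)
        logits = img @ cap.t() / temperature  # image-to-caption similarities
        loss = loss + F.cross_entropy(logits, labels)
    return loss / len(cap_embs)
```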
ISBN (print): 9781665445092
We propose HOI Transformer to tackle human-object interaction (HOI) detection in an end-to-end manner. Current approaches either decouple the HOI task into separate stages of object detection and interaction classification or introduce a surrogate interaction problem. In contrast, our method, named HOI Transformer, streamlines the HOI pipeline by eliminating the need for many hand-designed components. HOI Transformer reasons about the relations of objects and humans from global image context and directly predicts HOI instances in parallel. A quintuple matching loss is introduced to supervise HOI predictions in a unified way. Our method is conceptually much simpler and demonstrates improved accuracy. Without bells and whistles, HOI Transformer achieves 26.61% AP on HICO-DET and 52.9% AP(role) on V-COCO, surpassing previous methods with the advantage of being much simpler. We hope our approach will serve as a simple and effective alternative for HOI tasks.
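Set-based losses of this kind typically rely on a one-to-one assignment between predictions and ground truth, as in DETR. A sketch using Hungarian matching, where each cost entry is assumed to aggregate the five matched elements of a quintuple (e.g., human box, object box, object class, interaction class, confidence); the paper's exact cost terms may differ:

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def match_hoi(cost):
    """cost[i, j]: aggregated matching cost between prediction i and GT HOI j."""
    rows, cols = linear_sum_assignment(cost)  # optimal one-to-one assignment
    return list(zip(rows, cols))              # (prediction, ground-truth) pairs

# usage: pairs = match_hoi(np.random.rand(100, 4))  # 100 queries, 4 GT quintuples
```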
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Vision Transformers have demonstrated outstanding performance in computer vision tasks. Nevertheless, this superior performance for large models comes at the expense of increased memory usage for storing the parameters and intermediate activations. To accelerate model inference, in this work we develop and evaluate integer and mixed-precision kernels in Triton for the efficient execution of two fundamental building blocks of transformers, the linear layer and attention, on graphics processing units (GPUs). On an NVIDIA A100 GPU, our kernel implementations of Vision Transformers achieve a throughput speedup of up to 7x compared with reference kernels in PyTorch single-precision floating point (FP32), while the top-1 accuracy of the ViT-Large model drops by less than one percent on the ImageNet-1K classification task. We also observe up to 6x increased throughput by applying our kernels to the Segment Anything Model image encoder, while keeping the mIoU close to the FP32 reference on the COCO2017 dataset for both static and dynamic quantization. Furthermore, our kernels demonstrate improved speed over the TensorRT INT8 linear layer, and we improve the throughput of the base FP16 (half-precision) Triton attention on average by up to 19 ± 4.01%. We have open-sourced the QAtnn framework, which is tightly integrated with the PyTorch quantization workflow: https://***/IBM/qattn.
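For intuition, the arithmetic behind a dynamically quantized INT8 linear layer can be sketched in plain PyTorch (the open-sourced Triton kernels fuse and accelerate such steps on the GPU; the scale granularities here are assumptions, and stock PyTorch runs integer matmul on CPU only):

```python
import torch

def int8_dynamic_linear(x, weight, bias=None):
    """x: (N, in) FP32 activations; weight: (out, in) FP32 weights."""
    x_scale = x.abs().amax() / 127.0                          # per-tensor activation scale
    w_scale = weight.abs().amax(dim=1, keepdim=True) / 127.0  # per-row weight scales
    x_q = torch.clamp((x / x_scale).round(), -128, 127).to(torch.int8)
    w_q = torch.clamp((weight / w_scale).round(), -128, 127).to(torch.int8)
    acc = x_q.to(torch.int32) @ w_q.t().to(torch.int32)       # int8 x int8 -> int32 accumulate
    out = acc.to(torch.float32) * x_scale * w_scale.t()       # dequantize
    return out if bias is None else out + bias
```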
ISBN (print): 9781665445092
Artistic style transfer is an image editing task that aims at repainting everyday photographs with learned artistic styles. Existing methods learn styles from either a single style example or a collection of artworks. Accordingly, the stylization results are either inferior in visual quality or limited in style controllability. To tackle this problem, we propose a novel Dual Style-Learning Artistic Style Transfer (DualAST) framework to simultaneously learn both the holistic artist-style (from a collection of artworks) and the specific artwork-style (from a single style image): the artist-style sets the tone (i.e., the overall feeling) for the stylized image, while the artwork-style determines its details, such as color and texture. Moreover, we introduce a Style-Control Block (SCB) to adjust the styles of generated images with a set of learnable style-control factors. We conduct extensive experiments to evaluate the performance of the proposed framework, and the results confirm the superiority of our method.
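A hypothetical sketch of what a Style-Control Block could look like: learnable per-channel factors blend artist-style and artwork-style statistics before an AdaIN-style modulation of the content features (the paper's actual SCB design may differ):

```python
import torch
import torch.nn as nn

class StyleControlBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        # learnable style-control factors, one per channel
        self.alpha = nn.Parameter(torch.full((1, channels, 1, 1), 0.5))

    def forward(self, content, artist_stats, artwork_stats):
        """content: (B, C, H, W); *_stats: (mean, std) tensors of shape (B, C, 1, 1)."""
        a = torch.sigmoid(self.alpha)  # keep control factors in (0, 1)
        mean = a * artist_stats[0] + (1 - a) * artwork_stats[0]
        std = a * artist_stats[1] + (1 - a) * artwork_stats[1]
        c_mean = content.mean(dim=(2, 3), keepdim=True)
        c_std = content.std(dim=(2, 3), keepdim=True) + 1e-6
        return std * (content - c_mean) / c_std + mean  # AdaIN-style modulation
```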