Search Results - Inner Mongolia University Library

Search query: "Any field = IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"

12,859 records in total; showing records 4341-4350

XoFTR: Cross-modal Feature Matching Transformer

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Önder Tuzcuoğlu, Aybora Köksal, Buğra Sofu, Sinan Kalkan, A. Aydın Alatan

Affiliations: Dept. of Electrical and Electronics Eng., Center for Image Analysis, Middle East Technical University, Ankara, Turkey; ROKETSAN Inc., Ankara, Turkey; Dept. of Computer Eng., Center for Image Analysis, Middle East Technical University, Ankara, Turkey

ISBN: (digital) 9798350365474

ISBN: (print) 9798350365481

We introduce XoFTR, a cross-modal cross-view method for local feature matching between thermal infrared (TIR) and visible images. Unlike visible images, TIR images are less susceptible to adverse lighting and weather conditions but present difficulties in matching due to significant texture and intensity differences. Current hand-crafted and learning-based methods for visible-TIR matching fall short in handling viewpoint, scale, and texture diversities. To address this, XoFTR incorporates masked image modeling pre-training and fine-tuning with pseudo-thermal image augmentation to handle the modality differences. Additionally, we introduce a refined matching pipeline that adjusts for scale discrepancies and enhances match reliability through sub-pixel level refinement. To validate our approach, we collect a comprehensive visible-thermal dataset, and show that our method outperforms existing methods on many benchmarks. Code and dataset at https://***/OnderT/XoFTR.

Keywords: Learning systems; Image matching; Pipelines; Lighting; Benchmark testing; Transformers; Image augmentation

Holistic 3D Human and Scene Mesh Estimation from Single View Images

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Weng, Zhenzhen; Yeung, Serena

Affiliations: Stanford Univ, Stanford, CA 94305, USA

ISBN: (print) 9781665445092

The 3D world limits the human body pose and the human body pose conveys information about the surrounding objects. Indeed, from a single image of a person placed in an indoor scene, we as humans are adept at resolving ambiguities of the human pose and room layout through our knowledge of the physical laws and prior perception of the plausible object and human poses. However, few computer vision models fully leverage this fact. In this work, we propose a holistically trainable model that perceives the 3D scene from a single RGB image, estimates the camera pose and the room layout, and reconstructs both human body and object meshes. By imposing a set of comprehensive and sophisticated losses on all aspects of the estimations, we show that our model outperforms existing human body mesh methods and indoor scene reconstruction methods. To the best of our knowledge, this is the first model that outputs both object and human predictions at the mesh level, and performs joint optimization on the scene and human poses.

Keywords: Measurement; Solid modeling; Computer vision; Three-dimensional displays; Computational modeling; Biological system modeling; Pose estimation

Certified Adversarial Robustness Within Multiple Perturbation Bounds

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Soumalya Nandi, Sravanti Addepalli, Harsh Rangwani, R. Venkatesh Babu

Affiliations: Vision and AI Lab, Indian Institute of Science, Bengaluru

Randomized smoothing (RS) is a well-known certified defense against adversarial attacks, which creates a smoothed classifier by predicting the most likely class under random noise perturbations of inputs during inference. While initial work focused on robustness to ℓ₂ norm perturbations using noise sampled from a Gaussian distribution, subsequent works have shown that different noise distributions can result in robustness to other ℓₚ norm bounds as well. In general, a specific noise distribution is optimal for defending against a given ℓₚ norm based attack. In this work, we aim to improve the certified adversarial robustness against multiple perturbation bounds simultaneously. Towards this, we first present a novel certification scheme that effectively combines the certificates obtained using different noise distributions to obtain optimal results against multiple perturbation bounds. We further propose a novel training noise distribution along with a regularized training scheme to improve the certification within both ℓ₁ and ℓ₂ perturbation norms simultaneously. Contrary to prior works, we compare the certified robustness of different training algorithms across the same natural (clean) accuracy, rather than across fixed noise levels used for training and certification. We also empirically invalidate the argument that training and certifying the classifier with the same amount of noise gives the best results. The proposed approach achieves improvements on the ACR (Average Certified Radius) metric across both ℓ₁ and ℓ₂ perturbation bounds. Code available at https://***/valiisc/NU-Certified-Robustness

Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling

18th IEEE/CVF International Conference on Computer Vision (ICCV)

Authors: Lu, Xiaopeng; Fan, Zhen; Wang, Yansen; Oh, Jean; Rose, Carolyn P.

Affiliations: Carnegie Mellon Univ, Language Technol Inst, 5000 Forbes Ave, Pittsburgh, PA 15213, USA

ISBN: (print) 9781665401913

As an important task in multimodal context understanding, Text-VQA (Visual Question Answering) aims at question answering through reading text information in images. It differs from the original VQA task in that Text-VQA requires a large amount of scene-text relationship understanding, in addition to the cross-modal grounding capability. In this paper, we propose Localize, Group, and Select (LOGOS), a novel model which attempts to tackle this problem from multiple aspects. LOGOS leverages two grounding tasks to better localize the key information of the image, utilizes scene text clustering to group individual OCR tokens, and learns to select the best answer from different sources of OCR (Optical Character Recognition) texts. Experiments show that LOGOS outperforms previous state-of-the-art methods on two Text-VQA benchmarks without using additional OCR annotation data. Ablation studies and analysis demonstrate the capability of LOGOS to bridge different modalities and better understand scene text.

Keywords: Integrated optics; Visualization; Computer vision; Grounding; Conferences; Computational modeling; Knowledge discovery

Achieving robustness in classification using optimal transport with hinge regularization

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Serrurier, Mathieu; Mamalet, Franck; Gonzalez-Sanz, Alberto; Boissin, Thibaut; Loubes, Jean-Michel; del Barrio, Eustasio

Affiliations: Univ Paul Sabatier, Toulouse, France; IRT St Exupery, Toulouse, France; Univ Valladolid, Valladolid, Spain

ISBN: (print) 9781665445092

Adversarial examples have pointed out the vulnerability of deep neural networks to small local noise. It has been shown that constraining their Lipschitz constant should enhance robustness, but makes them harder to learn with classical loss functions. We propose a new framework for binary classification, based on optimal transport, which integrates this Lipschitz constraint as a theoretical requirement. We propose to learn 1-Lipschitz networks using a new loss that is a hinge-regularized version of the Kantorovich-Rubinstein dual formulation for the Wasserstein distance estimation. This loss function has a direct interpretation in terms of adversarial robustness together with a certifiable robustness bound. We also prove that this hinge-regularized version is still the dual formulation of an optimal transportation problem, and has a solution. We also establish several geometrical properties of this optimal solution, and extend the approach to multi-class problems. Experiments show that the proposed approach provides the expected guarantees in terms of robustness without any significant accuracy drop. The adversarial examples, on the proposed models, visibly and meaningfully change the input, providing an explanation for the classification.

Keywords: Computer vision; Computational modeling; Transportation; Estimation; Fasteners; Robustness; Pattern recognition

SLADE: A Self-Training Framework For Distance Metric Learning

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Duan, Jiali; Lin, Yen-Liang; Son Tran; Davis, Larry S.; Kuo, C-C Jay

Affiliations: Univ Southern Calif, Los Angeles, CA 90089, USA; Amazon, Seattle, WA, USA

ISBN: (print) 9781665445092

Most existing distance metric learning approaches use fully labeled data to learn the sample similarities in an embedding space. We present a self-training framework, SLADE, to improve retrieval performance by leveraging additional unlabeled data. We first train a teacher model on the labeled data and use it to generate pseudo labels for the unlabeled data. We then train a student model on both labels and pseudo labels to generate final feature embeddings. We use self-supervised representation learning to initialize the teacher model. To better deal with noisy pseudo labels generated by the teacher network, we design a new feature basis learning component for the student network, which learns basis functions of feature representations for unlabeled data. The learned basis vectors better measure the pairwise similarity and are used to select high-confidence samples for training the student network. We evaluate our method on standard retrieval benchmarks: CUB-200, Cars196 and In-shop. Experimental results demonstrate that with additional unlabeled data, our approach significantly improves the performance over the state-of-the-art methods.

Keywords: Training; Computer vision; Art; Benchmark testing; Extraterrestrial measurements; Data models; Pattern recognition

A Dual-stream Framework for 3D Mask Face Presentation Attack Detection

18th IEEE/CVF International Conference on Computer Vision (ICCV)

Authors: Chen, Shen; Yao, Taiping; Zhang, Keyue; Chen, Yang; Sun, Ke; Ding, Shouhong; Li, Jilin; Huang, Feiyue; Ji, Rongrong

Affiliations: Tencent YouTu Lab, Shenzhen, Peoples R China; Xiamen Univ, Media Analyt & Comp Lab, Xiamen, Peoples R China

ISBN: (print) 9781665401913

Face presentation attack detection (PAD) plays a vital role in face recognition systems. Many previous face anti-spoofing methods mainly focus on 2D face representation attacks and suffer from great performance degradation when facing high-fidelity 3D mask attacks. To address this issue, we propose a novel dual-stream framework consisting of the vanilla convolution stream and the central difference convolution stream. These two streams complement each other and learn more comprehensive features for 3D mask attack detection. Moreover, we extend 3D PAD to a multi-classification task that contains real face, plaster attack and transparent attack, and utilize various data augmentations and label smoothing techniques to improve the generalizability on unseen attacks. The proposed method achieved second place in the Chalearn 3D High-Fidelity Mask Face Presentation Attack Detection Challenge@ICCV2021 with a score of 3.15 (ACER).

Keywords: Degradation; Computer vision; Three-dimensional displays; Smoothing methods; Convolution; Face recognition; Conferences

DECNet: A Non-Contacting Dual-Modality Emotion Classification Network for Driver Health Monitoring

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Zhekang Dong, Chenhao Hu, Shiqi Zhou, Liyan Zhu, Junfan Wang, Yi Chen, Xudong Lv, Xiaoyue Ji

Affiliations: Hangzhou Dianzi University; Zhejiang Provincial Key Laboratory of Equipment Electronics; Zhejiang University; Tsinghua University

ISBN: (digital) 9798350365474

ISBN: (print) 9798350365481

Negative emotions have been identified as significant factors influencing driver behavior, easily leading to extremely serious traffic accidents. Hence, there is a pressing need to develop an automatic emotion classification method for driver health monitoring and road safety improvement. Most existing methods focus on single modalities, resulting in suboptimal classification performance due to the underutilization of heterogeneous information. In this work, we propose a novel non-contacting dual-modality driver emotion classification network (DECNet) to address these limitations. DECNet consists of three key modules: 1) facial video modality processing module; 2) driving behavior modality processing module; 3) fusion decision module. Meanwhile, we introduce a combined multi-task learning strategy within DECNet to improve the efficacy in the driver emotion classification task. To evaluate the effectiveness of the proposed DECNet, we conducted experiments on the PPB-Emo dataset; the experimental results show its superiority in terms of accuracy (⩾ 6.12% Acc-7) and F1-score (⩾ 7.25% F1-7) compared to existing state-of-the-art methods. The model and code will be available at https://***/fqfqngxhs/***

Keywords: Smart cities; Pressing; Feature extraction; Multitasking; Road safety; Safety; Pattern recognition

Unsupervised Part Segmentation through Disentangling Appearance and Shape

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liu, Shilong; Zhang, Lei; Yang, Xiao; Su, Hang; Zhu, Jun

Affiliations: Tsinghua Univ, Dept Comp Sci & Tech, BNRist Ctr, Inst AI, Tsinghua Bosch Joint ML Ctr, Beijing 100084, Peoples R China; Microsoft Corp, Redmond, WA 98052, USA

ISBN: (print) 9781665445092

We study the problem of unsupervised discovery and segmentation of object parts, which, as an intermediate local representation, are capable of finding intrinsic object structure and providing more explainable recognition results. Recent unsupervised methods have greatly relaxed the dependency on annotated data, which are costly to obtain, but still rely on additional information such as an object segmentation mask or saliency map. To remove such a dependency and further improve the part segmentation performance, we develop a novel approach by disentangling the appearance and shape representations of object parts, followed by reconstruction losses, without using additional object mask information. To avoid degenerate solutions, a bottleneck block is designed to squeeze and expand the appearance representation, leading to a more effective disentanglement between geometry and appearance. Combined with a self-supervised part classification loss and an improved geometry concentration constraint, we can segment more consistent parts with semantic meanings. Comprehensive experiments on a wide variety of objects such as face, bird, and PASCAL VOC objects demonstrate the effectiveness of the proposed method.

Keywords: Geometry; Computer vision; Shape; Annotations; Face recognition; Semantics; Neural networks

Linguistic Structures as Weak Supervision for Visual Scene Graph Generation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ye, Keren; Kovashka, Adriana

Affiliations: Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260, USA

ISBN: (print) 9781665445092

Prior work in scene graph generation requires categorical supervision at the level of triplets (subjects and objects, and predicates that relate them), either with or without bounding box information. However, scene graph generation is a holistic task: thus holistic, contextual supervision should intuitively improve performance. In this work, we explore how linguistic structures in captions can benefit scene graph generation. Our method captures the information provided in captions about relations between individual triplets, and context for subjects and objects (e.g. visual properties are mentioned). Captions are a weaker type of supervision than triplets since the alignment between the exhaustive list of human-annotated subjects and objects in triplets, and the nouns in captions, is weak. However, given the large and diverse sources of multimodal data on the web (e.g. blog posts with images and captions), linguistic supervision is more scalable than crowdsourced triplets. We show extensive experimental comparisons against prior methods which leverage instance- and image-level supervision, and ablate our method to show the impact of leveraging phrasal and sequential context, and techniques to improve localization of subjects and objects.

Keywords: Location awareness; Visualization; Computer vision; Blogs; Linguistics; Pattern recognition; Noise measurement
