ISBN (Print): 9781665448994
In this paper, we propose to investigate out-of-domain visio-linguistic pretraining, where the pretraining data distribution differs from that of the downstream data on which the pretrained model will be fine-tuned. Existing methods for this problem are purely likelihood-based, which leads to spurious correlations and hurts generalization when the model is transferred to out-of-domain downstream tasks. By spurious correlation, we mean that the conditional probability of one token (object or word) given another can be high (due to dataset biases) without a robust (causal) relationship between them. To mitigate such dataset biases, we propose a Deconfounded Visio-Linguistic BERT framework, abbreviated as DeVLBert, to perform intervention-based learning. We borrow the idea of the backdoor adjustment from the field of causality and propose several neural-network-based architectures for BERT-style out-of-domain pretraining. Quantitative results on three downstream tasks, Image Retrieval (IR), Zero-shot IR, and Visual Question Answering, show the effectiveness of DeVLBert in boosting generalization ability.
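The intervention named above is the standard backdoor adjustment from causal inference; as a reference, the generic formula is sketched below (the choice of the confounder set Z and its neural parameterization in DeVLBert are specific to the paper and not reproduced here).

```latex
% Backdoor adjustment: the effect of token X on token Y after intervening on X,
% obtained by stratifying over an (assumed observable) confounder Z.
\[
P\big(Y \mid \mathrm{do}(X)\big) \;=\; \sum_{z} P\big(Y \mid X,\, Z = z\big)\, P\big(Z = z\big)
\]
```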
ISBN (Print): 9781665448994
The optimization of Binary Neural Networks (BNNs) relies on approximating the real-valued weights with their binarized representations. Current techniques for weight updating use the same approaches as traditional Neural Networks (NNs), with the extra requirement of approximating the derivative of the sign function (which is the Dirac delta function) for back-propagation; thus, efforts have focused on adapting full-precision techniques to work on BNNs. In the literature, only one previous effort has tackled the problem of directly training BNNs with bit-flips, by using the first raw moment estimate of the gradients and comparing it against a threshold to decide when to flip a weight (Bop). In this paper, we take an approach parallel to Adam, which also uses the second raw moment estimate to normalize the first raw moment before comparing it with the threshold; we call this method Bop2ndOrder. We present two versions of the proposed optimizer, a biased one and a bias-corrected one, each with its own applications. We also present a complete ablation study of the hyperparameter space, as well as the effect of using schedulers on each hyperparameter. For these studies, we tested the optimizer on CIFAR10 using the BinaryNet architecture. We also tested it on ImageNet 2012 with the XnorNet and BiRealNet architectures for accuracy. On both datasets, our approach converged faster, was robust to changes of the hyperparameters, and achieved better accuracy values.
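The abstract states the update rule only at a high level; the sketch below is one plausible reading of a bit-flip step that normalizes the first raw moment by the square root of the second before thresholding. The hyperparameter names (gamma, tau, eps) and the exact flip condition are assumptions, not the paper's definitive formulation.

```python
import numpy as np

def bop2ndorder_step(w, grad, m, v, tau=1e-6, gamma=1e-4, eps=1e-8):
    """Illustrative bit-flip update for a binary weight tensor w in {-1, +1}.

    m and v are exponential moving averages of the gradient and its square;
    a weight is flipped when the normalized momentum agrees with the current
    sign strongly enough to exceed the threshold tau.
    """
    m = (1 - gamma) * m + gamma * grad           # first raw moment estimate
    v = (1 - gamma) * v + gamma * grad ** 2      # second raw moment estimate
    s = m / (np.sqrt(v) + eps)                   # normalized momentum
    flip = s * w > tau                           # flip condition (assumed form)
    w = np.where(flip, -w, w)                    # apply the bit-flips
    return w, m, v
```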
ISBN (Print): 9781665445092
Extensive research in neural style transfer methods has shown that the correlation between features extracted by a pre-trained VGG network has a remarkable ability to capture the visual style of an image. Surprisingly, however, this stylization quality is not robust and often degrades significantly when applied to features from more advanced and lightweight networks, such as those in the ResNet family. By performing extensive experiments with different network architectures, we find that residual connections, which represent the main architectural difference between VGG and ResNet, produce feature maps of small entropy, which are not suitable for style transfer. To improve the robustness of the ResNet architecture, we then propose a simple yet effective solution based on a softmax transformation of the feature activations that enhances their entropy. Experimental results demonstrate that this small magic can greatly improve the quality of stylization results, even for networks with random weights. This suggests that the architecture used for feature extraction is more important than the use of learned weights for the task of style transfer.
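The proposed fix, a softmax transformation of the feature activations before the style statistics are computed, can be sketched roughly as below; applying the softmax over the channel dimension and the unit temperature are assumptions for illustration, and the paper may place the transformation differently.

```python
import torch
import torch.nn.functional as F

def softmax_gram(feat, temperature=1.0):
    """Gram-style statistics from softmax-smoothed activations.

    feat: (B, C, H, W) feature map from any backbone (VGG, ResNet, ...).
    The softmax raises the entropy of peaky activations before correlating.
    """
    b, c, h, w = feat.shape
    x = F.softmax(feat / temperature, dim=1)          # smooth across channels
    x = x.reshape(b, c, h * w)
    return torch.bmm(x, x.transpose(1, 2)) / (h * w)  # (B, C, C) correlations
```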
ISBN (Print): 9781665445092
3D morphable models are widely used for the shape representation of an object class in computer vision and graphics applications. In this work, we focus on deep 3D morphable models that directly apply deep learning on 3D mesh data with a hierarchical structure to capture information at multiple scales. While great efforts have been made to design the convolution operator, how to best aggregate vertex features across hierarchical levels deserves further attention. In contrast to resorting to mesh decimation, we propose an attention-based module to learn mapping matrices for better feature aggregation across hierarchical levels. Specifically, the mapping matrices are generated by a compatibility function of the keys and queries. The keys and queries are trainable variables, learned by optimizing the target objective, and shared by all data samples of the same object class. Our proposed module can be used as a train-only drop-in replacement for the feature aggregation in existing architectures, for both downsampling and upsampling. Our experiments show that through the end-to-end training of the mapping matrices, we achieve state-of-the-art results on a variety of 3D shape datasets in comparison to existing morphable models.
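The mapping matrices are described as a compatibility function of trainable keys and queries shared by all samples of a class; a minimal sketch of that idea, assuming a scaled dot-product followed by a softmax as the compatibility function, could look like the following (names and dimensions are illustrative).

```python
import torch
import torch.nn as nn

class AttentionAggregation(nn.Module):
    """Learn a fixed mapping from n_in vertices to n_out vertices.

    Keys and queries are trainable parameters rather than data-dependent
    projections, so the resulting mapping matrix is shared by all samples.
    """
    def __init__(self, n_in, n_out, dim=64):
        super().__init__()
        self.queries = nn.Parameter(torch.randn(n_out, dim))
        self.keys = nn.Parameter(torch.randn(n_in, dim))

    def forward(self, x):                          # x: (B, n_in, feat_dim)
        compat = self.queries @ self.keys.t()      # (n_out, n_in) compatibility
        mapping = torch.softmax(compat / self.keys.shape[1] ** 0.5, dim=-1)
        return mapping @ x                         # (B, n_out, feat_dim)
```

The same module covers downsampling (n_out < n_in) and upsampling (n_out > n_in), matching the drop-in use described above.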
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Video instance segmentation (VIS) is a challenging vision task that aims to detect, segment, and track objects in videos. Conventional VIS methods rely on densely annotated object masks, which are expensive to obtain. We reduce the human annotation to only one point per object in a video frame during training, and obtain high-quality mask predictions close to those of fully supervised models. Our proposed training method consists of a class-agnostic proposal generation module that provides rich negative samples and a spatio-temporal point-based matcher that matches object queries with the provided point annotations. Comprehensive experiments on three VIS benchmarks demonstrate the competitive performance of the proposed framework, nearly matching fully supervised methods.
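As a rough illustration of point-based matching (not the paper's exact cost, which the abstract does not specify), each object query can be scored by how strongly its predicted mask covers the single annotated point of each ground-truth object, followed by a Hungarian assignment:

```python
import torch
from scipy.optimize import linear_sum_assignment

def match_queries_to_points(pred_masks, points):
    """pred_masks: (Q, H, W) per-query mask logits; points: (N, 2) (y, x) clicks.

    The cost is the negative mask probability at each annotated point, so the
    assignment prefers queries whose masks already cover that point.
    """
    probs = pred_masks.sigmoid()                      # (Q, H, W)
    cost = -probs[:, points[:, 0], points[:, 1]]      # (Q, N) cost matrix
    row, col = linear_sum_assignment(cost.detach().cpu().numpy())
    return list(zip(row.tolist(), col.tolist()))      # (query_idx, point_idx)
```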
In recent times, face biometric systems have been widely recognized and have gained attention in computer vision. However, biometric face recognition systems are susceptible to face spoofing attacks, where an attacker uses a ...
ISBN (Digital): 9798350353006
ISBN (Print): 9798350353013
Current approaches in Group Activity Recognition (GAR) predominantly emphasize Human Relations (HRs) while often neglecting the impact of Human-Object Interactions (HOIs). This study prioritizes the consideration of both HRs and HOIs, emphasizing their interdependence. Notably, employing Granger Causality Tests reveals the presence of bidirectional causality between HRs and HOIs. Leveraging this insight, we propose a Bidirectional-Causal GAR network. This network establishes a causality communication channel while modeling relations and interactions, enabling reciprocal enhancement between human-object interactions and human relations and ensuring their mutual consistency. Additionally, an Interaction Module is devised to effectively capture the dynamic nature of human-object interactions. Comprehensive experiments conducted on two publicly available datasets showcase the superiority of our proposed method over state-of-the-art approaches. Our project page: https://***/***/
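For readers unfamiliar with the test mentioned above, a pairwise Granger causality check between two scalar time series can be run with statsmodels; the per-frame HR and HOI signals below are made up for illustration, since the abstract does not say which quantities the paper feeds into the test.

```python
import numpy as np
from statsmodels.tsa.stattools import grangercausalitytests

rng = np.random.default_rng(0)
hoi = rng.standard_normal(200)                          # hypothetical HOI signal
hr = np.roll(hoi, 2) + 0.5 * rng.standard_normal(200)   # HR lags HOI by two steps

# Tests whether the second column helps predict the first, up to maxlag lags.
grangercausalitytests(np.column_stack([hr, hoi]), maxlag=3)
# Swap the columns to probe the reverse direction and check bidirectionality.
```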
ISBN (Print): 9781665445092
Many computer vision tasks address the problem of scene understanding and are naturally interrelated, e.g., object classification, detection, scene segmentation, and depth estimation. We show that we can leverage the inherent relationships among collections of tasks, as they are trained jointly, supervising each other through their known relationships via consistency losses. Furthermore, explicitly utilizing the relationships between tasks allows improving their performance while dramatically reducing the need for labeled data, and allows training with additional unsupervised or simulated data. We demonstrate a distributed joint training algorithm with task-level parallelism, which affords a high degree of asynchronicity and robustness. This allows learning across multiple tasks, or with large amounts of input data, at scale. We demonstrate our framework on subsets of the following collection of tasks: depth and normal prediction, semantic segmentation, 3D motion and egomotion estimation, and object tracking and 3D detection in point clouds. We observe improved performance across these tasks, especially in the low-label regime.
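One concrete instance of such a consistency loss (the depth/normal pairing here is illustrative, not a claim about the paper's exact loss set) penalizes disagreement between normals derived from a predicted depth map and a separately predicted normal map:

```python
import torch
import torch.nn.functional as F

def depth_normal_consistency(depth, normals):
    """depth: (B, 1, H, W); normals: (B, 3, H, W), unit-length predictions.

    Finite differences of depth yield a surface-normal estimate; the loss is
    one minus its cosine similarity with the directly predicted normals.
    """
    dzdx = F.pad(depth[:, :, :, 1:] - depth[:, :, :, :-1], (0, 1))        # x-gradient
    dzdy = F.pad(depth[:, :, 1:, :] - depth[:, :, :-1, :], (0, 0, 0, 1))  # y-gradient
    n_from_depth = F.normalize(
        torch.cat([-dzdx, -dzdy, torch.ones_like(depth)], dim=1), dim=1)
    return (1 - F.cosine_similarity(n_from_depth, normals, dim=1)).mean()
```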
ISBN (Print): 9781665445092
We propose Pulsar, an efficient sphere-based differentiable rendering module that is orders of magnitude faster than competing techniques, modular, and easy-to-use due to its tight integration with PyTorch. Differentiable rendering is the foundation for modern neural rendering approaches, since it enables end-to-end training of 3D scene representations from image observations. However, gradient-based optimization of neural mesh, voxel, or function representations suffers from multiple challenges, i.e., topological inconsistencies, high memory footprints, or slow rendering speeds. To alleviate these problems, Pulsar employs: 1) a sphere-based scene representation, 2) a modular, efficient differentiable projection operation, and 3) (optional) neural shading. Pulsar executes orders of magnitude faster than existing techniques and allows real-time rendering and optimization of representations with millions of spheres. Using spheres for the scene representation, unprecedented speed is obtained while avoiding topology problems. Pulsar is fully differentiable and thus enables a plethora of applications, ranging from 3D reconstruction to neural rendering.
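Pulsar's actual interface ships with its PyTorch extension and is not reproduced here; the toy sketch below only illustrates the general idea of sphere-based, fully differentiable image formation, with an orthographic projection and soft Gaussian footprints chosen purely for brevity (no depth ordering or shading).

```python
import torch

def splat_spheres(centers, radii, colors, hw=(64, 64)):
    """Toy differentiable splatting of N spheres onto an image plane.

    centers: (N, 3) with xy in [-1, 1]; radii: (N,); colors: (N, 3).
    Each sphere contributes a soft footprint, so gradients flow back to
    positions, radii and colors through the rendered image.
    """
    h, w = hw
    ys, xs = torch.meshgrid(
        torch.linspace(-1, 1, h), torch.linspace(-1, 1, w), indexing="ij")
    pix = torch.stack([xs, ys], dim=-1).reshape(-1, 2)            # (HW, 2)
    d2 = ((pix[:, None, :] - centers[None, :, :2]) ** 2).sum(-1)  # (HW, N)
    weight = torch.exp(-d2 / (2 * radii[None, :] ** 2 + 1e-8))    # soft footprints
    weight = weight / (weight.sum(-1, keepdim=True) + 1e-8)       # blend weights
    return (weight @ colors).reshape(h, w, 3)                     # (H, W, 3) image
```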
ISBN (Print): 9781665445092
In this paper, we introduce the new task of reconstructing 3D human pose from a single image in which we can see the person and the person's image through a mirror. Compared to general scenarios of 3D pose estimation from a single view, the mirror reflection provides an additional view for resolving the depth ambiguity. We develop an optimization-based approach that exploits mirror symmetry constraints for accurate 3D pose reconstruction. We also provide a method to estimate the surface normal of the mirror from vanishing points in the single image. To validate the proposed approach, we collect a large-scale dataset named Mirrored-Human, which covers a large variety of human subjects, poses and backgrounds. The experiments demonstrate that, when trained on Mirrored-Human with our reconstructed 3D poses as pseudo ground-truth, the accuracy and generalizability of existing single-view 3D pose estimators can be largely improved. The code and dataset are available at https://***/Mirrored-Human/.
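The mirror-symmetry constraint exploited by the optimization has a compact standard form; with a unit mirror-plane normal n and offset d (a generic reflection identity from geometry, not the paper's exact parameterization), a real 3D joint p and its counterpart p' seen in the mirror should satisfy:

```latex
% Reflection of a point p across the mirror plane {x : n^T x = d}, ||n|| = 1.
\[
p' \;=\; p \;-\; 2\,\big(n^{\top} p - d\big)\, n
\]
% The reconstructed joints of the real person and of the mirrored person are
% encouraged to satisfy this relation (up to the left/right swap a mirror induces).
```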