Details
ISBN: (Print) 9781665448994
Facial micro-expressions are brief, rapid, spontaneous movements of the facial muscles that express an individual's genuine emotions. Because of their short duration and subtlety, detecting and classifying these micro-expressions is difficult for both humans and machines. In this paper, a novel approach is proposed that exploits relationships between landmark points and the optical-flow patch for each landmark point. It consists of a two-stream graph attention convolutional network that extracts the relationships between the landmark points and the local texture captured by an optical-flow patch. A graph structure is built over a triplet of frames to extract temporal information. One stream processes node (landmark) locations, and the other processes patches of optical-flow information. These two streams (the node-location stream and the optical-flow stream) are fused for classification. Results are reported on two publicly available datasets, CASME II and SAMM, for three-class and five-class micro-expression recognition. The proposed approach outperforms state-of-the-art methods in both the 3- and 5-category settings.
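The fusion step described above can be sketched in miniature. This is a hedged toy illustration of late fusion of two feature streams, not the paper's graph attention network: both stream functions below are hypothetical stand-ins.

```python
# Toy sketch of two-stream late fusion (assumed simplification of the
# paper's node-location + optical-flow design; the real streams are
# graph attention convolutional networks).

def node_location_stream(landmarks):
    # Toy "feature": flatten (x, y) landmark coordinates.
    return [c for point in landmarks for c in point]

def optical_flow_stream(flow_patches):
    # Toy "feature": mean absolute flow per patch.
    return [sum(abs(v) for v in patch) / len(patch) for patch in flow_patches]

def fuse_and_classify(landmarks, flow_patches, weights):
    # Late fusion by concatenation, then one linear score per class.
    fused = node_location_stream(landmarks) + optical_flow_stream(flow_patches)
    scores = [sum(w * f for w, f in zip(row, fused)) for row in weights]
    return scores.index(max(scores))  # predicted class id

landmarks = [(0.1, 0.2), (0.3, 0.1)]           # 2 landmark points -> 4 features
patches = [[0.5, -0.2, 0.1], [0.0, 0.3, 0.4]]  # 2 flow patches -> 2 features
weights = [[1, 0, 0, 0, 0, 0],                 # class 0 weighs locations
           [0, 0, 0, 0, 1, 1]]                 # class 1 weighs flow
pred = fuse_and_classify(landmarks, patches, weights)  # -> 1 on this toy input
```

The design choice illustrated is that each stream keeps its own representation until the final fused classifier, so location and texture cues are learned independently.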
Details
ISBN: (Print) 9781665448994
Recent years have seen a surge of interest in finding associations between faces and voices in cross-modal biometric applications, alongside speaker recognition. Inspired by this, we introduce the challenging task of establishing associations between faces and voices across multiple languages spoken by the same set of persons. The aim of this paper is to answer two closely related questions: "Is face-voice association language independent?" and "Can a speaker be recognized irrespective of the spoken language?". These two questions are important for understanding the effectiveness of, and for boosting the development of, multilingual biometric systems. To answer them, we collected a Multilingual Audio-Visual dataset containing human speech clips of 154 identities, with annotations for 3 languages, extracted from various videos uploaded online. Extensive experiments on the two splits of the proposed dataset investigate and answer these novel research questions, clearly demonstrating the relevance of the multilingual problem.
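One way to frame the language-independence question is as cross-modal retrieval: a face embedding should retrieve the same speaker's voice embedding regardless of the language spoken. The sketch below is a hedged illustration with made-up toy embeddings; it is not the paper's model or protocol.

```python
import math

# Toy sketch: face-voice matching by cosine similarity. If the retrieved
# speaker is stable across languages, the association is language
# independent for this (fabricated) example.

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def match_face_to_voice(face_emb, voice_embs):
    sims = [cosine(face_emb, v) for v in voice_embs]
    return sims.index(max(sims))  # index of the best-matching speaker

face = [1.0, 0.0, 0.5]
voices_lang_a = [[0.9, 0.1, 0.4], [0.0, 1.0, 0.0]]  # speakers A, B in language 1
voices_lang_b = [[1.0, 0.2, 0.6], [0.1, 0.9, 0.1]]  # same speakers in language 2
same = match_face_to_voice(face, voices_lang_a) == match_face_to_voice(face, voices_lang_b)
```

Here `same` is True, mimicking a language-independent association; on real embeddings this agreement is what the experiments measure.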
Details
ISBN: (Print) 9781665448994
Few-shot learning (FSL) approaches, mostly neural network-based, assume that pre-trained knowledge can be obtained from base (seen) categories and transferred to novel (unseen) categories. However, the black-box nature of neural networks makes it difficult to understand what is actually transferred, which may hamper their application in risk-sensitive areas. In this paper, we present a new way to perform explainable FSL for image classification, using discriminative patterns and pairwise matching. Experimental results show that the proposed method achieves satisfactory explainability on two mainstream datasets. Code is available*.
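The pairwise-matching idea can be illustrated with a toy sketch: classify a query by matching its local patterns against each class's discriminative patterns, and return the matched pairs as the explanation. Everything below (the distance, the patterns) is a hypothetical simplification, not the paper's learned representation.

```python
# Toy sketch of explainable classification via pairwise pattern matching.
# Patterns are plain feature vectors standing in for discriminative patches.

def pattern_distance(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))

def classify_with_explanation(query_patterns, class_patterns):
    best_class, best_score, best_pairs = None, float("inf"), []
    for cls, patterns in class_patterns.items():
        # Match each query pattern to its nearest class pattern.
        pairs = [(qp, min(patterns, key=lambda cp: pattern_distance(qp, cp)))
                 for qp in query_patterns]
        score = sum(pattern_distance(qp, cp) for qp, cp in pairs)
        if score < best_score:
            best_class, best_score, best_pairs = cls, score, pairs
    # The matched pairs are human-inspectable evidence for the decision.
    return best_class, best_pairs

query = [[1.0, 0.0], [0.9, 0.1]]
protos = {"cat": [[1.0, 0.0]], "dog": [[0.0, 1.0]]}
label, evidence = classify_with_explanation(query, protos)  # label == "cat"
```

The point of the design is that the decision decomposes into visible pattern pairs rather than an opaque score, which is what makes the prediction auditable.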
Details
ISBN: (Print) 9781665448994
In the biometrics context, the ability to provide the reasoning behind a decision has been at the core of major research efforts. Explanations serve not only to increase trust amongst the users of a system, but also to augment the system's overall accountability and transparency. In this work, we describe a periocular recognition framework that not only performs biometric recognition but also provides visual representations of the features/regions that supported a decision. Designed in particular to explain non-match ("impostor") decisions, our solution uses adversarial generative techniques to synthesise a large set of "genuine" image pairs, from which the elements most similar to a query are retrieved. Then, assuming alignment between the query/retrieved pairs, the element-wise differences between the query and a weighted average of the retrieved elements yield a visual explanation of the regions in the query pair that would have to change to turn it into a "genuine" pair. Our quantitative and qualitative experiments validate the proposed solution, yielding recognition rates similar to the state of the art while, most importantly, also providing visual explanations for every decision.
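The explanation step described above reduces to a weighted average and an element-wise difference. The sketch below illustrates it on flat toy vectors standing in for aligned periocular images; the retrieval and generative synthesis stages are assumed to have already produced the "genuine" neighbours.

```python
# Toy sketch of the non-match explanation: large entries in the difference
# map mark regions the query would have to change to become "genuine".

def weighted_average(retrieved, weights):
    total = sum(weights)
    return [sum(w * img[i] for w, img in zip(weights, retrieved)) / total
            for i in range(len(retrieved[0]))]

def explanation_map(query, retrieved, weights):
    avg = weighted_average(retrieved, weights)
    return [abs(q - a) for q, a in zip(query, avg)]

query = [0.2, 0.8, 0.5]                          # 3 "pixels"
retrieved = [[0.2, 0.1, 0.5], [0.2, 0.3, 0.5]]   # two similar genuine pairs
heat = explanation_map(query, retrieved, weights=[0.5, 0.5])
# The middle element differs most, so it drives the non-match decision.
```

On real images the same computation yields a heat map over pixels, which is the visual explanation reported in the paper.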
The stomatopod (mantis shrimp) visual system has recently provided a blueprint for the design of paradigm-shifting polarization and multispectral imaging sensors, enabling solutions to challenging medical and remote s...
Details
Details
ISBN: (Print) 9781665448994
Deep Neural Networks are brittle in that small changes in the input can drastically affect their prediction outcome and confidence. Consequently, research in this area has mainly focused on adversarial attacks and defenses. In this paper, we take an alternative stance and introduce the concept of Assistive Signals: perturbations optimized to improve a model's confidence score regardless of whether it is under attack. We analyze some interesting properties of these assistive perturbations and extend the idea to optimizing them in 3D space, simulating different lighting conditions and viewing angles. Experimental evaluations show that the assistive signals generated by our optimization method increase the accuracy and confidence of deep models more than those generated by conventional methods that operate in 2D space. Assistive Signals also reveal the bias of ML models towards certain patterns in real-life objects.
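The core idea, a perturbation optimized to raise confidence rather than lower it, can be sketched with a toy model and a simple random search. This is a hedged illustration only: the scoring function is fabricated, and the paper's method optimizes in a rendered 3D scene, not with this search.

```python
import random

# Toy sketch of an "assistive signal": a bounded input perturbation that
# increases a model's confidence. Random search stands in for the paper's
# optimizer; confidence() is a made-up stand-in for a real model.

def confidence(x):
    # Toy model: confidence peaks when the input matches a preferred pattern.
    target = [1.0, 0.0, 1.0]
    return -sum((a - b) ** 2 for a, b in zip(x, target))

def assistive_signal(x, budget=0.3, steps=500, seed=0):
    rng = random.Random(seed)
    delta = [0.0] * len(x)
    best = confidence(x)
    for _ in range(steps):
        cand = [max(-budget, min(budget, d + rng.uniform(-0.05, 0.05)))
                for d in delta]
        score = confidence([a + b for a, b in zip(x, cand)])
        if score > best:          # keep only perturbations that help
            best, delta = score, cand
    return delta, best

x = [0.8, 0.2, 0.7]
delta, boosted = assistive_signal(x)  # boosted >= confidence(x) by construction
```

The contrast with adversarial attacks is only the sign of the objective: the same bounded-perturbation machinery maximizes confidence instead of minimizing it.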
Details
ISBN: (Print) 9781665448994
Neural network quantization achieves high compression rates using a fixed low bit-width representation of weights and activations while maintaining the accuracy of the high-precision original network. Mixed-precision (per-layer bit-width) quantization, however, requires careful tuning to maintain accuracy while achieving further compression and higher granularity than fixed-precision quantization. Previous mixed-precision methods either rely on expensive search techniques such as reinforcement learning (RL) or on end-to-end optimization that offers little interpretation of the resulting quantization configuration. We propose an accuracy-aware criterion to rank layers by importance. Our method applies imprinting per layer, which acts as an efficient proxy module for accuracy estimation. We rank the layers based on the accuracy gain over previous modules and iteratively quantize first those with the smallest accuracy gain. Our method is a one-shot, efficient, accuracy-aware estimation and thus yields better interpretability of the selected bit-width configuration.
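Once per-layer accuracy gains are available, the bit-width assignment itself is a simple ranking. The sketch below assumes the gains have already been estimated (in the paper, via imprinting proxies); the layer names, gain values, and bit-widths are illustrative only.

```python
# Toy sketch of accuracy-gain-based mixed-precision assignment: layers
# contributing the least accuracy gain are quantized to low precision first.

def assign_bitwidths(accuracy_gain, low_bits=4, high_bits=8, low_fraction=0.5):
    order = sorted(accuracy_gain, key=accuracy_gain.get)   # least gain first
    n_low = int(len(order) * low_fraction)
    return {layer: (low_bits if i < n_low else high_bits)
            for i, layer in enumerate(order)}

gains = {"conv1": 0.20, "conv2": 0.02, "conv3": 0.05, "fc": 0.30}
config = assign_bitwidths(gains)
# conv2 and conv3 contribute least accuracy gain -> 4 bits;
# conv1 and fc are kept at 8 bits.
```

Because the ranking is explicit, the resulting configuration is directly inspectable: each bit-width choice traces back to a measured accuracy-gain number, which is the interpretability claim of the abstract.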
Details
ISBN: (Print) 9781665448994
There are many labelled datasets relating to land cover and crop type mapping that cover diverse geographies, agroecologies and land uses. However, these labels are often extremely sparse, particularly in low- and middle-income regions, with as few as tens of examples for certain crop types. This makes it challenging to train supervised machine learning models to detect specific crops in satellite observations of these regions. We investigate the utility of model-agnostic meta-learning (MAML) to learn from diverse global datasets and improve performance in data-sparse regions. We find that in a variety of countries (Togo, Kenya and Brazil) and across a variety of tasks (crop type mapping, crop vs. non-crop mapping), MAML improves performance compared to pretrained and random initial weights. We also investigate the utility of MAML for different target data-size regimes. We find MAML outperforms other methods for a wide range of training set sizes and positive to negative label ratios, indicating its general suitability for land use and crop type mapping.
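The MAML algorithm the abstract relies on can be shown on a toy 1-D regression family: each "task" fits its own slope, and the meta-parameter is a shared initialisation adapted by one inner gradient step per task. This sketch is first-order MAML on fabricated tasks and has no relation to the paper's satellite data beyond the algorithm itself.

```python
# Toy sketch of (first-order) MAML: each task is fitting y = w * x for its
# own slope w; the meta-update moves a shared initialisation so that one
# inner gradient step adapts well to every task.

def task_grad(w, task_w, xs=(1.0, 2.0)):
    # Gradient of the mean squared error of y = w*x against y = task_w*x.
    return sum(2 * (w * x - task_w * x) * x for x in xs) / len(xs)

def maml(tasks, meta_w=0.0, inner_lr=0.1, outer_lr=0.05, epochs=100):
    for _ in range(epochs):
        meta_grad = 0.0
        for task_w in tasks:
            adapted = meta_w - inner_lr * task_grad(meta_w, task_w)  # inner step
            # First-order approximation: outer gradient at the adapted params.
            meta_grad += task_grad(adapted, task_w)
        meta_w -= outer_lr * meta_grad / len(tasks)                  # outer step
    return meta_w

init = maml(tasks=[0.5, 1.5])  # converges near 1.0, the tasks' mean slope
```

The learned initialisation sits where one gradient step reaches any task quickly, which is exactly the property the paper exploits for data-sparse crop-mapping regions.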
Details
ISBN: (Print) 9781665448994
Depth guided any-to-any image relighting aims to generate a relit image from an original image and its depth map so as to match the illumination setting of a given guide image and its depth map. To the best of our knowledge, this task is a new challenge that has not been addressed in the previous literature. To address it, we propose a deep encoder-decoder network with a single-stream structure, called S3Net, for depth guided image relighting. We concatenate all input images and their corresponding depth maps and feed them into the model. The decoder contains an attention module and an enhancement module that focus on the relighting-related regions of the guide images. Experiments on a challenging benchmark show that the proposed model achieves the 3rd-highest SSIM in the NTIRE 2021 Depth Guided Any-to-any Relighting Challenge.
Details
ISBN: (Digital) 9781665469463
ISBN: (Print) 9781665469463
The objective of this paper is a temporal alignment network that ingests long-term video sequences and associated text sentences in order to: (1) determine if a sentence is alignable with the video; and (2) if it is alignable, determine its alignment. The challenge is to train such networks from large-scale datasets, such as HowTo100M, where the associated text sentences have significant noise and are only weakly aligned when relevant. Apart from proposing the alignment network, we make four contributions: (i) we describe a novel co-training method that makes it possible to denoise and train on raw instructional videos without manual annotation, despite the considerable noise; (ii) to benchmark alignment performance, we manually curate a 10-hour subset of HowTo100M, totalling 80 videos, with sparse temporal descriptions; our proposed model, trained on HowTo100M, outperforms strong baselines (CLIP, MIL-NCE) on this alignment dataset by a significant margin; (iii) we apply the trained model in zero-shot settings to multiple downstream video understanding tasks and achieve state-of-the-art results, including text-video retrieval on YouCook2 and weakly supervised video action segmentation on Breakfast-Action; and (iv) we use the automatically aligned HowTo100M annotations for end-to-end finetuning of the backbone model and obtain improved performance on downstream action recognition tasks.
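The two decisions the network makes, alignability and alignment, can be sketched with toy embeddings: score a sentence against per-second video features, declare it alignable if the peak windowed similarity clears a threshold, and align it to the best-scoring window. The embeddings, window size, and threshold below are all fabricated for illustration.

```python
# Toy sketch of the alignability + alignment decisions over a video
# represented as per-second feature vectors.

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def align_sentence(sent_emb, video_embs, window=2, threshold=1.0):
    scores = [sum(dot(sent_emb, video_embs[t + k]) for k in range(window))
              for t in range(len(video_embs) - window + 1)]
    best = max(scores)
    if best < threshold:
        return None                       # decision (1): not alignable
    start = scores.index(best)
    return (start, start + window)        # decision (2): aligned window (seconds)

video = [[0.1, 0.0], [0.9, 0.1], [0.8, 0.0], [0.0, 0.9]]
span = align_sentence([1.0, 0.0], video)       # aligns to seconds 1-3
miss = align_sentence([0.0, -1.0], video)      # below threshold -> None
```

Separating the alignable/not-alignable decision from localization is what lets the model cope with HowTo100M narration that is often irrelevant to the visible content.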