检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

6,421 篇 会议
25 篇 期刊文献
3 册 图书

馆藏范围

6,448 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

3,849 篇 工学
- 3,647 篇 计算机科学与技术...
- 1,431 篇 软件工程
- 790 篇 光学工程
- 302 篇 信息与通信工程
- 242 篇 控制科学与工程
- 219 篇 电气工程
- 201 篇 机械工程
- 80 篇 生物医学工程（可授...
- 68 篇 生物工程
- 67 篇 电子科学与技术（可...
- 64 篇 仪器科学与技术
- 36 篇 建筑学
- 33 篇 力学（可授工学、理...
- 33 篇 土木工程
- 33 篇 航空宇航科学与技...
- 26 篇 安全科学与工程
- 22 篇 交通运输工程
- 20 篇 材料科学与工程（可...
- 18 篇 化学工程与技术
1,453 篇 理学
- 945 篇 物理学
- 890 篇 数学
- 352 篇 统计学（可授理学、...
- 134 篇 生物学
- 38 篇 系统科学
- 23 篇 化学
160 篇 管理学
- 110 篇 图书情报与档案管...
- 52 篇 管理科学与工程(可...
- 25 篇 工商管理
112 篇 医学
- 112 篇 临床医学
17 篇 法学
- 17 篇 社会学
12 篇 农学
8 篇 教育学
7 篇 艺术学
6 篇 经济学
2 篇 军事学

主题

2,288 篇 computer vision
789 篇 pattern recognit...
637 篇 cameras
629 篇 computer science
568 篇 face recognition
555 篇 layout
510 篇 image segmentati...
509 篇 conferences
498 篇 shape
445 篇 robustness
439 篇 object recogniti...
388 篇 humans
332 篇 feature extracti...
321 篇 training
303 篇 object detection
262 篇 image recognitio...
257 篇 application soft...
246 篇 lighting
238 篇 image reconstruc...
237 篇 computational mo...

机构

41 篇 microsoft resear...
26 篇 department of co...
21 篇 swiss fed inst t...
21 篇 school of comput...
20 篇 department of co...
19 篇 swiss fed inst t...
19 篇 carnegie mellon ...
18 篇 department of co...
17 篇 department of in...
17 篇 the robotics ins...
17 篇 institute of com...
16 篇 univ sci & techn...
16 篇 robotics institu...
15 篇 tsinghua univ pe...
14 篇 department of el...
14 篇 school of comput...
14 篇 school of comput...
13 篇 univ maryland co...
13 篇 microsoft resear...
13 篇 microsoft resear...

作者

39 篇 timofte radu
28 篇 s.k. nayar
24 篇 huang thomas s.
23 篇 xiaoou tang
22 篇 t. kanade
20 篇 t.s. huang
19 篇 van gool luc
19 篇 t. darrell
19 篇 chellappa rama
18 篇 nayar shree k.
17 篇 a.k. jain
17 篇 a. zisserman
17 篇 jain anil k.
16 篇 g. healey
16 篇 torralba antonio
16 篇 heung-yeung shum
16 篇 zisserman andrew
16 篇 l. van gool
15 篇 m. shah
15 篇 ji qiang

语言

6,447 篇 英文
2 篇 其他

检索条件"任意字段=1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1992"

共 6449 条记录，以下是381-390 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings

A Modular Multimodal Architecture for Gaze Target Prediction...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Gupta, Anshul Tafasca, Samy Odobez, Jean-Marc Idiap Res Inst Martigny Switzerland Ecole Polytech Fed Lausanne Lausanne Switzerland

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Predicting where a person is looking is a complex task, requiring to understand not only the person's gaze and scene content, but also the 3D scene structure and the person's situation (are they manipulating? interacting or observing others? attentive?) to detect obstructions in the line of sight or apply attention priors that humans typically have when observing others. In this paper, we hypothesize that identifying and leveraging such priors can be better achieved through the exploitation of explicitly derived multimodal cues such as depth and pose. We thus propose a modular multimodal architecture allowing to combine these cues using an attention mechanism. The architecture can naturally be exploited in privacy-sensitive situations such as surveillance and health, where personally identifiable information cannot be released. We perform extensive experiments on the GazeFollow and VideoAttentionTarget public datasets, obtaining state-of-the-art performance and demonstrating very competitive results in the privacy setting case. (1)

关键词： Three-dimensional displays Fuses Surveillance computer architecture Predictive models Skeleton pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Cell Selection-based Data Reduction Pipeline for Whole Slide Image Analysis of Acute Myeloid Leukemia

Cell Selection-based Data Reduction Pipeline for Whole Slide...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Kockwelp, Jacqueline Thiele, Sebastian Kockwelp, Pascal Bartsch, Jannis Schliemann, Christoph Angenendt, Linus Risse, Benjamin Univ Munster Munster Germany Univ Med Ctr Munster Munster Germany

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

computer-aided analyses of cells in Whole Slide Images (WSIs) have become an important topic in digital pathology. Despite the recent success of deep learning in biomedical research, these methods are still difficult to apply to multi-gigabyte WSIs. To overcome this difficulty, a variety of patch-based solutions have been introduced, which however all suffer from certain limitations compared to manual examinations and often fail to meet the specificities of cytological inspections. Here we introduce an alternative scheme which incorporates clinical expertise in the selection process to automatically identify the clinically relevant areas. By using a bone marrow smear dataset containing 22-gigapixel images of 153 patients, we introduce a novel pipeline combining unsupervised and supervised methodologies to gradually select the most appropriate single-cell regions, which are subsequently used in multiple medically crucial Acute Myeloid Leukemia (AML) predictions. Our approach is capable of dealing with a variety of common WSI challenges, massively limits the manual annotation effort, reduces the data by a factor of up to 99.9% and achieves super-human performance on the final cytological prediction tasks.

关键词： Deep learning Pathology computer vision Image analysis conferences Pipelines Manuals

来源：评论

学校读者我要写书评

暂无评论

Optimising rPPG Signal Extraction by Exploiting Facial Surface Orientation

Optimising rPPG Signal Extraction by Exploiting Facial Surfa...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Wong, Kwan Long Chin, Jing Wei Chan, Tsz Tai Odinaev, Ismoil Suhartono, Kristian Kang Tianqu So, Richard H. Y. Hong Kong Univ Sci & Technol Hong Kong Peoples R China PanopticAI Ltd Hong Kong Peoples R China

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Remote photoplethysmography (rPPG) is a contactless method to measure human vital signs by detecting subtle skin color changes through a camera. Although many studies have used region of interest (ROI) selection tools to improve rPPG signal extraction, no study has investigated the influence of the ROI's surface orientation. We propose a novel 'angle map' representation of the face to study the effects of the surface orientation on the extracted rPPG signal. The angle map is generated by mapping each facial pixel to an angle of reflection (angle between the skin surface and the camera) calculated from the surface normal of the facial landmarks and the camera axis. Our results show that surface orientation significantly affects the correlation between the extracted rPPG signal and ground truth blood volume pulse (BVP). Regions with small angles of reflection contained stronger signals, which explains why areas near the cheeks and forehead are often chosen for rPPG signal extraction. Moreover, we applied a thresholding method to the angle map and demonstrated its potential for dynamic ROI selection, thereby optimising the rPPG signal extraction process.

关键词： computer vision Forehead Correlation Face recognition conferences Cameras Photoplethysmography

来源：评论

学校读者我要写书评

暂无评论

Cross-modal Target Retrieval for Tracking by Natural Language

Cross-modal Target Retrieval for Tracking by Natural Languag...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Li, Yihao Yu, Jun Cai, Zhongpeng Pan, Yuwen Univ Sci & Technol China Hefei Anhui Peoples R China

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Tracking by natural language specification in a video is a challenging task in computer vision. Distinct from initializing the target state only by the bounding box in the first frame, language specification has a strong potential to assist visual object trackers to capture appearance variation and eliminate semantic ambiguity of the tracked object. In this paper, we carefully design a unified local-global-search framework from the perspective of cross-modal retrieval, including a local tracker, an adaptive retrieval switch module, and a target-specific retrieval module. The adaptive retrieval switch module aligns semantics from the visual signal and the lingual description of the target using three sub-modules, i.e., object-aware attention memory, part-aware cross-attention, and vision-language contrast, which achieve an automatic switch between local search and global search. When booting the global search mechanism, the target-specific retrieval module relocalizes the missing target in the image-wide range via an efficient vision-language guided proposal selector and target-text match. Numerous experimental results on three prevailing benchmarks show the effectiveness and generalization of our framework.

关键词： Visualization computer vision Target tracking Natural languages Semantics Switches Benchmark testing

来源：评论

学校读者我要写书评

暂无评论

An Effective Framework of Multi-Class Product Counting and recognition for Automated Retail Checkout

An Effective Framework of Multi-Class Product Counting and R...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Wan, Junfeng Qian, Shuhao Tian, Zihan Zhao, Yanyun Beijing Univ Posts & Telecommun Beijing Peoples R China Beijing Key Lab Network Syst & Network Culture Beijing Peoples R China

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

As the field of computer vision grows, Automated Retail Checkout has become a highly anticipated development goal. The key of this task is to improve the accuracy rate. If there is an error, it will bring serious losses to the business and awful experience for customers which is not our expected. This competition gives us an opportunity to simulate check-out in a real world scenario, so that we can identify problems and solve them, not only for the competition, but also for the practical application. As one of the participating teams in this task, we pursue the goal of avoiding misdetection and misclassification, and build a complete set of framework to achieve high-precise, high-recall performance. In addition, there is an excessive difference between the training data and test data. How to use limited data to make up for the differences in this part is also one of the highlights of our framework. In general, our framework consists of three main parts. Firstly, the Pre-Processing module to make up for the differences between training and test data. The DTC module completes the overall process of automatic recognition. Finally the MTCR module is proposed to post-process the output of the DTC module. On the TestA data of AICITY2022 Task 4, we have achieved significant result compared to the other teams. Finally, our model is ranked 1st in AICITY2022 Task 4 [17].

关键词： Training computer vision Codes conferences Training data Trajectory pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Multi-encoder Network for Parameter Reduction of a Kernel-based Interpolation Architecture

Multi-encoder Network for Parameter Reduction of a Kernel-ba...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Khalifeh, Issa Blanch, Marc Gorriz Izquierdo, Ebroul Mrak, Marta British Broadcasting Corp London W12 7TQ England Queen Mary Univ London London E1 4NS England

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Video frame interpolation involves the synthesis of new frames from existing ones. Convolutional neural networks (CNNs) have been at the forefront of the recent advances in this field. One popular CNN-based approach involves the application of generated kernels to the input frames to obtain an interpolated frame. Despite all the benefits interpolation methods offer, many of these networks require a lot of parameters, with more parameters meaning a heavier computational burden. Reducing the size of the model typically impacts performance negatively. This paper presents a method for parameter reduction for a popular flow-less kernel-based network (Adaptive Collaboration of Flows). Through our technique of removing the layers that require the most parameters and replacing them with smaller encoders, we reduce the number of parameters of the network and even achieve better performance compared to the original method. This is achieved by deploying rotation to force each individual encoder to learn different features from the input images. Ablations are conducted to justify design choices and an evaluation on how our method performs on full-length videos is presented.

关键词： Training Interpolation computer vision conferences Force computer architecture pattern recognition

来源：评论

学校读者我要写书评

暂无评论

HR-STAN: High-Resolution Spatio-Temporal Attention Network for 3D Human Motion Prediction

HR-STAN: High-Resolution Spatio-Temporal Attention Network f...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Medjaouri, Omar Desai, Kevin Univ Texas San Antonio San Antonio TX 78249 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

3D human motion prediction requires making sense of the complex spatio-temporal dynamics which underpin human motion to make highly accurate predictions. Part of this complexity is due to the trade-off between long-term (>400ms) and short-term predictions (<400ms) which require different levels of granularity to observe patterns. Several works have explored methods of improving long-term prediction performance by utilizing longer motion histories but this typically comes at the cost of very short-term (<200ms) performance. Inspired by high-resolution network architectures, we propose a novel high-resolution spatio-temporal attention network (HR-STAN) which leverages parallel feature branches and dilated convolutions to observe human motion at different scales. Furthermore, we augment this architecture with split spatial and temporal attention mechanisms to efficiently capture spatio-temporal dependencies within a given motion. We evaluate the ability of our HR-STAN architecture at incorporating long-term motion histories while producing short-term predictions and show that it improves over several state-of-the-art methods on both the AMASS and Human3.6M benchmarks.

关键词： computer vision Three-dimensional displays Costs conferences Dynamics computer architecture Predictive models

来源：评论

学校读者我要写书评

暂无评论

Watch and Act: Dual Interacting Agents for Automatic Generation of Possession Statistics in Soccer

Watch and Act: Dual Interacting Agents for Automatic Generat...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Sarkar, Saikat Mukherjee, Dipti Prasad Chakrabarti, Amlan Univ Calcutta Kolkata India Indian Stat Inst Kolkata India

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Pass localization and team identification are two primary tasks for pass-count based possession statistics generation of a soccer match. While the existing works perform these two tasks separately, we propose dual interacting reinforcement learning agents to jointly perform these tasks. The proposed model has a localization agent, that decides which direction to move a temporal window to localize a pass. On the other hand, there is an identification agent that decides if the temporal window contains a pass for team-A (or team-B), or the localization agent needs to readjust the temporal window further. In this multi-agent setup, an agent may communicate by sharing some message to guide the other agent to achieve its task. To achieve this inter-agent communication, we extend the Dueling DQN architecture and share the value of a state as a message to the other agent. Two agents watch, act independently and cooperate with each other in order to detect a valid pass in a soccer video. A novel reward function is proposed that helps the agents to learn the optimal policy. Experiments performed on online videos show that our method is 3% better at localization of pass than the competitive methods.

关键词： Location awareness computer vision conferences Reinforcement learning Games computer architecture Real-time systems

来源：评论

学校读者我要写书评

暂无评论

Classification of Facial Expression In-the-Wild based on Ensemble of Multi-head Cross Attention Networks

Classification of Facial Expression In-the-Wild based on Ens...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Jeong, Jae Yeop Hong, Yeong-Gi Kim, Daun Jeong, Jin-Woo Jung, Yuchul Kim, Sang-Ho Seoul Natl Univ Sci & Technol Dept Data Sci Gongreung Ro 232 Seoul South Korea Kumoh Natl Inst Technol Daehak Ro 61 Gumi South Korea

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

How to build a system for robust classification and recognition of facial expressions has been one of the most important research issues for successful interactive computing applications. However, previous datasets and studies mainly focused on facial expression recognition in a controlled/lab setting, therefore, could hardly be generalized in a more practical and real-life environment. The Affective Behavior Analysis in-the-wild (ABAW) 2022 competition released a dataset consisting of various video clips of facial expressions in-the-wild. In this paper, we propose a method based on the ensemble of multi-head cross attention networks to address the facial expression classification task introduced in the ABAW 2022 competition. We built a uni-task approach for this task, achieving the average F1-score of 34.60 on the validation set and 33.77 on the test set, ranking second place on the final leaderboard.

关键词： Gold computer vision Face recognition conferences Estimation Multitasking Behavioral sciences

来源：评论

学校读者我要写书评

暂无评论

A Two-Stage Shake-Shake Network for Long-Tailed recognition of SAR Aerial View Objects

A Two-Stage Shake-Shake Network for Long-Tailed Recognition ...

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Li, Gongzhe Pan, Linpeng Qiu, Linwei Tan, Zhiwen Xie, Fengying Zhang, Haopeng Beihang Univ Beijing Peoples R China

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Synthetic Aperture Radar (SAR) has received more attention due to its complementary superiority on capturing significant information in the remote sensing area. However, for an Aerial View Object Classification (AVOC) task, SAR images still suffer from the long-tailed distribution of the aerial view objects. This disparity limit the performance of classification methods, especially for the data-sensitive deep learning models. In this paper, we propose a two-stage shake-shake network to tackle the long-tailed learning problem. Specifically, it decouples the learning procedure into the representation learning stage and the classification learning stage. Moreover, we apply the test time augmentation (TTA) and the classification with alternating normalization (CAN) to improve the accuracy. In the PBVS 1 2022 Multi-modal Aerial View Object Classification Challenge Track 1, our method achieves 21.82% and 27.97% accuracy in the development phase and testing phase respectively, which wins the top-tier among all the participants.

关键词： Training Representation learning Feature extraction Radar tracking Radar polarimetry pattern recognition Task analysis

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 35 36 37 38 39 40 41 42 43 44 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：