Search Results - Inner Mongolia University Library

Search query: "Any field = IEEE-Computer-Society Conference on Computer Vision and Pattern Recognition Workshops"

8,962 records in total; showing records 331-340.

Cross-modal Target Retrieval for Tracking by Natural Language

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Li, Yihao; Yu, Jun; Cai, Zhongpeng; Pan, Yuwen

Affiliations: Univ Sci & Technol China Hefei Anhui Peoples R China

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

Tracking by natural language specification in a video is a challenging task in computer vision. Distinct from initializing the target state only by the bounding box in the first frame, language specification has a strong potential to assist visual object trackers to capture appearance variation and eliminate semantic ambiguity of the tracked object. In this paper, we carefully design a unified local-global-search framework from the perspective of cross-modal retrieval, including a local tracker, an adaptive retrieval switch module, and a target-specific retrieval module. The adaptive retrieval switch module aligns semantics from the visual signal and the lingual description of the target using three sub-modules, i.e., object-aware attention memory, part-aware cross-attention, and vision-language contrast, which achieve an automatic switch between local search and global search. When booting the global search mechanism, the target-specific retrieval module relocalizes the missing target in the image-wide range via an efficient vision-language guided proposal selector and target-text match. Numerous experimental results on three prevailing benchmarks show the effectiveness and generalization of our framework.

Keywords: Visualization; Computer vision; Target tracking; Natural languages; Semantics; Switches; Benchmark testing

Multi-encoder Network for Parameter Reduction of a Kernel-based Interpolation Architecture

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Khalifeh, Issa; Blanch, Marc Gorriz; Izquierdo, Ebroul; Mrak, Marta

Affiliations: British Broadcasting Corp London W12 7TQ England; Queen Mary Univ London London E1 4NS England

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

Video frame interpolation involves the synthesis of new frames from existing ones. Convolutional neural networks (CNNs) have been at the forefront of the recent advances in this field. One popular CNN-based approach involves the application of generated kernels to the input frames to obtain an interpolated frame. Despite all the benefits interpolation methods offer, many of these networks require a lot of parameters, with more parameters meaning a heavier computational burden. Reducing the size of the model typically impacts performance negatively. This paper presents a method for parameter reduction for a popular flow-less kernel-based network (Adaptive Collaboration of Flows). Through our technique of removing the layers that require the most parameters and replacing them with smaller encoders, we reduce the number of parameters of the network and even achieve better performance compared to the original method. This is achieved by deploying rotation to force each individual encoder to learn different features from the input images. Ablations are conducted to justify design choices and an evaluation on how our method performs on full-length videos is presented.

Keywords: Training; Interpolation; Computer vision; Conferences; Force; Computer architecture; Pattern recognition

Watch and Act: Dual Interacting Agents for Automatic Generation of Possession Statistics in Soccer

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Sarkar, Saikat; Mukherjee, Dipti Prasad; Chakrabarti, Amlan

Affiliations: Univ Calcutta Kolkata India; Indian Stat Inst Kolkata India

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

Pass localization and team identification are two primary tasks for pass-count based possession statistics generation of a soccer match. While the existing works perform these two tasks separately, we propose dual interacting reinforcement learning agents to jointly perform these tasks. The proposed model has a localization agent that decides in which direction to move a temporal window to localize a pass. There is also an identification agent that decides whether the temporal window contains a pass for team-A (or team-B), or whether the localization agent needs to readjust the temporal window further. In this multi-agent setup, an agent may communicate by sharing a message to guide the other agent to achieve its task. To achieve this inter-agent communication, we extend the Dueling DQN architecture and share the value of a state as a message to the other agent. The two agents watch, act independently, and cooperate with each other in order to detect a valid pass in a soccer video. A novel reward function is proposed that helps the agents learn the optimal policy. Experiments performed on online videos show that our method is 3% better at pass localization than competing methods.

Keywords: Location awareness; Computer vision; Conferences; Reinforcement learning; Games; Computer architecture; Real-time systems

Coarse-to-Fine Cascaded Networks with Smooth Predicting for Video Facial Expression Recognition

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Xue, Fanglei; Tan, Zichang; Zhu, Yu; Ma, Zhongsong; Guo, Guodong

Affiliations: Univ Chinese Acad Sci Beijing Peoples R China; Chinese Acad Sci Technol & Engn Ctr Space Utilizat Key Lab Space Utilizat Beijing Peoples R China; Baidu Res Inst Deep Learning Beijing Peoples R China; Natl Engn Lab Deep Learning Technol & Applicat Beijing Peoples R China

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

Facial expression recognition plays an important role in human-computer interaction. In this paper, we propose the Coarse-to-Fine Cascaded network with Smooth Predicting (CFC-SP) to improve the performance of facial expression recognition. CFC-SP contains two core components, namely Coarse-to-Fine Cascaded networks (CFC) and Smooth Predicting (SP). CFC first groups several similar emotions to form a rough category, and then employs a network to conduct a coarse but accurate classification. An additional network for these grouped emotions is then used to obtain fine-grained predictions. SP improves the recognition capability of the model by capturing both universal and unique expression features. Specifically, the universal features denote the general characteristics of facial emotions within a period, and the unique features denote the specific characteristics at a given moment. Experiments on Aff-Wild2 show the effectiveness of the proposed CFC-SP. We achieved 3rd place in the Expression Classification Challenge of the 3rd Competition on Affective Behavior Analysis in-the-wild. The code will be released at https://***/BR-IDL/PaddleViT.

Keywords: Human computer interaction; Computer vision; Codes; Face recognition; Conferences; Behavioral sciences

Cell Selection-based Data Reduction Pipeline for Whole Slide Image Analysis of Acute Myeloid Leukemia

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Kockwelp, Jacqueline; Thiele, Sebastian; Kockwelp, Pascal; Bartsch, Jannis; Schliemann, Christoph; Angenendt, Linus; Risse, Benjamin

Affiliations: Univ Munster Munster Germany; Univ Med Ctr Munster Munster Germany

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

Computer-aided analyses of cells in Whole Slide Images (WSIs) have become an important topic in digital pathology. Despite the recent success of deep learning in biomedical research, these methods are still difficult to apply to multi-gigabyte WSIs. To overcome this difficulty, a variety of patch-based solutions have been introduced, all of which, however, suffer from certain limitations compared to manual examinations and often fail to meet the specificities of cytological inspections. Here we introduce an alternative scheme which incorporates clinical expertise in the selection process to automatically identify the clinically relevant areas. By using a bone marrow smear dataset containing 22-gigapixel images of 153 patients, we introduce a novel pipeline combining unsupervised and supervised methodologies to gradually select the most appropriate single-cell regions, which are subsequently used in multiple medically crucial Acute Myeloid Leukemia (AML) predictions. Our approach is capable of dealing with a variety of common WSI challenges, massively limits the manual annotation effort, reduces the data by a factor of up to 99.9% and achieves super-human performance on the final cytological prediction tasks.

Keywords: Deep learning; Pathology; Computer vision; Image analysis; Conferences; Pipelines; Manuals

Material Swapping for 3D Scenes using a Learnt Material Similarity Measure

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Perroni-Scharf, Maxine; Sunkavalli, Kalyan; Eisenmann, Jonathan; Hold-Geoffroy, Yannick

Affiliations: Princeton Univ Princeton NJ 08544 USA; Adobe Res San Jose CA USA

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

We present a method for augmenting photo-realistic 3D scene assets by automatically recognizing, matching, and swapping their materials. Our method proposes a material matching pipeline for the efficient replacement of unknown materials with perceptually similar PBR materials from a database, enabling the quick creation of many variations of a given 3D synthetic scene. At the heart of this method is a novel material similarity feature that is learnt, in conjunction with optimal lighting conditions, by fine-tuning a deep neural network on a material classification task using our proposed dataset. Our evaluation demonstrates that lighting optimization improves CNN-based texture feature extraction methods and better estimates material properties. We conduct a series of experiments showing our method's ability to augment photo-realistic indoor scenes using both standard and procedurally generated PBR materials.

Keywords: Training; Three-dimensional displays; Pipelines; Neural networks; Lighting; Feature extraction; Rendering (computer graphics)

Three Stream Graph Attention Network using Dynamic Patch Selection for the classification of micro-expressions

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Jain, Ankith; Kumar, Rakesh; Bhanu, Bir

Affiliations: Univ Calif Riverside Dept Elect & Comp Engn Riverside CA 92521 USA

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

To understand the genuine emotions expressed by humans during social interactions, it is necessary to recognize the subtle changes on the face (micro-expressions) demonstrated by an individual. Facial micro-expressions are brief, rapid, spontaneous gestures and non-voluntary facial muscle movements beneath the skin. Therefore, it is a challenging task to classify facial micro-expressions. This paper presents a novel end-to-end three-stream graph attention network model to capture the subtle changes on the face and recognize micro-expressions (MEs) by exploiting the relationship between optical flow magnitude, optical flow direction, and node location features. A facial graph representational structure is used to extract the spatial and temporal information using the three frames. The varying dynamic patch size of optical flow features is used to extract the local texture information across each landmark point. The network utilizes only the landmark point location features and the optical flow information across these points, and generates good results for the classification of MEs. A comprehensive evaluation on the SAMM and CASME II datasets demonstrates the high efficacy, efficiency, and generalizability of the proposed approach, which achieves better results than state-of-the-art methods.

Keywords: Emotion recognition; Computer vision; Face recognition; Conferences; Feature extraction; Skin; Facial muscles

Classification of Facial Expression In-the-Wild based on Ensemble of Multi-head Cross Attention Networks

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Jeong, Jae Yeop; Hong, Yeong-Gi; Kim, Daun; Jeong, Jin-Woo; Jung, Yuchul; Kim, Sang-Ho

Affiliations: Seoul Natl Univ Sci & Technol Dept Data Sci Gongreung Ro 232 Seoul South Korea; Kumoh Natl Inst Technol Daehak Ro 61 Gumi South Korea

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

How to build a system for robust classification and recognition of facial expressions has been one of the most important research issues for successful interactive computing applications. However, previous datasets and studies mainly focused on facial expression recognition in a controlled/lab setting and therefore could hardly be generalized to more practical, real-life environments. The Affective Behavior Analysis in-the-wild (ABAW) 2022 competition released a dataset consisting of various video clips of facial expressions in-the-wild. In this paper, we propose a method based on an ensemble of multi-head cross attention networks to address the facial expression classification task introduced in the ABAW 2022 competition. We built a uni-task approach for this task, achieving an average F1-score of 34.60 on the validation set and 33.77 on the test set, ranking second on the final leaderboard.

Keywords: Gold; Computer vision; Face recognition; Conferences; Estimation; Multitasking; Behavioral sciences

CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023

Authors: Parelli, Maria; Delitzas, Alexandros; Hars, Nikolas; Vlassis, Georgios; Anagnostidis, Sotirios; Bachmann, Gregor; Hofmann, Thomas

Affiliations: ETH Zurich Switzerland

ISBN (print): 9798350302493

Training models to apply linguistic knowledge and visual concepts from 2D images to 3D world understanding is a promising direction that researchers have only recently started to explore. In this work, we design a novel 3D pre-training Vision-Language method that helps a model learn semantically meaningful and transferable 3D scene point cloud representations. We inject the representational power of the popular CLIP model into our 3D encoder by aligning the encoded 3D scene features with the corresponding 2D image and text embeddings produced by CLIP. To assess our model's 3D world reasoning capability, we evaluate it on the downstream task of 3D Visual Question Answering. Experimental quantitative and qualitative results show that our pre-training method outperforms state-of-the-art works in this task and leads to an interpretable representation of 3D scene features. © 2023 IEEE.

Keywords: Computer vision

Facial Expression Classification using Fusion of Deep Neural Network in Video

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Kim Ngan Phan; Hong-Hai Nguyen; Van-Thong Huynh; Kim, Soo-Hyung

Affiliations: Chonnam Natl Univ Dept Artificial Intelligence Convergence Gwangju South Korea

ISBN (digital): 9781665487399

ISBN (print): 9781665487399

For computers to recognize human emotions, expression classification is an equally important problem in the human-computer interaction area. In the 3rd Affective Behavior Analysis In-The-Wild competition, the task of expression classification includes eight classes with six basic expressions of human faces from videos. In this paper, we employ a transformer mechanism to encode the robust representation from the backbone. Fusion of the robust representations plays an important role in the expression classification task. Our approach achieves F1 scores of 30.35% and 28.60% on the validation set and the test set, respectively. This result shows the effectiveness of the proposed architecture on the Aff-Wild2 dataset, and our team achieved 5th place in the expression classification task of the 3rd Affective Behavior Analysis In-The-Wild competition.

Keywords: Human computer interaction; Emotion recognition; Computational modeling; Neural networks; Transformers; Behavioral sciences; Pattern recognition


Address: 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
