ISBN (digital): 9798350365474
ISBN (print): 9798350365481
This paper presents RallyTemPose, a transformer encoder-decoder model for predicting future badminton strokes based on previous rally actions. The model uses court position, skeleton poses, and player-specific embeddings to learn stroke- and player-specific latent representations in a spatiotemporal encoder module. The representations are then used to condition the subsequent strokes in a decoder module through rally-aware fusion blocks, which provide additional strategic and technical context for more informed predictions. RallyTemPose shows improved forecasting accuracy compared to traditional sequential methods on two real-world badminton datasets. The performance boost can also be attributed to the inclusion of improved stroke embeddings extracted from the latent representation of a pre-trained large language model fed detailed text descriptions of the strokes. In the discussion, the latent representations learned by the encoder module show useful properties for player analysis and comparison. The code can be found at: this https URL.
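The encoder-decoder structure described above lends itself to a compact sketch. The snippet below is a minimal, illustrative stroke forecaster in PyTorch; the stroke-type count, player count, and feature dimensions are assumptions not given in the abstract, and the rally-aware fusion blocks are reduced here to the decoder's standard cross-attention, so this is not the authors' implementation.

```python
import torch
import torch.nn as nn

class StrokeForecaster(nn.Module):
    """Illustrative encoder-decoder stroke forecaster (dimensions assumed)."""
    def __init__(self, n_stroke_types=35, n_players=100, d_model=128,
                 pose_dim=17 * 2, court_dim=2, n_heads=4, n_layers=2):
        super().__init__()
        # Per-stroke context: court position + flattened skeleton pose.
        self.input_proj = nn.Linear(court_dim + pose_dim, d_model)
        self.player_emb = nn.Embedding(n_players, d_model)
        self.stroke_emb = nn.Embedding(n_stroke_types, d_model)
        enc_layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        dec_layer = nn.TransformerDecoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, n_layers)
        self.decoder = nn.TransformerDecoder(dec_layer, n_layers)
        self.head = nn.Linear(d_model, n_stroke_types)

    def forward(self, court, pose, player_id, past_strokes):
        # court: (B, T, 2), pose: (B, T, 34), player_id/past_strokes: (B, T)
        ctx = self.input_proj(torch.cat([court, pose], dim=-1))
        ctx = ctx + self.player_emb(player_id)       # player-specific conditioning
        memory = self.encoder(ctx)                   # rally-level latent representation
        tgt = self.stroke_emb(past_strokes)          # previously observed strokes
        T = tgt.size(1)
        causal = torch.triu(torch.full((T, T), float("-inf")), diagonal=1)
        out = self.decoder(tgt, memory, tgt_mask=causal)
        return self.head(out)                        # logits over next stroke types

model = StrokeForecaster()
logits = model(torch.rand(2, 8, 2), torch.rand(2, 8, 34),
               torch.randint(0, 100, (2, 8)), torch.randint(0, 35, (2, 8)))
print(logits.shape)  # torch.Size([2, 8, 35])
```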
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
The large-scale rearing of edible insects, of which Tenebrio molitor is a representative, requires monitoring with vision systems to control the process and to detect anomalies. Solutions previously proposed by researchers relied on multiple modules tied to specific tasks (calculated coefficients) and specific types of models (instance segmentation, semantic segmentation). Long processing times and the difficulty of maintaining and updating such modules motivate the search for a more condensed, end-to-end solution. This paper proposes a modified YOLOv8 architecture extended with additional task-specific heads. The heads were trained on small, problem-oriented datasets, which significantly reduced the time spent on sample annotation. The proposed solution also includes estimation of prediction uncertainty based on the variation among predictions in a model ensemble, and detection of the domain shift phenomenon. Quantitative results from the conducted experiments confirm the potential of the developed solution.
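As an illustration of the ensemble-based uncertainty estimate mentioned above, the sketch below averages predictions across ensemble members and uses their variance as an uncertainty signal; the toy models and the threshold are placeholders, not the paper's architecture or values.

```python
import torch
import torch.nn as nn

def ensemble_predict(models, image, uncertainty_threshold=0.05):
    """Average outputs over an ensemble; disagreement (variance) flags uncertainty."""
    with torch.no_grad():
        preds = torch.stack([m(image) for m in models])   # (n_models, B, n_outputs)
    mean = preds.mean(dim=0)
    var = preds.var(dim=0)                                 # per-output disagreement
    uncertain = var.mean(dim=-1) > uncertainty_threshold   # per-sample flag
    return mean, var, uncertain

# Toy ensemble standing in for the multi-head detection models.
ensemble = [nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 4)) for _ in range(5)]
mean, var, flag = ensemble_predict(ensemble, torch.rand(2, 3, 64, 64))
print(mean.shape, flag)
```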
Structure from motion (SfM) is a fundamental task in computer vision and allows recovering the 3D structure of a stationary scene from an image set. Finding robust and accurate feature matches plays a crucial role in ...
ISBN (print): 9781665448994
The performance of Sign Language Recognition (SLR) systems has improved considerably in recent years. However, several open challenges still need to be solved for SLR to be useful in practice. Research in the field is in its infancy with regard to the robustness of models to a large diversity of signs and signers, and to the fairness of models toward performers from different demographics. This work summarises the ChaLearn LAP Large Scale Signer Independent Isolated SLR Challenge, organised at CVPR 2021 with the goal of overcoming some of the aforementioned challenges. We analyse and discuss the challenge design, the top winning solutions, and suggestions for future research. The challenge attracted 132 participants in the RGB track and 59 in the RGB+Depth track, receiving more than 1.5K submissions in total. Participants were evaluated using a new large-scale multi-modal Turkish Sign Language (AUTSL) dataset, consisting of 226 sign labels and 36,302 isolated sign video samples performed by 43 different signers. Winning teams achieved more than 96% recognition rate, and their approaches benefited from pose/hand/face estimation, transfer learning, external data, fusion/ensemble of modalities, and different strategies to model spatio-temporal information. However, methods still fail to distinguish among very similar signs, in particular those sharing similar hand trajectories.
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
In this paper, we explore the cross-modal adaptation of pre-trained vision Transformers (ViTs) for the audio-visual domain by incorporating a limited set of trainable parameters. To this end, we propose a Spatial-Temporal-Global Cross-Modal Adaptation (STG-CMA) to gradually equip the frozen ViTs with the capability for learning audio-visual representation, consisting of the modality-specific temporal adaptation for temporal reasoning of each modality, the cross-modal spatial adaptation for refining the spatial information with the cue from the counterpart modality, and the cross-modal global adaptation for global interaction between audio and visual modalities. Our STG-CMA presents a meaningful finding: leveraging only the shared pre-trained image model with inserted lightweight adapters is enough for spatial-temporal modeling and feature interaction of the audio-visual modalities. Extensive experiments indicate that our STG-CMA achieves state-of-the-art performance on various audio-visual understanding tasks including AVE, AVS, and AVQA, while requiring significantly fewer tunable parameters. The code is available at https://***/kaiw7/STG-CMA.
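To make the adapter idea concrete, here is a minimal bottleneck-adapter and cross-modal attention sketch in PyTorch; the dimensions, the placement inside the frozen ViT blocks, and any gating used by STG-CMA are assumptions rather than the paper's design.

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Lightweight residual bottleneck trained while the backbone stays frozen."""
    def __init__(self, dim=768, bottleneck=64):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)
        self.act = nn.GELU()

    def forward(self, x):
        return x + self.up(self.act(self.down(x)))

class CrossModalAdapter(nn.Module):
    """Refine one modality's tokens with cues from the counterpart modality."""
    def __init__(self, dim=768, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x, other):
        refined, _ = self.attn(query=x, key=other, value=other)
        return x + refined

audio = torch.rand(2, 49, 768)   # audio tokens (e.g. patchified spectrogram)
video = torch.rand(2, 196, 768)  # visual tokens
video = Adapter()(video)
video = CrossModalAdapter()(video, audio)
print(video.shape)  # torch.Size([2, 196, 768])
```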
Ensuring traffic safety and preventing accidents is a critical goal in daily driving, where advances in computer vision technologies can be leveraged to achieve this goal. In this paper, we present a multi-view, multi-scale framework for naturalistic driving action recognition and localization in untrimmed videos, namely M2DAR, with a particular focus on detecting distracted driving behaviors. Our system features a weight-sharing, multi-scale Transformer-based action recognition network that learns robust hierarchical representations. Furthermore, we propose a new election algorithm consisting of aggregation, filtering, merging, and selection processes to refine the preliminary results from the action recognition module across multiple views. Extensive experiments conducted on the 7th AI City Challenge Track 3 dataset demonstrate the effectiveness of our approach, where we achieved an overlap score of 0.5921 on the A2 test set. Our source code is available at https://***/PurdueDigitalTwin/M2DAR.
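The election algorithm is described only at a high level, so the snippet below is one possible reading of it: aggregate per-view candidate segments, filter by score, merge temporally overlapping segments, and select the highest-scoring segment per action class. The thresholds and candidate format are invented for illustration.

```python
from collections import defaultdict

def elect(candidates, score_thr=0.5, iou_thr=0.5):
    """candidates: list of {"cls", "start", "end", "score", "view"} dicts."""
    by_cls = defaultdict(list)
    for c in candidates:                       # aggregation across views
        if c["score"] >= score_thr:            # filtering
            by_cls[c["cls"]].append(c)
    results = {}
    for cls, segs in by_cls.items():
        segs.sort(key=lambda s: s["start"])
        merged = [dict(segs[0])]
        for s in segs[1:]:                     # merging of overlapping segments
            last = merged[-1]
            inter = min(last["end"], s["end"]) - max(last["start"], s["start"])
            union = max(last["end"], s["end"]) - min(last["start"], s["start"])
            if union > 0 and inter / union >= iou_thr:
                last["end"] = max(last["end"], s["end"])
                last["score"] = max(last["score"], s["score"])
            else:
                merged.append(dict(s))
        results[cls] = max(merged, key=lambda s: s["score"])  # selection
    return results

print(elect([{"cls": 3, "start": 10.0, "end": 14.0, "score": 0.8, "view": "dash"},
             {"cls": 3, "start": 11.0, "end": 15.0, "score": 0.7, "view": "rear"}]))
```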
ISBN (print): 9781665448994
A common problem in the 4D reconstruction of people from multi-view video is the quality of the captured dynamic texture appearance, which depends on both the camera resolution and the capture volume. Typically, the requirement to frame cameras to capture the volume of a dynamic performance (>50 m³) results in the person occupying only a small proportion (<10%) of the field of view. Even with ultra-high-definition 4K video acquisition, this means sampling the person at less than standard-definition 0.5K video resolution, resulting in low-quality rendering. In this paper we propose a solution to this problem through super-resolution appearance transfer from a static high-resolution appearance capture rig using digital stills cameras (>8K) to capture the person in a small volume (<8 m³). A pipeline is proposed for super-resolution appearance transfer from high-resolution static capture to dynamic video performance capture to produce super-resolution dynamic textures. This addresses two key problems: colour mapping between different camera systems; and dynamic texture map super-resolution using a learnt model. Comparative evaluation demonstrates a significant qualitative and quantitative improvement in rendering the 4D performance capture with super-resolution dynamic texture appearance. The proposed approach reproduces the high-resolution detail of the static capture whilst maintaining the appearance dynamics of the captured video.
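As a concrete example of the colour-mapping sub-problem, the sketch below fits a simple affine transform between corresponding colour samples from the two camera systems with least squares; the paper's actual mapping and the learnt super-resolution model are likely more sophisticated.

```python
import numpy as np

def fit_colour_map(src_rgb, dst_rgb):
    """Least-squares affine map (4x3) from source to target colour space."""
    src_h = np.hstack([src_rgb, np.ones((src_rgb.shape[0], 1))])  # (N, 4) homogeneous
    M, *_ = np.linalg.lstsq(src_h, dst_rgb, rcond=None)           # (4, 3)
    return M

def apply_colour_map(M, rgb):
    rgb_h = np.hstack([rgb, np.ones((rgb.shape[0], 1))])
    return np.clip(rgb_h @ M, 0.0, 1.0)

# Corresponding samples from the video capture and the stills rig (synthetic here).
src = np.random.rand(500, 3)
dst = np.clip(src * np.array([1.1, 0.95, 1.05]) + 0.02, 0.0, 1.0)
M = fit_colour_map(src, dst)
print(np.abs(apply_colour_map(M, src) - dst).mean())  # small residual
```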
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Nowadays, deep learning models have reached impressive performance in the task of image generation. A great deal of the literature addresses face generation and editing, with both human and automatic systems struggling to distinguish what is real from what is generated. While most systems achieve excellent visual generation quality, they still face difficulties in preserving the identity of the input subject. Among the explored techniques, Semantic Image Synthesis (SIS) methods, whose goal is to generate an image conditioned on a semantic segmentation mask, are the most promising, even though preserving the perceived identity of the input subject is not their main concern. Therefore, in this paper we investigate the problem of identity preservation in face image generation and present an SIS architecture that exploits a cross-attention mechanism to merge identity, style, and semantic features to generate faces whose identities are as similar as possible to the input ones. Experimental results reveal that the proposed method is not only suitable for preserving the identity but is also effective in face recognition adversarial attacks, i.e. hiding a second identity in the generated faces.
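A cross-attention fusion of identity, style, and semantic features could look like the following minimal PyTorch block; the token counts, dimensions, and exact wiring inside the generator are assumptions rather than the paper's architecture.

```python
import torch
import torch.nn as nn

class IdentityStyleFusion(nn.Module):
    """Semantic-layout tokens attend to identity and style tokens as keys/values."""
    def __init__(self, dim=256, heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, semantic_tokens, identity_tokens, style_tokens):
        kv = torch.cat([identity_tokens, style_tokens], dim=1)
        fused, _ = self.attn(query=semantic_tokens, key=kv, value=kv)
        return self.norm(semantic_tokens + fused)

fusion = IdentityStyleFusion()
out = fusion(torch.rand(1, 64, 256),   # tokens from the segmentation mask
             torch.rand(1, 1, 256),    # identity embedding (e.g. from a face net)
             torch.rand(1, 16, 256))   # style tokens
print(out.shape)  # torch.Size([1, 64, 256])
```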
ISBN (print): 9781665448994
Multi-sensor fusion enhances environment perception and 3D reconstruction in self-driving and robot navigation, and calibration between sensors is the precondition of effective multi-sensor fusion. Traditional calibration techniques for Light Detection and Ranging (LiDAR) and camera involve laborious manual work and complex environment settings. We propose an online LiDAR-Camera Self-calibration Network (LCCNet), different from previous CNN-based methods. LCCNet can be trained end-to-end and predicts the extrinsic parameters in real time. In LCCNet, we exploit a cost volume layer to express the correlation between the RGB image features and the depth image projected from point clouds. Besides using the smooth L1 loss on the predicted extrinsic calibration parameters as a supervised signal, an additional self-supervised signal, a point cloud distance loss, is applied during training. Instead of directly regressing the extrinsic parameters, we predict the decalibration deviation from the initial calibration to the ground truth. The calibration error decreases further with iterative refinement and a temporal filtering approach at the inference stage. The execution time of the calibration process is 24 ms per iteration on a single GPU. LCCNet achieves a mean absolute calibration error of 0.297 cm in translation and 0.017° in rotation with miscalibration magnitudes of up to ±1.5 m and ±20° on the KITTI odometry dataset, which is better than state-of-the-art CNN-based calibration methods. The code will be publicly available at https://***/LvXudong-HIT/LCCNet
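The combination of losses described above can be sketched as below: a smooth-L1 term on the predicted de-calibration parameters plus a point-cloud distance term after applying the predicted correction. The (Euler angles + translation) parameterisation and the loss weighting are assumptions; the regression network itself is omitted.

```python
import torch
import torch.nn.functional as F

def euler_to_rot(angles):
    """Rotation matrices from (roll, pitch, yaw) angles, shape (B, 3) -> (B, 3, 3)."""
    r, p, y = angles.unbind(-1)
    cr, sr, cp, sp, cy, sy = r.cos(), r.sin(), p.cos(), p.sin(), y.cos(), y.sin()
    zero, one = torch.zeros_like(cr), torch.ones_like(cr)
    Rz = torch.stack([cy, -sy, zero, sy, cy, zero, zero, zero, one], -1).view(-1, 3, 3)
    Ry = torch.stack([cp, zero, sp, zero, one, zero, -sp, zero, cp], -1).view(-1, 3, 3)
    Rx = torch.stack([one, zero, zero, zero, cr, -sr, zero, sr, cr], -1).view(-1, 3, 3)
    return Rz @ Ry @ Rx

def calibration_loss(pred, gt, points, alpha=1.0):
    # pred/gt: (B, 6) = 3 Euler angles + 3 translation; points: (B, N, 3) LiDAR points.
    param_loss = F.smooth_l1_loss(pred, gt)                      # supervised term
    Rp, tp = euler_to_rot(pred[:, :3]), pred[:, 3:]
    Rg, tg = euler_to_rot(gt[:, :3]), gt[:, 3:]
    pts_pred = points @ Rp.transpose(1, 2) + tp.unsqueeze(1)
    pts_gt = points @ Rg.transpose(1, 2) + tg.unsqueeze(1)
    cloud_loss = (pts_pred - pts_gt).norm(dim=-1).mean()         # point cloud distance
    return param_loss + alpha * cloud_loss

loss = calibration_loss(torch.rand(2, 6) * 0.1, torch.rand(2, 6) * 0.1,
                        torch.rand(2, 1024, 3))
print(loss.item())
```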
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Leveraging Stable Diffusion for the generation of personalized portraits has emerged as a powerful and noteworthy tool, enabling users to create high-fidelity, custom character avatars based on their specific prompts. However, existing personalization methods face challenges, including test-time fine-tuning, the requirement of multiple input images, low preservation of identity, and limited diversity in generated outcomes. To overcome these challenges, we introduce IDAdapter, a tuning-free approach that enhances the diversity and identity preservation in personalized image generation from a single face image. IDAdapter integrates a personalized concept into the generation process through a combination of textual and visual injections and a face identity loss. During the training phase, we incorporate mixed features from multiple reference images of a specific identity to enrich identity-related content details, guiding the model to generate images with more diverse styles, expressions, and angles. Extensive evaluations demonstrate the effectiveness of our method, achieving both diversity and identity fidelity.
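As an illustration of the face-identity loss component, the sketch below computes one minus the cosine similarity between the face embedding of a generated image and the mean embedding of several reference images of the same identity; the face encoder here is a random placeholder standing in for a pretrained recognition network.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FaceEncoder(nn.Module):
    """Placeholder for a pretrained face recognition network producing unit embeddings."""
    def __init__(self, embed_dim=512):
        super().__init__()
        self.net = nn.Sequential(nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
                                 nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                 nn.Linear(16, embed_dim))

    def forward(self, x):
        return F.normalize(self.net(x), dim=-1)

def identity_loss(encoder, generated, references):
    # references: several images of the same identity; use their mean embedding.
    ref_emb = F.normalize(encoder(references).mean(dim=0, keepdim=True), dim=-1)
    gen_emb = encoder(generated)
    return (1.0 - (gen_emb * ref_emb).sum(dim=-1)).mean()

enc = FaceEncoder()
print(identity_loss(enc, torch.rand(2, 3, 112, 112), torch.rand(4, 3, 112, 112)).item())
```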