ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
The Sports Action Recognition (SAR) domain is of significant importance in research, with diverse applications ranging from aiding coaches in strategic decision-making to empowering athletes and contributing to real-time commercial entertainment. Despite the existence of extensive large-scale and small-scale datasets, the direct application of these datasets to specific sports domains, such as cricket, poses challenges. Existing datasets predominantly center around daily-life actions, lacking the necessary granularity for in-depth sports analyses. Current Cricket Action Analysis (CAA) datasets have limitations, including their small scale, modality constraints, and narrow focus on specific aspects, such as cricket batting. Recognizing the need for a more comprehensive benchmark, this article introduces the Cricket Excited Actions (CEA) dataset. Developed in collaboration with professional cricket players, the CEA dataset encompasses challenging multi-person actions within realistic cricket scenarios. The selected activity classes, such as Clean Bowled, Six, Four, and Catches, adhere to official standards and represent pivotal moments in cricket matches. Through precise annotation and empirical studies utilizing state-of-the-art action recognition model architectures, this study provides a valuable resource for further research and makes significant contributions by offering support essential to advancing CAA within the cricket sports community. The data and code are available at https://***/Altaf-hucn/Cricket-Excited-Actions-Benchmark.
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Though multimodal emotion recognition has achieved significant progress over recent years, the potential of rich synergic relationships across the modalities is not fully exploited. In this paper, we introduce Recursive Joint Cross-Modal Attention (RJCMA) to effectively capture both intra- and inter-modal relationships across audio, visual, and text modalities for dimensional emotion recognition. In particular, we compute the attention weights based on cross-correlation between the joint audio-visual-text feature representations and the feature representations of individual modalities to simultaneously capture intra- and inter-modal relationships across the modalities. The attended features of the individual modalities are again fed as input to the fusion model in a recursive mechanism to obtain more refined feature representations. We have also explored Temporal Convolutional Networks (TCNs) to improve the temporal modeling of the feature representations of individual modalities. Extensive experiments are conducted to evaluate the performance of the proposed fusion model on the challenging Affwild2 dataset. By effectively capturing the synergic intra- and inter-modal relationships across audio, visual, and text modalities, the proposed fusion model achieves a Concordance Correlation Coefficient (CCC) of 0.585 (0.542) and 0.674 (0.619) for valence and arousal, respectively, on the validation set (test set). This shows a significant improvement over the baseline of 0.240 (0.211) and 0.200 (0.191) for valence and arousal, respectively, in the validation set (test set), achieving second place in the valence-arousal challenge of the 6th Affective Behavior Analysis in-the-Wild (ABAW) competition. The code is available on GitHub: https://***/praveena2j/RJCMA.
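The fusion mechanism described above lends itself to a compact sketch. The PyTorch snippet below only illustrates the recursive joint cross-modal attention idea, not the authors' implementation (which is linked above); the layer sizes, the single learnable correlation matrix per modality, and the residual refinement are assumptions.

```python
# Illustrative sketch of recursive joint cross-modal attention (not the RJCMA code).
import torch
import torch.nn as nn


class RecursiveJointCrossAttention(nn.Module):
    def __init__(self, dim: int, n_iters: int = 2):
        super().__init__()
        self.n_iters = n_iters
        # Joint representation from the concatenated audio, visual, and text features.
        self.joint_proj = nn.Linear(3 * dim, dim)
        # One learnable correlation matrix per modality (an assumption).
        self.corr = nn.ModuleList([nn.Linear(dim, dim, bias=False) for _ in range(3)])

    def forward(self, audio, visual, text):
        # audio / visual / text: (batch, time, dim)
        feats = [audio, visual, text]
        for _ in range(self.n_iters):
            joint = self.joint_proj(torch.cat(feats, dim=-1))          # (B, T, D)
            refined = []
            for m, x in enumerate(feats):
                # Cross-correlation between the joint features and this modality's
                # features, used as attention weights over time.
                scores = torch.matmul(self.corr[m](joint), x.transpose(1, 2))
                attn = torch.softmax(scores / x.shape[-1] ** 0.5, dim=-1)
                refined.append(x + torch.matmul(attn, x))               # residual refinement
            # Attended features are fed back into the fusion on the next iteration.
            feats = refined
        return torch.cat(feats, dim=-1)                                 # fused representation


if __name__ == "__main__":
    fuse = RecursiveJointCrossAttention(dim=128)
    a, v, t = (torch.randn(2, 50, 128) for _ in range(3))
    print(fuse(a, v, t).shape)  # torch.Size([2, 50, 384])
```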
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
This paper proposes H³Net, which detects people in irregular postures by utilizing human structures and characters. To handle both features, we introduce two attention modules: 1) the Human Structure Attention Module (HSAM), which focuses on the spatial aspects of a person, and 2) the Human Character Attention Module (HCAM), which is designed to address the issue of repetitive appearance. HSAM effectively handles both foreground and background information about a human instance and utilizes keypoints to provide additional guidance for predicting irregular postures. Meanwhile, HCAM employs ID information obtained from the tracking head, enriching the posture prediction with high-level semantic information. Furthermore, gathering images of people in irregular postures is a challenging task, so many conventional datasets consist of images of the same actors simulating varying postures in distinct images. To address this problem, we propose a Human ID Dependent Posture (HID²) loss that handles repeated instances. The HID² loss generates a regularization term that accounts for duplicated instances to reduce bias. Our experiments demonstrate the effectiveness of H³Net compared to existing algorithms on irregular posture datasets. Furthermore, we show qualitative results using color-coded masks and bounding boxes, and provide ablation studies to highlight the significance of the proposed methods.
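As a rough illustration of how an ID-dependent regularization of this kind could work, the sketch below re-weights per-instance losses so that actors who appear in many images do not dominate training. The grouping-and-averaging scheme and the function name are hypothetical; the paper defines the actual HID² formulation.

```python
# Hedged sketch of an ID-balanced regularization in the spirit of the HID² loss
# (the exact formulation in the paper differs).
import torch


def id_balanced_loss(per_instance_loss: torch.Tensor, ids: torch.Tensor) -> torch.Tensor:
    """per_instance_loss: (N,) losses; ids: (N,) integer actor IDs from the tracking head."""
    group_means = []
    for uid in ids.unique():
        mask = ids == uid
        # Average over duplicated instances of the same actor first...
        group_means.append(per_instance_loss[mask].mean())
    # ...then average over actors, so each identity contributes equally.
    return torch.stack(group_means).mean()


if __name__ == "__main__":
    losses = torch.tensor([1.0, 0.8, 0.9, 2.0])   # three crops of actor 7, one of actor 3
    ids = torch.tensor([7, 7, 7, 3])
    print(id_balanced_loss(losses, ids))          # 0.5 * (0.9 + 2.0) = 1.45
```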
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Capturing the 3D human body is one of the important tasks in computer vision, with a wide range of applications such as virtual reality and sports analysis. However, conventional frame cameras are limited by their temporal resolution and dynamic range, which imposes constraints in real-world application setups. Event cameras have the advantages of high temporal resolution and high dynamic range (HDR), but event-based methods must be developed to handle data with different characteristics. This paper proposes a novel event-based method for 3D pose estimation and human mesh recovery. Prior work on event-based human mesh recovery requires frames (images) as well as event data. The proposed method relies solely on events; it carves 3D voxels by moving the event camera around a stationary body, reconstructs the human pose and mesh from attenuated rays, and fits statistical body models, preserving high-frequency details. The experimental results show that the proposed method outperforms conventional frame-based methods in the estimation accuracy of both pose and body mesh. We also demonstrate results in challenging situations where frame-based methods suffer from motion blur. This is the first work of its kind to demonstrate event-only human mesh recovery, and we hope it is a first step toward robust and accurate 3D human body scanning from vision sensors.
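For intuition about the carving step, the snippet below shows a heavily simplified, silhouette-style space-carving pass over a voxel grid, assuming binary per-view event masks and known camera projection matrices. The paper's attenuated-ray reconstruction and statistical body-model fitting are not reproduced here.

```python
# Simplified space-carving sketch under strong assumptions (not the paper's method).
import numpy as np


def carve_voxels(voxel_centers, event_masks, projections):
    """voxel_centers: (V, 3); event_masks: list of (H, W) bool arrays;
    projections: list of (3, 4) camera matrices, one per view."""
    keep = np.ones(len(voxel_centers), dtype=bool)
    homog = np.hstack([voxel_centers, np.ones((len(voxel_centers), 1))])  # (V, 4)
    for mask, P in zip(event_masks, projections):
        uvw = homog @ P.T                                  # project all voxels into this view
        u = (uvw[:, 0] / uvw[:, 2]).round().astype(int)
        v = (uvw[:, 1] / uvw[:, 2]).round().astype(int)
        h, w = mask.shape
        inside = (u >= 0) & (u < w) & (v >= 0) & (v < h)
        hit = np.zeros(len(voxel_centers), dtype=bool)
        hit[inside] = mask[v[inside], u[inside]]
        keep &= hit                                        # a voxel survives only if every view observes events there
    return keep
```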
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Though diffusion models have been successfully applied to various image restoration (IR) tasks, their performance is sensitive to the choice of training datasets. Typically, diffusion models trained on specific datasets fail to recover images that have out-of-distribution degradations. To address this problem, this work leverages a capable vision-language model and a synthetic degradation pipeline to learn image restoration in the wild (wild IR). More specifically, all low-quality images are simulated with a synthetic degradation pipeline that contains multiple common degradations such as blur, resize, noise, and JPEG compression. Then we introduce robust training for a degradation-aware CLIP model to extract enriched image content features to assist high-quality image restoration. Our base diffusion model is the image restoration SDE (IR-SDE). Built upon it, we further present a posterior sampling strategy for fast noise-free image generation. We evaluate our model on both synthetic and real-world degradation datasets. Moreover, experiments on the unified image restoration task illustrate that the proposed posterior sampling improves image generation quality for various degradations.
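A synthetic degradation pipeline of the kind described (blur, resize, noise, JPEG) can be sketched in a few lines with Pillow and NumPy. The parameter ranges and the fixed ordering below are assumptions for illustration; the authors' pipeline may randomize or compose degradations differently.

```python
# Minimal sketch of a blur -> resize -> noise -> JPEG degradation pipeline.
import io
import random

import numpy as np
from PIL import Image, ImageFilter


def degrade(img: Image.Image) -> Image.Image:
    img = img.convert("RGB")
    w, h = img.size
    # Gaussian blur with a random radius.
    img = img.filter(ImageFilter.GaussianBlur(radius=random.uniform(0.5, 3.0)))
    # Downscale then upscale back to simulate resolution loss.
    scale = random.uniform(0.25, 0.75)
    img = img.resize((max(1, int(w * scale)), max(1, int(h * scale))), Image.BICUBIC)
    img = img.resize((w, h), Image.BICUBIC)
    # Additive Gaussian noise.
    arr = np.asarray(img).astype(np.float32)
    arr += np.random.normal(0, random.uniform(1, 15), arr.shape)
    img = Image.fromarray(np.clip(arr, 0, 255).astype(np.uint8))
    # JPEG compression at a random quality.
    buf = io.BytesIO()
    img.save(buf, format="JPEG", quality=random.randint(30, 90))
    buf.seek(0)
    return Image.open(buf).convert("RGB")
```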
ISBN (Print): 9781665448994
The AI City Challenge was created with two goals in mind: (1) pushing the boundaries of research and development in intelligent video analysis for smarter-city use cases, and (2) assessing tasks where the level of performance is sufficient to drive real-world adoption. Transportation is a segment ripe for such adoption. The fifth AI City Challenge attracted 305 participating teams across 38 countries, who leveraged city-scale real traffic data and high-quality synthetic data to compete in five challenge tracks. Track 1 addressed video-based automatic vehicle counting, with the evaluation conducted on both algorithmic effectiveness and computational efficiency. Track 2 addressed city-scale vehicle re-identification with augmented synthetic data to substantially increase the training set for the task. Track 3 addressed city-scale multi-target multi-camera vehicle tracking. Track 4 addressed traffic anomaly detection. Track 5 was a new track addressing vehicle retrieval using natural language descriptions. The evaluation system shows a general leaderboard of all submitted results and a public leaderboard of results limited to the contest participation rules, where teams are not allowed to use external data in their work. The public leaderboard shows results closer to real-world situations where annotated data is limited. Results show the promise of AI in Smarter Transportation. State-of-the-art performance on some tasks shows that these technologies are ready for adoption in real-world systems.
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Neural Radiance Fields (NeRFs) have emerged as promising tools for advancing autonomous driving (AD) research, offering scalable closed-loop simulation and data augmentation capabilities. However, to trust the results achieved in simulation, one needs to ensure that AD systems perceive real and rendered data in the same way. Although the performance of rendering methods is increasing, many scenarios will remain inherently challenging to reconstruct faithfully. To this end, we propose a novel perspective for addressing the real-to-simulated data gap. Rather than solely focusing on improving rendering fidelity, we explore simple yet effective methods to enhance perception model robustness to NeRF artifacts without compromising performance on real data. Moreover, we conduct the first large-scale investigation into the real-to-simulated data gap in an AD setting using a state-of-the-art neural rendering technique. Specifically, we evaluate object detectors and an online mapping model on real and simulated data, and study the effects of different fine-tuning strategies. Our results show notable improvements in model robustness to simulated data, even improving real-world performance in some cases. Last, we delve into the correlation between the real-to-simulated gap and image reconstruction metrics, identifying FID and LPIPS as strong indicators.
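For readers who want to probe the real-to-simulated gap on their own data, one of the reconstruction metrics named above, LPIPS, can be computed on paired real and rendered frames as in the sketch below (using the publicly available lpips package). The detector and online-mapping evaluations from the paper are out of scope here.

```python
# Hedged sketch: perceptual distance between paired real and NeRF-rendered frames.
import torch
import lpips

loss_fn = lpips.LPIPS(net="alex")          # AlexNet-based LPIPS


def perceptual_gap(real: torch.Tensor, rendered: torch.Tensor) -> float:
    """real, rendered: (N, 3, H, W) tensors scaled to [-1, 1], frame-aligned pairs."""
    with torch.no_grad():
        return loss_fn(real, rendered).mean().item()
```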
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Although stereo image super-resolution has been extensively studied, many existing works rely only on attention in a single epipolar direction to reconstruct stereo images. In the case of asymmetric parallax images, these methods often struggle to capture reliable stereo correspondence, resulting in reconstructed images that suffer from blurring and artifacts. In this paper, we propose a novel method called the Cross-View Aggregation Network for Stereo Image Super-Resolution (CANSSR) and explore the relationship between multi-directional epipolar lines to construct reliable stereo correspondence. Specifically, we propose a multi-directional cross-view aggregation module (MCAM) that effectively captures multi-directional stereo correspondence and obtains cross-view complementary information. Furthermore, we design a channel-spatial aggregation module (CSAM) that aggregates multi-order global-local information within each view to reconstruct clearer texture features. In addition, we equip the feed-forward network with a large-kernel convolution to acquire richer detailed texture information. Extensive experiments conclusively demonstrate that CANSSR outperforms state-of-the-art methods both qualitatively and quantitatively for stereo image super-resolution on the Flickr1024 and Middlebury datasets.
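To make the multi-directional idea concrete, the toy module below lets left-view features attend to right-view features along the horizontal epipolar axis and, additionally, along the vertical axis, then fuses the two aggregations. It only illustrates the concept; the actual MCAM and CSAM designs in the paper differ.

```python
# Toy sketch of multi-directional cross-view attention (illustration only).
import torch
import torch.nn as nn


class MultiDirCrossViewAttention(nn.Module):
    def __init__(self, channels: int):
        super().__init__()
        self.q = nn.Conv2d(channels, channels, 1)
        self.k = nn.Conv2d(channels, channels, 1)
        self.fuse = nn.Conv2d(2 * channels, channels, 1)

    def _axis_attention(self, left, right, horizontal: bool):
        if not horizontal:                                  # attend along the vertical axis by swapping H and W
            left, right = left.transpose(2, 3), right.transpose(2, 3)
        b, c, h, w = left.shape
        q = self.q(left).permute(0, 2, 3, 1).reshape(b * h, w, c)   # queries along each row
        k = self.k(right).permute(0, 2, 1, 3).reshape(b * h, c, w)
        v = right.permute(0, 2, 3, 1).reshape(b * h, w, c)
        attn = torch.softmax(torch.bmm(q, k) / c ** 0.5, dim=-1)    # (b*h, w, w)
        out = torch.bmm(attn, v).reshape(b, h, w, c).permute(0, 3, 1, 2)
        return out if horizontal else out.transpose(2, 3)

    def forward(self, left, right):
        horiz = self._axis_attention(left, right, horizontal=True)
        vert = self._axis_attention(left, right, horizontal=False)
        return left + self.fuse(torch.cat([horiz, vert], dim=1))


if __name__ == "__main__":
    m = MultiDirCrossViewAttention(channels=32)
    print(m(torch.randn(1, 32, 24, 40), torch.randn(1, 32, 24, 40)).shape)  # (1, 32, 24, 40)
```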
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Terrain classification is an important problem for mobile robots operating in extreme environments as it can aid downstream tasks such as autonomous navigation and planning. While RGB cameras are widely used for terrain identification, vision-based methods can suffer due to poor lighting conditions and occlusions. In this paper, we propose the novel use of Ground Penetrating Radar (GPR) for terrain characterization for mobile robot platforms. Our approach leverages machine learning for surface terrain classification from GPR data. We collect a new dataset consisting of four different terrain types, and present qualitative and quantitative results. Our results demonstrate that classification networks can learn terrain categories from GPR signals. Additionally, we integrate our GPR-based classification approach into a multimodal semantic mapping framework to demonstrate a practical use case of GPR for surface terrain classification on mobile robots. Overall, this work extends the usability of GPR sensors deployed on robots to enable terrain classification in addition to GPR’s existing scientific use cases.
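As a minimal illustration of learning terrain categories from GPR signals, the sketch below treats each GPR trace (A-scan) as a 1-D signal and classifies it into the four terrain types with a small 1-D CNN. The input format and architecture are assumptions, not the network used in the paper.

```python
# Illustrative 1-D CNN terrain classifier over GPR traces (assumed data format).
import torch
import torch.nn as nn


class GPRTerrainClassifier(nn.Module):
    def __init__(self, n_classes: int = 4):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=7, stride=2, padding=3), nn.ReLU(),
            nn.Conv1d(16, 32, kernel_size=5, stride=2, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        self.head = nn.Linear(32, n_classes)

    def forward(self, x):                      # x: (batch, 1, samples_per_trace)
        return self.head(self.features(x).squeeze(-1))


if __name__ == "__main__":
    model = GPRTerrainClassifier()
    logits = model(torch.randn(8, 1, 512))     # 8 traces, 512 time samples each
    print(logits.shape)                        # torch.Size([8, 4])
```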
Blind image inpainting is a crucial restoration task that does not demand additional mask information to restore corrupted regions. Yet it is a relatively underexplored research area due to the difficulty of discriminating between corrupted and valid regions. The few existing approaches to blind image inpainting sometimes fail to produce plausible inpainted images, since they follow the common practice of first predicting the corrupted regions and then inpainting them. To skip the corrupted-region prediction step and obtain better results, in this work we propose a novel end-to-end architecture for blind image inpainting consisting of a wavelet query multi-head attention transformer block and omni-dimensional gated attention. The proposed wavelet query multi-head attention in the transformer block provides encoder features via processed wavelet coefficients as the query to the multi-head attention. Further, the proposed omni-dimensional gated attention effectively provides all-dimensional attentive features from the encoder to the respective decoder. Our proposed approach is compared numerically and visually with existing state-of-the-art methods for blind image inpainting on different standard datasets. The comparative and ablation studies prove the effectiveness of the proposed approach for blind image inpainting. The testing code is available at: https://***/shrutiphutke/Blind_Omni_Wav_Net
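The wavelet-query idea can be illustrated with a short sketch: queries for multi-head attention are derived from wavelet coefficients of the encoder features rather than from the features themselves. The single-level Haar low-pass band and the layer sizes below are illustrative assumptions and not the authors' Blind_Omni_Wav_Net implementation.

```python
# Hedged sketch: multi-head attention whose queries come from Haar wavelet coefficients.
import torch
import torch.nn as nn
import torch.nn.functional as F


def haar_lowpass(x):
    """Single-level 2-D Haar low-low band: average of each 2x2 block -> (B, C, H/2, W/2)."""
    return F.avg_pool2d(x, kernel_size=2)


class WaveletQueryAttention(nn.Module):
    def __init__(self, channels: int, heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(channels, heads, batch_first=True)
        self.q_proj = nn.Conv2d(channels, channels, 1)

    def forward(self, feat):                               # feat: (B, C, H, W), H and W even
        b, c, h, w = feat.shape
        q = self.q_proj(haar_lowpass(feat))                 # queries from wavelet coefficients
        q = q.flatten(2).transpose(1, 2)                    # (B, H/2*W/2, C)
        kv = feat.flatten(2).transpose(1, 2)                # (B, H*W, C)
        out, _ = self.attn(q, kv, kv)
        out = out.transpose(1, 2).reshape(b, c, h // 2, w // 2)
        return F.interpolate(out, size=(h, w), mode="bilinear", align_corners=False)


if __name__ == "__main__":
    block = WaveletQueryAttention(channels=64)
    print(block(torch.randn(1, 64, 32, 32)).shape)          # torch.Size([1, 64, 32, 32])
```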