检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

220 篇 会议
5 篇 期刊文献
2 册 图书

馆藏范围

227 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

217 篇 工学
- 203 篇 计算机科学与技术...
- 25 篇 电气工程
- 20 篇 机械工程
- 4 篇 信息与通信工程
- 4 篇 控制科学与工程
- 4 篇 软件工程
- 1 篇 光学工程
- 1 篇 冶金工程
- 1 篇 建筑学
- 1 篇 土木工程
- 1 篇 生物医学工程（可授...
- 1 篇 生物工程
7 篇 理学
- 2 篇 数学
- 2 篇 物理学
- 1 篇 大气科学
- 1 篇 海洋科学
- 1 篇 生物学
- 1 篇 系统科学
- 1 篇 统计学（可授理学、...
3 篇 医学
- 3 篇 临床医学
2 篇 管理学
- 2 篇 管理科学与工程(可...

主题

44 篇 computer vision
24 篇 pattern recognit...
23 篇 object recogniti...
20 篇 images
15 篇 face recognition
11 篇 object detection
9 篇 action recogniti...
9 篇 training
8 篇 image segmentati...
8 篇 feature extracti...
7 篇 image classifica...
6 篇 cameras
6 篇 gesture recognit...
6 篇 dataset
6 篇 machine learning
6 篇 recognition
5 篇 deep learning
5 篇 visualization
5 篇 image recognitio...
5 篇 shape

机构

6 篇 chinese acad sci...
4 篇 chinese univ hon...
4 篇 mpi intelligent ...
4 篇 carnegie mellon ...
4 篇 univ washington ...
3 篇 univ oxford oxfo...
3 篇 hong kong univ s...
3 篇 mit cambridge ma...
3 篇 univ calif berke...
2 篇 inria
2 篇 stevens inst tec...
2 篇 univ calif river...
2 篇 univ toronto on
2 篇 rensselaer polyt...
2 篇 georgia inst tec...
2 篇 swiss fed inst t...
2 篇 chinese acad sci...
2 篇 chinese acad sci...
2 篇 natl univ singap...
2 篇 swiss fed inst t...

作者

4 篇 hu weiming
3 篇 yuan chunfeng
3 篇 li stan z.
3 篇 lu cewu
3 篇 wen longyin
3 篇 jia jiaya
3 篇 murala subrahman...
3 篇 ji qiang
3 篇 salzmann mathieu
3 篇 shao ling
3 篇 vedaldi andrea
3 篇 yan shuicheng
3 篇 vipparthi santos...
3 篇 brandt jonathan
3 篇 hua gang
2 篇 kannala juho
2 篇 cucchiara rita
2 篇 ling haibin
2 篇 wang limin
2 篇 maji subhransu

语言

227 篇 英文

检索条件"任意字段=27th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2014"

共 227 条记录，以下是1-10 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Key Point-Based Driver Activity recognition

Key Point-Based Driver Activity Recognition

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Vats, Arpita Anastasiu, David C. Santa Clara Univ Santa Clara CA 95053 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

We present a key point-based activity recognition framework, built upon pre-trained human pose estimation and facial feature detection models. Our method extracts complex static and movement-based features from key frames in videos, which are used to predict a sequence of key-frame activities. Finally, a merge procedure is employed to identify robust activity segments while ignoring outlier frame activity predictions. We analyze the different components of our framework via a wide array of experiments and draw conclusions with regards to the utility of the model and ways it can be improved. Results show our model is competitive, taking the 11th place out of 27 teams submitting to Track 3 of the 2022 AI City Challenge.

关键词： computer vision conferences Urban areas Pose estimation Activity recognition Predictive models Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Probing Attention-Driven Normalizing Flow Network for Low-Light Image Enhancement 27th

Probing Attention-Driven Normalizing Flow Network for Low-L...

引用

27th International conference on pattern recognition, ICPR 2024

作者： Singh, Siddharth Mehta, Nancy Prakash, K.N. Vipparthi, Santosh Kumar Murala, Subrahmanyam CVPR Lab Indian Institute of Technology Ropar Rupnagar India Vision Lab CAIDAS & IFI University of Wuerzburg Würzburg Germany SR Gudlavalleru Engineering College Vijayawada India CVPR Lab School of Computer Science and Statistics Trinity College Dublin Dublin Ireland

ISBN: (纸本)9783031781247

Existing low-light image enhancement approaches based upon pixel-wise reconstruction losses are inadept at capturing the complex distribution of well-exposed images, resulting in residual noise, insufficient illuminance, and artifacts. Additionally, the mapping relationship between weakly-illuminated and normally exposed images is one-to-many, making low-light image enhancement a vastly ill-posed problem. In this work, we probe into this one-to-many relationship via an attention and frequency driven normalizing flow network by minimizing the negative log-likelihood loss. the proposed model comprises of two parts: a dual-attention-oriented frequency encoder network and an invertible network which inputs the conditional low-light images and changes the mapping of the complex distribution of well-light images to simpler Gaussian distribution. the proposed model not only utilizes the spatial information inherent in the image for improving the contrast but also extracts the frequency information for preserving the intricate details. To sum up, the distribution of the well-exposed images can be characterized better, and the overall enhancement mechanism becomes analogous to being restrained by a loss function which defines the manifold structure of natural images during the training. Detailed experiment analysis on a variety of challenging low-light images exemplifies the potency of the model and shows its primacy over the state-of-the-art in terms of enhanced quality and efficiency. © the Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Mapping

来源：评论

学校读者我要写书评

暂无评论

the 6th AI City Challenge

The 6th AI City Challenge

引用

ieee/CVF conference on computer vision and pattern recognition (cvpr)

作者： Naphade, Milind Wang, Shuo Anastasiu, David C. Tang, Zheng Chang, Ming-Ching Yao, Yue Zheng, Liang Rahman, Mohammed Shaiqur Venkatachalapathy, Archana Sharma, Anuj Feng, Qi Ablavsky, Vitaly Sclaroff, Stan Chakraborty, Pranamesh Li, Alice Li, Shangru Chellappa, Rama NVIDIA Corp Santa Clara CA 95051 USA Santa Clara Univ Santa Clara CA 95053 USA SUNY Albany Albany NY 12222 USA Australian Natl Univ Canberra ACT Australia Indian Inst Technol Kanpur Kanpur Uttar Pradesh India Iowa State Univ Ames IA USA Boston Univ Boston MA 02215 USA Univ Washington Seattle WA 98195 USA Johns Hopkins Univ Baltimore MD 21218 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

the 6th edition of the AI City Challenge specifically focuses on problems in two domains where there is tremendous unlocked potential at the intersection of computer vision and artificial intelligence: Intelligent Traffic Systems (ITS), and brick and mortar retail businesses. the four challenge tracks of the 2022 AI City Challenge received participation requests from 254 teams across 27 countries. Track 1 addressed city-scale multi-target multi-camera (MTMC) vehicle tracking. Track 2 addressed natural-language-based vehicle track retrieval. Track 3 was a brand new track for naturalistic driving analysis, where the data were captured by several cameras mounted inside the vehicle focusing on driver safety, and the task was to classify driver actions. Track 4 was another new track aiming to achieve retail store automated checkout using only a single view camera. We released two leader boards for submissions based on different methods, including a public leader board for the contest, where no use of external data is allowed, and a general leader board for all submitted results. the top performance of participating teams established strong baselines and even outperformed the state-of-the-art in the proposed challenge tracks.

关键词： computer vision Mortar Urban areas Focusing Cameras Safety pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Frequency Modulated Deformable Transformer for Underwater Image Enhancement 27th

Frequency Modulated Deformable Transformer for Underwater I...

引用

27th International conference on pattern recognition, ICPR 2024

作者： Dukre, Adinath Deshmukh, Vivek Kulkarni, Ashutosh Phutke, Shruti Vipparthi, Santosh Kumar Gonde, Anil B. Murala, Subrahmanyam Shri Guru Gobind Singhji Institute of Engineering and Technology Nanded India CVPR Lab Indian Institute of Technology Ropar Rupnagar India ETI Lab Yamaha Motor Solutions Faridabad India School of Computer Science and Statistics Trinity College Dublin Dublin Ireland

ISBN: (纸本)9783031781247

Underwater images frequently experience quality degradation due to refraction, back-scattering, and absorption, leading to color distortion, blurriness, and reduced visibility. Such degradation present in the underwater images can cause inaccuracies while functioning with higher advanced level computer vision applications, equipped for autonomous underwater vehicles. Despite the ability of enhancing the degraded images, existing approaches fail at preserving the localized fine edges also producing the true colors. therefore, an effective pre-processing network is necessary for underwater image enhancement. With this motivation, we propose a frequency modulated deformable transformer network for underwater image enhancement. Initially, the features are extracted with the proposed multi-scale feature fusion feed-forward module. Further, the frequency modulated deformable attention module is proposed to reconstruct fine-level texture in the restored image. Here, we propose a spatio-channel attentive offset extractor in the modulated deformable convolution for focusing on relevant contextual information. Also, adaptive edge-preserving skip connections are proposed for propagating prominent edge features from the network’s shallow layers to its deeper layers. A comprehensive evaluation of the proposed method on synthetic and real-world datasets and extensive ablation analysis demonstrates that the proposed approach shows superior performance than existing state-of-the-art methods. the testing code is provided at https://***/adinathdukre/FMDTUIE. © the Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Frequency modulation

来源：评论

学校读者我要写书评

暂无评论

FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting 27th

FastTextSpotter: A High-Efficiency Transformer for Multilin...

引用

27th International conference on pattern recognition, ICPR 2024

作者： Das, Alloy Biswas, Sanket Pal, Umapada Lladós, Josep Bhattacharya, Saumik CVPR Unit Indian Statistical Institute Kolkata Kolkata India Computer Vision Center Universitat Autónoma de Barcelona Bellaterra Spain ECE Indian Institute of Technology Kharagpur Kharagpur India

ISBN: (纸本)9783031784972

the proliferation of scene text in both structured and unstructured environments presents significant challenges in optical character recognition (OCR), necessitating more efficient and robust text spotting solutions. this paper presents FastTextSpotter, a framework that integrates a Swin Transformer visual backbone with a Transformer Encoder-Decoder architecture, enhanced by a novel, faster self-attention unit, SAC2, to improve processing speeds while maintaining accuracy. FastTextSpotter has been validated across multiple datasets, including ICDAR2015 for regular texts and CTW1500 and TotalText for arbitrary-shaped texts, benchmarking against current state-of-the-art models. Our results indicate that FastTextSpotter not only achieves superior accuracy in detecting and recognizing multilingual scene text (English and Vietnamese) but also improves model efficiency, thereby setting new benchmarks in the field. this study underscores the potential of advanced transformer architectures in improving the adaptability and speed of text spotting applications in diverse real-world settings. the dataset, code, and pre-trained models have been released in our Github. © the Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Optical character recognition

来源：评论

学校读者我要写书评

暂无评论

Deformable Multi-Scale Network for Snow Removal in Video 27th

Deformable Multi-Scale Network for Snow Removal in Video

引用

27th International conference on pattern recognition, ICPR 2024

作者： He, Runlin Zhou, Gang Xue, Tianhao Liu, Zhaoxi Jia, Zhenhong Key Laboratory of Signal Detection and Processing Department of Computer Science and Technology Xinjiang University Urumqi China

ISBN: (纸本)9783031781247

Snowfall severely degrades outdoor video visibility while reducing the performance of subsequent vision tasks. Although video recovery methods based on deep learning have achieved amazing accomplishments, video snow removal still faces problems such as varying scales and intricate trajectories of snowflakes, which makes it difficult to remove snowflakes and easy to create artifacts on moving objects. To address these issues, we propose a deformable multi-scale video desnowing network. Specifically, we design a multi-scale pseudo-3D residual block(MSRB-P3D) that can effectively remove snowflakes of different scales. Furthermore, a deformable large kernel attention 3Dblock(D-LKA 3Dblock) is introduced to capture the inter-frame dynamic information and reduce the artifacts. Due to the scarcity of dataset, we proposed a new dataset named Synthetic and Real Snowy Video Dataset(SRSVD). Extensive experiments have proven that our proposed method not only outperforms other state-of-the-art methods on both synthetic and real snowy videos, but also effectively improves performance on subsequent vision task. © the Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Snow

来源：评论

学校读者我要写书评

暂无评论

Fusing Image and Text Features for Scene Sentiment Analysis Using Whale-Honey Badger Optimization Algorithm (WHBOA) 27th

Fusing Image and Text Features for Scene Sentiment Analysi...

引用

27th International conference on pattern recognition, ICPR 2024

作者： Yadav, Prem Shanker Tyagi, Dinesh Kumar Vipparthi, Santosh Kumar Department of Computer Science and Engineering Malaviya National Institute of Technology Rajasthan Jaipur302017 India School of Artificial Intelligence and Data Engineering Indian Institute of Technology Ropar Punjab Rupnagar140001 India

ISBN: (纸本)9783031781650

Developing a real-time sentiment analysis application that relies solely on features extracted from images or textual content falls short of capturing human emotions’ nuanced and multifaceted nature. the unlabeled dataset, though useful, has limitations for sentiment analysis due to its general image descriptions, which lack emotional depth and do not include direct sentiment labels. Finding scene sentiment is a challenging task. To address this, combining textual descriptions with visual features is crucial. Important parameters include entropy, bag of words, and parts of speech (nouns, adjectives, and verbs) for textual analysis, alongside visual features like SIFT, SURF, and color histograms. these features are integrated to capture a comprehensive range of sentiment cues, enhancing the accuracy and depth of sentiment insights. this paper proposes an optimized adaptive neuro-fuzzy inference system for a compelling feature enhancement using the Whale-Honey Badger Optimization Algorithm (WHBOA). the proposed method identifies the most relevant and effective features from both textual and visual data. It captures visual-specific attributes to provide a richer and more detailed representation of visual content, addressing the limitations of general image descriptions and paving the way for the development of predictive models. Additionally, text pre-processing cleans and normalizes the textual data. We conducted an extensive comparative performance evaluation to assess the effectiveness and accuracy of the proposed model. the model is compared with the Nearest Neighbor, Support Vector Machine (SVM), and Decision Tree classification algorithms for the performance *** results demonstrate that the optimized model performs better, achieving an accuracy of approximately 91.2%, compared to the other models. © the Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Support vector machines

来源：评论

学校读者我要写书评

暂无评论

A Low-cost Approach Towards Streaming 3D Videos of Large-scale Sport Events to Mixed Reality Headsets in Real-time

A Low-cost Approach Towards Streaming 3D Videos of Large-sca...

引用

27th ieee conference on Virtual Reality and 3D User Interfaces (ieee VR)

作者： Marty, Kevin Rajasekaran, Prithvi Sun, Yongbin Fuchs, Klaus MIT Auto ID Labs Cambridge MA 02139 USA ETHZ Zurich Switzerland

ISBN: (数字)9781728165325

ISBN: (纸本)9781728165325

Watching sports events via 3D-instead of two-dimensional video streaming allows for increased immersion, e.g. via mixed reality headsets in comparison to traditional screens. So far, capturing 3D video of sports events required expensive outside-in tracking with numerous cameras. this study demonstrates the feasibility of streaming sports content to mixed reality headsets as holographs in real-time using inside-out tracking and low-cost equipment only. We demonstrate our system by streaming a race car on an indoor track as 3D models, which are then rendered in an Magic Leap One headset. An onboard camera, mounted on the race car provides the video stream used to localize the car via computer vision. the localization is estimated by an end-to-end convolutional neural network (CNN). the study compares three state-of-the-art CNN models in their respective accuracy and execution time, with PoseNet+LSTM achieving position and orientation accuracy of 0.35m and 3.95 degrees. the total streaming latency in this study was 1041ms, suggesting technical feasibility of streaming 3D sports content, e.g. on large playgrounds, in near real-time onto mixed-reality headsets.

关键词： Augmented Reality Visualization Head mounted display Sport streaming Deep learning Image processing pattern recognition Localization

来源：评论

学校读者我要写书评

暂无评论

An overview of robot vision

An overview of robot vision

引用

27th Southern African Universities Power Engineering conference (SAUPEC) / 11th Robotics and Mechatronics conference of South Africa (RobMech) / 29th Annual Symposium of pattern-recognition-Association-of-South-Africa (PRASA)

作者： van Eden, Beatrice Rosman, Benjamin CSIR Mobile Intelligent Autonomous Syst Pretoria South Africa Univ Witwatersrand Sch Comp Sci & Appl Math Johannesburg South Africa

ISBN: (纸本)9781728103693

Robot vision is an interdisciplinary field that deals with how robots can be made to gain high-level understanding from digital images or videos. Understanding an image at the pixel level often does not provide enough information for decision making and action taking. In this case, higher level semantic information that describes the image is required. this helps the robot to accomplish complex tasks that require visual understanding. For robots to add value they need to be sufficiently effective at executing tasks in different settings. Despite many impressive advances in robot vision, robots still lack the ability to function as humans do in complex environments. Importantly, this includes being able to interpret and understand the perceptual complexities of the world. Robot vision is dependant on ideas from both computer vision and machine learning. In this paper we provide a overview of the advances in these disciplines and how they contribute to robot vision.

关键词： Robot sensing systems Image classification Object recognition Object detection Visualization computer architecture

来源：评论

学校读者我要写书评

暂无评论

MULTI-CLASS WEAthER CLASSIFICATION FROM STILL IMAGE USING SAID ENSEMBLE MEthOD

MULTI-CLASS WEATHER CLASSIFICATION FROM STILL IMAGE USING SA...

引用

作者： Ajayi, Gbeminiyi Oluwafemi Wang Zenghui Univ South Africa Elect & Min Dept Johannesburg South Africa

ISBN: (纸本)9781728103693

In the field of computer vision, multi-class outdoor weather classification is a difficult task to perform due to diversity and lack of distinct weather characteristic or features. this research proposed a novel framework for identifying different weather scenes from still images using heterogeneous ensemble methods. Our approach is based on a method called Selection Based on Accuracy Intuition and diversity (SAID) of stacked ensemble algorithms. this involves the extraction of histogram of features from different weather scenes. the blending and boosting of different weather features using stacked ensemble algorithms increases recognition rate of different weather conditions compared to other classification and ensemble methods. the paper presents academic and practitioners a new insight into diversity of heterogeneous ensemble methods for solving the challenges of weather recognition from still images.

关键词： Stacking ensemble ensemble diversity weather identification recognition Image classification

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共23页 << < 1 2 3 4 5 6 7 8 9 10 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：