Search Results - Inner Mongolia University Library

Improving Deep Learning Models Considering the Time Lags between Explanatory and Response Variables

JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2024, Vol. 20, No. 3, pp. 345-359

Authors: Kim, Chaehyeon; Lee, Ki Yong. Affiliations: Sookmyung Womens Univ, Dept Comp Sci, Seoul, South Korea; Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA, USA

A regression model represents the relationship between explanatory and response variables. In real life, explanatory variables often affect a response variable with a certain time lag, rather than immediately. For example, the marriage rate affects the birth rate with a time lag of 1 to 2 years. Although deep learning models have been successfully used to model various relationships, most of them do not consider the time lags between explanatory and response variables. Therefore, in this paper, we propose an extension of deep learning models, which automatically finds the time lags between explanatory and response variables. The proposed method finds out which of the past values of the explanatory variables minimize the error of the model, and uses the found values to determine the time lag between each explanatory variable and response variables. After determining the time lags between explanatory and response variables, the proposed method trains the deep learning model again by reflecting these time lags. Through various experiments applying the proposed method to a few deep learning models, we confirm that the proposed method can find a more accurate model whose error is reduced by more than 60% compared to the original model.
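The lag search described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a least-squares line stands in for the deep learning model, each explanatory variable is searched independently, and the names `lagged_pair`, `fit_mse`, `find_lags`, and the synthetic data are all hypothetical.

```python
import numpy as np

def lagged_pair(x, y, lag, max_lag):
    """Align x[t - lag] with y[t] for t = max_lag .. len(y) - 1."""
    n = len(y)
    return x[max_lag - lag : n - lag], y[max_lag:]

def fit_mse(x, y):
    """Mean squared error of a least-squares line y ~ a*x + b (stand-in model)."""
    A = np.column_stack([x, np.ones_like(x)])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return float(np.mean((A @ coef - y) ** 2))

def find_lags(X, y, max_lag):
    """For each column of X, return the lag in 0..max_lag with the lowest error."""
    lags = []
    for j in range(X.shape[1]):
        errors = [fit_mse(*lagged_pair(X[:, j], y, lag, max_lag))
                  for lag in range(max_lag + 1)]
        lags.append(int(np.argmin(errors)))
    return lags

# Synthetic data (assumed for illustration): y depends on x0 two steps back
# and on x1 five steps back, plus a little noise.
rng = np.random.default_rng(0)
n = 500
X = rng.normal(size=(n, 2))
y = np.zeros(n)
y[5:] = 3.0 * X[3:n - 2, 0] - 2.0 * X[:n - 5, 1]
y += 0.1 * rng.normal(size=n)

found_lags = find_lags(X, y, max_lag=8)
print(found_lags)
```

Once the lags are found, the explanatory columns would be shifted by those lags and the model retrained on the aligned data, which corresponds to the retraining step the abstract mentions.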

Keywords: Deep Learning; Model Optimization; Regression Model; Time Lag

Application of Wireless Network Visual Communication Technology in Fuzzy Image Target Classification Detection

2nd IEEE International Conference on Integrated Circuits and Communication Systems, ICICACS 2024

Authors: Zhang, Yuhao. Affiliations: Hanyang University, Visual Design Major, Graduate School of Design, Seoul Special City, Korea, Republic of

ISBN: (print) 9798350317558

Fuzzy image target classification detection plays an important role in image processing. Traditional classification detection methods are easily affected by environmental and equipment factors, and there are certain limitations in classification accuracy and real-time performance. In order to improve the performance of fuzzy image target classification detection and image processing, this paper combines wireless network visual communication technology and utilizes multi node collaborative processing to achieve fuzzy image information processing. To verify its effectiveness, this article compares it with traditional deep learning (DL) methods. In the experimental analysis, it was found that compared to the DL method, the average accuracy of fuzzy image classification under wireless network visual communication technology improved by 9.46%. The conclusion indicates that wireless network visual communication technology can help improve the real-time performance and classification effect of blurred image targets, and can promote the intelligent development of image processing technology. © 2024 IEEE.

Keywords: Blur Image; Feature Recognition; Image Classification; Visual Communication; Wireless Network

Real-time detection of wood defects based on SPP-improved YOLO algorithm

MULTIMEDIA TOOLS AND APPLICATIONS, 2023, Vol. 82, No. 14, pp. 21031-21044

Authors: Cui, Yuming; Lu, Shuochen; Liu, Songyong. Affiliations: Jiangsu Normal Univ, Sch Mechatron Engn, Xuzhou 221116, Peoples R China; China Univ Min & Technol, Sch Mechatron Engn, Xuzhou 221116, Peoples R China

Wood processing is widely used in agriculture and industry. Low precision and high time delay of machine learning in wood defect detection are currently the main factors restricting the production efficiency and product quality of the wood processing industry. An SPP-improved deep learning method was proposed to detect wood defects based on the basic framework of the YOLO V3 network to improve accuracy and real-time performance. The extended dataset was first established by image data enhancement and preprocessing based on the limited samples of the wood defect dataset. Anchor box scale re-clustering of the wood defect dataset was carried out according to the defect features. The spatial pyramid pooling (SPP) network was applied to improve the feature pyramid (FP) network in YOLO V3. The validity and real-time performance of the proposed algorithm were verified by a randomly selected test set. The results show that the overall detection accuracy rate on the wood defect test dataset reaches 93.23% while the detection time for each image is within 13 ms.
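The abstract does not say how the anchor boxes were re-clustered. A common practice with YOLO-style detectors is k-means over box widths and heights using IoU as the similarity measure, and the sketch below assumes that approach. The function names, the deterministic initialisation, and the sample boxes are hypothetical and not taken from the paper.

```python
import numpy as np

def iou_wh(boxes, anchors):
    """IoU between (w, h) pairs, treating every box as sharing one corner."""
    inter = (np.minimum(boxes[:, None, 0], anchors[None, :, 0])
             * np.minimum(boxes[:, None, 1], anchors[None, :, 1]))
    area_b = boxes[:, 0] * boxes[:, 1]
    area_a = anchors[:, 0] * anchors[:, 1]
    return inter / (area_b[:, None] + area_a[None, :] - inter)

def cluster_anchors(boxes, k, iters=100):
    """k-means on (w, h) with IoU similarity; returns k anchors sorted by area."""
    # Deterministic start: spread the initial anchors across boxes sorted by area.
    order = np.argsort(boxes[:, 0] * boxes[:, 1])
    picks = order[np.linspace(0, len(boxes) - 1, k).astype(int)]
    anchors = boxes[picks].astype(float)
    for _ in range(iters):
        # Assign each box to the anchor it overlaps most.
        assign = np.argmax(iou_wh(boxes, anchors), axis=1)
        updated = anchors.copy()
        for i in range(k):
            members = boxes[assign == i]
            if len(members):
                updated[i] = members.mean(axis=0)
        if np.allclose(updated, anchors):
            break
        anchors = updated
    return anchors[np.argsort(anchors[:, 0] * anchors[:, 1])]

# Hypothetical defect box sizes (width, height) in pixels.
boxes = np.array([[10, 12], [12, 10], [11, 11],
                  [40, 20], [42, 22], [38, 18],
                  [90, 100], [100, 90], [95, 95]], dtype=float)
anchors = cluster_anchors(boxes, k=3)
print(anchors)
```

Using IoU instead of Euclidean distance keeps large boxes from dominating the clustering, which is the usual reason given for this choice.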

Keywords: Transfer learning; Wood defects detection; Real-time detection; Full convolutional neural network

Large-Scale Image Indexing and Retrieval Methods: A PRISMA-Based Review

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, Vol. 15, No. 7, pp. 325-337

Authors: Saouabe, Abdelkrim; Tkatek, Said; Oualla, Hicham; Henriquez, Carlos S. O. S. A. Affiliations: IbnTofail Univ Kenitra, Fac Sci, Comp Sci Res Lab, Kenitra, Morocco; AKKODIS Res, Paris, France

Large-scale image indexing and retrieval are pivotal in artificial intelligence, especially within computer vision, for efficiently organizing and accessing extensive image databases. This systematic literature review employs the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) methodology to thoroughly analyze and synthesise the current research landscape in this domain. Through meticulous research and a stringent selection process, this study uncovers significant trends, pioneering methodologies, and ongoing challenges in large-scale image indexing and retrieval. Key findings reveal a growing adoption of deep learning techniques, the integration of multimodal data to improve retrieval accuracy, and persistent challenges related to scalability and real-time processing. These insights offer a valuable resource for researchers and practitioners striving to enhance the efficiency and effectiveness of image indexing and retrieval systems.

Keywords: image indexing; image retrieval; similarity; PRISMA; computer vision

Research on speckle image reconstruction method based on deep learning

2024 International Conference on Virtual Reality, Image and Signal Processing, ICVISP 2024

Authors: Li, Xuandong; Chen, Musheng. Affiliations: College of Physics and Information Engineering, Quanzhou Normal University, Fuzhou, China; College of Photonic and Electronic Engineer, Fujian Normal University, Fuzhou, China

ISBN: (print) 9798400710926

Optical speckle image reconstruction is crucial in fields such as biomedical imaging, optical coherence tomography, and optical remote sensing. However, speckle images often suffer from noise and scattering, leading to diminished image quality. This paper introduces a deep learning-based approach to enhance the accuracy and quality of reconstructed speckle images. The process begins with the construction of an experimental optical setup using a spatial light modulator to generate the corresponding speckle patterns. A novel network model, named pix2pix, is then developed based on the generative adversarial network (GAN), integrating both generator and discriminator modules. The model is trained using both the collected speckle image data and real data. Extensive experiments have validated the effectiveness of our approach, showing significant improvements in reconstruction accuracy and computational efficiency. The results indicate that, compared to traditional methods, the deep learning-based technique greatly enhances image clarity, effectively reduces noise, and demonstrates superior generalization in image reconstruction. © 2024 Copyright held by the owner/author(s).

Keywords: deep learning

Autonomous Landing on a Moving Platform Using Vision-Based Deep Reinforcement Learning

IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, Vol. 9, No. 5, pp. 4575-4582

Authors: Ladosz, Pawel; Mammadov, Meraj; Shin, Heejung; Shin, Woojae; Oh, Hyondong. Affiliations: Univ Manchester, Dept Mech Aerosp & Civil Engn, Manchester M13 9PL, England; Ulsan Natl Inst Sci & Technol, Dept Mech Engn, Ulsan 44610, South Korea

This letter describes autonomous landing of an unmanned aircraft system on a moving platform using vision and deep reinforcement learning. Landing on the moving platform offers several benefits, such as more mission flexibility and reduced flight time. In particular, the end-to-end vision approach (i.e., an input to the reinforcement learning is a raw image from the camera) with the deep regularized Q algorithm and custom designed reward is utilized. The custom reward was specifically devised to encourage useful feature extraction from the state space. Additionally, the proposed reinforcement learning algorithm has full 3D velocity control including the vertical channel. The simulation results show that the proposed approach can outperform existing approaches which use high-level extracted features (such as relative position and velocity of the landing pad). The simulation results are then successfully transferred to the real-world experiment by utilizing domain randomization.

Keywords: AI-enabled robotics; aerial systems: applications; reinforcement learning; vision-based navigation

Hybrid ultra-short term solar irradiation forecasting using resource-efficient multi-step long-short term memory

RENEWABLE ENERGY, 2025, Vol. 247

Authors: Barancsuk, Lilla; Groma, Veronika; Kocziha, Barnabas. Affiliations: Budapest Univ Technol & Econ, Dept Elect Power Engn, Muegyet Quay 3, H-1111 Budapest, Hungary; HUN REN Ctr Energy Res, Konkoly Thege Miklos St 29-33, H-1121 Budapest, Hungary

Accurate forecasting of solar irradiance is a key tool for optimizing the efficiency and service quality of solar energy systems. In this paper, a novel approach is proposed for multi-step solar irradiation forecasting using deep learning models optimized for low computational resource environments. Traditional forecasting models often lack accuracy, and modern, deep-learning based models, while accurate, require substantial computational resources, making them impractical for real-time or resource-constrained environments. Our method uniquely combines dimensionality reduction via image processing with an LSTM-based architecture, achieving significant input data reduction by a factor of 4250 while preserving essential sky condition information, resulting in a lightweight neural network architecture that balances prediction accuracy with computational efficiency. The forecasts are generated simultaneously for multiple time steps: 1 minute, 5 minutes, 10 minutes and 20 minutes. Models are evaluated against a custom dataset, spanning across more than 3 years, containing 1 min samples encompassing both all-sky imagery and meteorological measurements. The approach is demonstrated to achieve better forecasting accuracy, namely a forecast skill of 10 % compared to persistence, and a significantly reduced computational overhead compared to benchmark ConvLSTM models. Moreover, utilizing the preprocessed image features reduces input size by a factor of 6 compared to the raw images. Our findings suggest that the proposed models are well-suited for deployment in embedded systems, remote sensors, and other scenarios where computational resources are limited.
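The abstract reports a forecast skill relative to persistence without defining it. A widely used definition is one minus the ratio of the model RMSE to the persistence RMSE, and the sketch below assumes that definition. The function names and the sample irradiance values are hypothetical.

```python
import numpy as np

def rmse(pred, actual):
    """Root mean squared error between two equal-length sequences."""
    return float(np.sqrt(np.mean((np.asarray(pred) - np.asarray(actual)) ** 2)))

def persistence_forecast(series, horizon):
    """Forecast each value as the observation `horizon` steps earlier."""
    return series[:-horizon]

def forecast_skill(model_pred, series, horizon):
    """Skill = 1 - RMSE(model) / RMSE(persistence); 0 means no gain over persistence."""
    actual = series[horizon:]
    baseline = persistence_forecast(series, horizon)
    return 1.0 - rmse(model_pred, actual) / rmse(baseline, actual)

# Hypothetical irradiance samples (W/m^2) at 1-minute spacing.
series = np.array([100.0, 110.0, 130.0, 120.0, 140.0])
# Hypothetical model output for the last four samples (1-step horizon).
model_pred = np.array([105.0, 125.0, 125.0, 135.0])

skill = forecast_skill(model_pred, series, horizon=1)
print(f"forecast skill: {skill:.3f}")
```

Under this definition, a skill of 10 % would mean the model's RMSE is 10 % lower than that of the persistence baseline.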

Keywords: Solar irradiation forecast; Multistep forecasting; Deep learning; LSTM; Image features; Resource-efficient; Total sky imager

Enhancing Contrastive Learning With Positive Pair Mining for Few-Shot Hyperspectral Image Classification

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, Vol. 17, pp. 8509-8526

Authors: Braham, Nassim Ait Ali; Mairal, Julien; Chanussot, Jocelyn; Mou, Lichao; Zhu, Xiao Xiang. Affiliations: Tech Univ Munich TUM, Chair Data Sci Earth Observat SiPEO, D-80333 Munich, Germany; German Aerosp Ctr DLR, Remote Sensing Technol Inst IMF, D-82234 Wessling, Germany; Univ Grenoble Alpes, Inria, CNRS, Grenoble INPLJK, F-38000 Grenoble, France

In recent years, deep learning has emerged as the dominant approach for hyperspectral image (HSI) classification. However, deep neural networks require large annotated datasets to generalize well. This limits the applicability of deep learning for real-world HSI classification problems, as manual labeling of thousands of pixels per scene is costly and time consuming. In this article, we tackle the problem of few-shot HSI classification by leveraging state-of-the-art self-supervised contrastive learning with an improved view-generation approach. Traditionally, contrastive learning algorithms heavily rely on hand-crafted data augmentations tailored for natural imagery to generate positive pairs. However, these augmentations are not directly applicable to HSIs, limiting the potential of self-supervised learning in the hyperspectral domain. To overcome this limitation, we introduce two positive pair-mining strategies for contrastive learning on HSIs. The proposed strategies mitigate the need for high-quality data augmentations, providing an effective solution for few-shot HSI classification. Through extensive experiments, we show that the proposed approach improves accuracy and label efficiency on four popular HSI classification benchmarks. Furthermore, we conduct a thorough analysis of the impact of data augmentation in contrastive learning, highlighting the advantage of our positive pair-mining approach.

Keywords: Contrastive learning; hyperspectral image (HSI) classification; positive pair mining; self-supervised learning

AI Enabled Threat Detection: Leveraging Artificial Intelligence for Advanced Security and Cyber Threat Mitigation

IEEE ACCESS, 2024, Vol. 12, pp. 173127-173136

Authors: Dhanushkodi, Kavitha; Thejas, S. Affiliations: Vellore Inst Technol, Sch Comp Sci & Engn, Chennai 600127, India

This comprehensive review examines the role of artificial intelligence (AI) in enhancing threat detection and cybersecurity, focusing on recent advancements and ongoing challenges in this dynamic field. The ability to identify and counteract cybersecurity threats including network breaches, adversarial assaults, and zero-day vulnerabilities has significantly increased with the inclusion of AI, especially machine learning and deep learning techniques. The review underscores the critical role of explainability and resilience in AI models to ensure trustworthiness and reliability in AI-driven security solutions. The studies analyzed span a wide range of sectors, including Industry 5.0, the Internet of Things (IoT), 5G networks, and autonomous vehicles, illustrating AI's adaptability in tackling unique security issues across these domains. Cutting-edge approaches, such as transformer-based models, federated learning, and blockchain integration, are advancing the development of more robust and real-time threat detection systems. However, challenges persist, particularly in managing large-scale data, enabling real-time processing, and ensuring privacy and security. The review concludes that although substantial progress has been achieved, ongoing research and collaboration are vital to fully harness AI's potential in securing digital landscapes.

Keywords: Artificial intelligence; Computer security; Data models; Threat assessment; Adaptation models; Internet of Things; Deep learning; Accuracy; Complexity theory; Analytical models; Intrusion detection; Blockchains; Zero-day vulnerabilities; network intrusion detection; federated learning; blockchain; Internet of Things (IoT); adversarial attacks

Learning to Control Camera Exposure via Reinforcement Learning

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Lee, Kyunghyun; Shin, Ukcheol; Lee, Byeong-Uk. Affiliations: LG AI Res, Seoul, South Korea; CMU, Pittsburgh, PA, USA; KRAFTON, Seoul, South Korea

ISBN: (print) 9798350353013; 9798350353006

Adjusting camera exposure in arbitrary lighting conditions is the first step to ensure the functionality of computer vision applications. Poorly adjusted camera exposure often leads to critical failure and performance degradation. Traditional camera exposure control methods require multiple convergence steps and time-consuming processes, making them unsuitable for dynamic lighting conditions. In this paper, we propose a new camera exposure control framework that rapidly controls camera exposure while performing real-time processing by exploiting deep reinforcement learning. The proposed framework consists of four contributions: 1) a simplified training ground to simulate real-world's diverse and dynamic lighting changes, 2) flickering and image attribute-aware reward design, along with lightweight state design for real-time processing, 3) a static-to-dynamic lighting curriculum to gradually improve the agent's exposure-adjusting capability, and 4) domain randomization techniques to alleviate the limitation of the training ground and achieve seamless generalization in the wild. As a result, our proposed method rapidly reaches a desired exposure level within five steps with real-time processing (1ms). Also, the acquired images are well-exposed and show superiority in various computer vision tasks, such as feature extraction and object detection.

Keywords: auto exposure control; reinforcement learning
