检索结果-内蒙古大学图书馆

UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science 2024年第2期86卷 211-230页

作者： Yan, Shaokui Ding, Haili Zhang, Jie Wang, Xiangwei Wang, Mingqiang Yinchuan China Comarvel Intelligent Technology Company Limited Yantai China

In order to realize the dynamic management of goods in the warehouse and improve the overall management efficiency and safety performance of the warehouse, a deep learning and target detection technology for remote real-time monitoring and tracking of the stereo warehouse is proposed. Wireless sensor nodes are used to provide all-round coverage of the three-dimensional warehouse to meet the monitoring needs of various parameters such as cargo status and cargo location, and the image data of the three-dimensional warehouse is trained by deep confidence networks to extract high-quality and distinguished features of the three-dimensional warehouse. The hybrid Gaussian model is used to accurately locate and identify the target warehouse, determine the location and trajectory of warehouse objects, and finally introduce a priori and measurement information by virtue of Kalman filtering method to realize real-time monitoring and tracking of the stereo warehouse. The results show that the AUC value of the complex monitoring attributes of the proposed method is as high as 0.93 after the optimization process, and the monitoring accuracy is as high as 98.43%, and the monitoring time is short, and the predicted value of the center of mass coordinates of the warehouse is the same as the real value of the actual coordinates, which indicates that the proposed method can ensure the comprehensiveness and real-time monitoring and tracking, and realize the intelligence and efficiency of the warehouse management. © 2024, Politechnica University of Bucharest. All rights reserved.

关键词： Target tracking

来源：评论

学校读者我要写书评

暂无评论

deep multi-convolutional stacked capsule network fostered human gait recognition from enhanced gait energy image

引用

SIGNAL image AND VIDEO processing 2024年第2期18卷 1375-1382页

作者： Nithyakani, P. Ferni Ukrit, M. SRM Inst Sci & Technol Dept Comp Technol Kattankulathur 603203 Tamil Nadu India SRM Inst Sci & Technol Dept Computat Intelligence Kattankulathur 603203 Tamil Nadu India

Gait recognition is a well-known biometric identification technology and is widely employed in different fields. Due to the advantages of deep learning, such as self-learning capability, high accuracy and excellent generalization ability, various deep network algorithms have been applied in biometric recognition. Numerous studies have been conducted in this area;however, they may not always yield the expected outcomes owing to the issue of data imbalance in clinical and healthcare industries. To overcome this problem, deep multi-convolutional stacked capsule network fostered human gait recognition from enhanced gait energy image (HGR-DMCSCN) is proposed in this manuscript. Initially, the input images are taken from CASIA B and OU-ISIR datasets. Then the input images are given to preprocessing segment to enhance the superiority of the images based upon contrast-limited adaptive histogram equalization filtering (CLAHEF). Then preprocessed image is given to classification process using deep multi-convolutional stacked capsule network (DMCSCN) that is utilized for human gait detection under various conditions, like normal walking, carrying a bag and wearing a cloth. The proposed HGR-DMCSCN approach is executed in python and its performance is examined under performance metrics, such as F-Score, accuracy, RoC and computational time. Finally, the proposed approach attains 28.70%, 11.87% and 14.79% higher accuracy for CASIA B compared with existing methods.

关键词： Convolutional neural network Capsule network deep learning Gait energy image Gait biometric recognition

来源：评论

学校读者我要写书评

暂无评论

real-time sports injury monitoring system based on the deep learning algorithm

引用

BMC MEDICAL IMAGING 2024年第1期24卷 1页

作者： Ren, Luyao Wang, Yanyan Li, Kaiyong Nanjing Forestry Univ Dept Phys Educ Nanjing 210037 Jiangsu Peoples R China Beijing Foreign Studies Univ Dept Phys Educ Beijing 100089 Peoples R China Qinghai Nationalities Univ Coll Phys & Elect Informat Engn Xining 810007 Qinghai Peoples R China

In response to the low real-time performance and accuracy of traditional sports injury monitoring, this article conducts research on a real-time injury monitoring system using the SVM model as an example. Video detection is performed to capture human movements, followed by human joint detection. Polynomial fitting analysis is used to extract joint motion patterns, and the average of training data is calculated as a reference point. The raw data is then normalized to adjust position and direction, and dimensionality reduction is achieved through singular value decomposition to enhance processing efficiency and model training speed. A support vector machine classifier is used to classify and identify the processed data. The experimental section monitors sports injuries and investigates the accuracy of the system's monitoring. Compared to mainstream models such as Random Forest and Naive Bayes, the SVM utilized demonstrates good performance in accuracy, sensitivity, and specificity, reaching 94.2%, 92.5%, and 96.0% respectively.

关键词： Sports injury monitoring deep learning algorithms Machine learning Medical applications

来源：评论

学校读者我要写书评

暂无评论

Enhanced Face Identification Performance Using Online Mining Strategy in Multi-Task Cascaded Mask Convolutional Networks

引用

TRAITEMENT DU SIGNAL 2025年第1期42卷 143-152页

作者： Mony, Krishnaraj Raj, Jeberson Retna Sathyabama Inst Sci & Technol Sch Comp Dept Comp Sci & Engn Chennai 600119 Tamil Nadu India

Due to the variety of lighting, postures, and occlusions, symmetry of faces and identification in an unrestricted area are difficult. The latest study demonstrates that deep learning techniques can do remarkably well on these two challenges. The complex transmitted multitask structure the developers provide in this research takes advantage of the natural relationship between them to improve efficiency. The suggested Multi-task Cascaded Mask Convolutional Network (MTCMCN) has three layers of carefully planned deep convolution networks that work together to figure out where faces and landmarks are from a wide range of angles. Additionally, they provide a novel, continuous, difficult sample mining approach for learning procedures, which may automatically boost efficiency without the manual choice of samples. The use of a sizable cross-age image collection containing gender and age descriptors advances the creation of Age-Invariant Face Recognition (AIFR) and FAS. MTCMCN outperforms existing methods by achieving state-of-the-art accuracy on benchmarks like FDDB and WIDER FACE, exceeding 95% accuracy in some cases. It has a Central processing Unit (CPU) speed of 16 frames per second and a GPU speed of 99 frames per second, ensuring real-time performance. The proposed system achieves this by using a special identification conditional block and live hard sample mining, thereby improving face recognition regardless of age.

关键词： Age-Invariant Face Recognition (AIFR) cross-age image collection deep learning face detection hard sample mining task Cascaded Mask Convolutional Network (MTCMCN) real-time performance symmetry and occlusion handling

来源：评论

学校读者我要写书评

暂无评论

learning global and local features of power load series through transformer and 2D-CNN: An image-based multi-step forecasting approach incorporating phase space reconstruction

引用

APPLIED ENERGY 2024年 378卷

作者： Tang, Zihan Ji, Tianyao Kang, Jiaxi Huang, Yunlin Tang, Wenhu South China Univ Technol Sch Elect Power Engn Guangzhou 510640 Peoples R China

As modern power systems continue to evolve, accurate power load forecasting remains a critical issue in energy management. The phase space reconstruction (PSR) method can effectively retain the inner chaotic property of power load from a system dynamics perspective and thus is a promising knowledge-based method for power load forecasting. To fully leverage the PSR method's capability in modeling this high-dimensional, non-stationary characteristics of power load data, and to address the challenges faced by its classical mathematical prediction algorithms ineffectively solving contemporary prediction scenarios characterized by massive volumes of data. This study proposes a novel learning-based multi-step forecasting approach that utilizes an image-based modeling perspective for the reconstructed phase trajectories. Firstly, the feature engineering approach that simultaneously utilizes dynamic evolution features and temporal locality features in the trajectory image is proposed. Through mathematical derivation, the equivalent characterization of the PSR method and another time series modeling approach, patch segmentation (PS), is demonstrated for the first time. Building on this prior knowledge, a novel image-based modeling perspective incorporating a global and local feature extraction strategy is introduced to fully leverage these valuable features. Subsequently, within this framework, a novel deep learning model, termed PSR-GALIEN, is designed for end-to-end processing. This model employs a Transformer Encoder and 2D convolutional neural networks (CNNs) to extract global and local patterns from the image, while a multi-layer perceptron (MLP)-based predictor is utilized for efficient correlation modeling. Extensive experiments on five real-world datasets show that PSR-GALIEN consistently outperforms six state-of-the-art deep learning models in short-term load forecasting scenarios with varying characteristics, demonstrating its great robustness. Ablation studies fur

关键词： Multi-step power load forecasting Phase space reconstruction image-based modeling perspective Global and local feature extraction Feature interpretation

来源：评论

学校读者我要写书评

暂无评论

deep learning-based recognition of real and artificial images

Deep learning-based recognition of real and artificial image...

引用

2024 International Conference on image processing and Artificial Intelligence, ICIPAl 2024

作者： Li, Yuchong Cao, Jinxuan Cai, Muqing Southampton Ocean Engineering Joint Institute Harbin Engineering University No. 145 Nantong Street Nangang District Heilongjiang Province Harbin City China

ISBN: (纸本)9781510681514

In response to the challenge of effectively identifying artificially generated images from real ones, this paper proposes a deep learning-based approach for authenticating images. The proposed method utilizes a combination of convolutional neural networks (CNN) and generative adversarial networks (GANs) to compare and analyze various indicators of images. Experimental results demonstrate that deep learning algorithms can significantly improve the accuracy and reliability of image authenticity identification. The proposed method has significant implications for protecting intellectual property rights and ensuring public safety. The research contributes to the advancement of computer vision and image processing fields and underscores the need for continued efforts to address the challenges posed by artificial intelligence and image generation technology. © 2024 SPIE.

关键词： Generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

real time Pothole Detection System Using deep learning Techniques: A Systematic Review 4

Real Time Pothole Detection System Using Deep Learning Techn...

引用

4th International Conference on Ubiquitous Computing and Intelligent Information Systems, ICUIS 2024

作者： Saraswathi, S. Vigneshwari, S. Sathyabama Institute of Science and Technology Department of Computer Science and Engineering Tamil Nadu Chennai India

ISBN: (纸本)9798331529635

real-time pothole detection serves a crucial role in maintaining road infrastructure and ensuring the safety of drivers. Traditional methods for identifying potholes are often labour-intensive and inefficient, prompting the exploration of automated techniques using deep learning. This comprehensive review aims to synthesize the current state of research on pothole detection systems leveraging deep learning algorithms. By examining a range of studies published over the past decade, the review highlights various methodologies, including You Only Look Once (YOLO), convolutional neural networks (CNNs), deep CNN, and hybrid models, as well as the different datasets and image processing techniques employed. The review categorizes the approaches based on their accuracy, computational efficiency, and practical implementation challenges. The review also identifies research gaps in the literature survey, such as the demand for standardized datasets and real-time processing capabilities, and suggests directions for future research. By providing a systematic overview of deep learning-based pothole detection systems, the purpose of this review is to help researchers and practitioners develop more reliable and scalable solutions for maintaining road infrastructure. © 2024 IEEE.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Development of deep learning method for automatic seismic first break picking 4

Development of deep learning method for automatic seismic fi...

引用

4th International Meeting for Applied Geoscience and Energy, image 2024

作者： Farkhutdinov, Albert Malikov, Ruslan Shahsenov, Izat WAVERITY Azerbaijan

A novel methodology for first break picking (FBP) based on deep learning algorithms is proposed in this paper. The goal of this study is to automate FBP by application of neural networks trained on synthetic seismic data that comprehensively mimics and describes the target real data. The effectiveness of the proposed approach is tested and validated on real seismic data from open sources and a detailed description of the obtained results is provided. The results of this study open up promising potential for improving the accuracy of FBP outcome and significantly reducing processing time. © 2024 Society of Exploration Geophysicists and the American Association of Petroleum Geologists.

关键词： deep reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

A Comprehensive study on the different types of soil desiccation cracks and their implications for soil identification using deep learning techniques

引用

EUROPEAN PHYSICAL JOURNAL E 2024年第9期47卷 1-14页

作者： Daimari, Emanual Ratna, Sai Mouli, P. V. S. S. R. Chandra Madhurima, V. Cent Univ Tamil Nadu Dept Phys Thiruvarur 610005 Tamil Nadu India Cent Univ Tamil Nadu Dept Comp Sci Thiruvarur 610005 Tamil Nadu India

Rapid drying of soil leads to its fracture. The cracks left behind by these fractures are best seen in soils such as clays that are fine in the texture and shrink on drying, but this can be seen in other soils too. Hence, different soils from the same region show different characteristic desiccation cracks and can thus be used to identify the soil type. In this paper, three types soils namely clay, silt, and sandy-clay-loam from the Brahmaputra river basin in India are studied for their crack patterns using both conventional studies of hierarchical crack patterns using Euler numbers and fractal dimensions, as well as by applying deep-learning techniques to the images. Fractal dimension analysis is found to be an useful pre-processing tool for deep learning image analysis. Feed forward neural networks with and without data augmentation and with the use of filters and noise suggest that data augmentation increases the robustness and improves the accuracy of the model. Even on the introduction of noise, to mimic a real-life situation, 92.09% accuracy in identification of soil was achieved, proving the combination of conventional studies of desiccation crack images with deep learning algorithms to be an effective tool for identification of real soil types.

关键词： Fractal dimension

来源：评论

学校读者我要写书评

暂无评论

Detecting coagulation time in cheese making by means of computer vision and machine learning techniques

引用

COMPUTERS IN INDUSTRY 2025年 164卷

作者： Loddo, Andrea Di Ruberto, Cecilia Armano, Giuliano Manconi, Andrea Univ Cagliari Dept Math & Comp Sci I-09124 Cagliari Italy CNR Inst Biomed Technol Natl Res Council Via Fratelli Cervi 93 I-20054 Milan Italy

Cheese production, a globally cherished culinary tradition, faces challenges in ensuring consistent product quality and production efficiency. The critical phase of determining cutting time during curd formation significantly influences cheese quality and yield. Traditional methods often struggle to address variability in coagulation conditions, particularly in small-scale factories. In this paper, we present several key practical contributions to the field, including the introduction of CM-IDB, the first publicly available image dataset related to the cheese-making process. Also, we propose an innovative artificial intelligence-based approach to automate the detection of curd-firming time during cheese production using a combination of computer vision and machine learning techniques. The proposed method offers real-time insights into curd firmness, aiding in predicting optimal cutting times. Experimental results show the effectiveness of integrating sequence information with single image features, leading to improved classification performance. In particular, deep learning-based features demonstrate excellent classification capability when integrated with sequence information. The study suggests the suitability of the proposed approach for integration into real-time systems, especially within dairy production, to enhance product quality and production efficiency.

关键词： image processing Computer vision Machine learning Food industry Curd-firming time detection

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：