the proceedings contain 177 papers. the topics discussed include: short-term power load forecasting model based on EEMD-SE-ERCNN;dynamic low-rank adaptation based pruning algorithm for large language models;dual-input...
ISBN:
(纸本)9798350350890
the proceedings contain 177 papers. the topics discussed include: short-term power load forecasting model based on EEMD-SE-ERCNN;dynamic low-rank adaptation based pruning algorithm for large language models;dual-input dual-output underwater image enhancement with branch interaction;object detection model of YOLOv8-CSD for UAV images;SE-ResNet: recognition of ECG anomalies based on convolutional neural network with fused attention;predicting accuracy for quantized neural networks: an attention-based approach with single-image;improved self-attention for Spodoptera frugiperda larval instar stages identification;an improved YOLOv8 algorithm for rotating object detection in unmanned aerial vehicle remote sensing images;and a comprehensive data augmentation method for modern cigarette packaging defect detection.
Action recognition holds a key position in the field of video understanding. While methods based on deep learning have made significant strides in big data backgrounds, their application in few-shot scenarios for acti...
详细信息
ISBN:
(纸本)9798350385113;9798350385106
Action recognition holds a key position in the field of video understanding. While methods based on deep learning have made significant strides in big data backgrounds, their application in few-shot scenarios for action recognition still faces numerous challenges. this review delves deeply into over 20 methods in the few-shot action recognition (FSAR) area from recent years. Initially, following the research's developmental trajectory, we systematically categorize the current mainstream methods for few-shot action recognition into three perspectives: memory networks, metric learning, and data augmentation. In order to ensure a fair comparison, we examine the performance of several approaches on multiple datasets. Lastly, we offer insights into the future development directions of this field. this review is the first review in the FSAR domain, summarizing a wealth of research outcomes, techniques, and theories. It offers researchers a cohesive perspective and illuminates potential new avenues for innovative studies.
the occurrence of privacy breaches through screen peeping has highlighted the increasing importance of anti-peeping screen algorithms. In the traditional YOLOv7 algorithm, it may not accurately detect faces when indiv...
详细信息
Several approaches for operating bionic limbs are outlined in this paper. While patternrecognition and gesture classification have shown promise, they also foreshadow a number of difficulties due to inaccurate findin...
详细信息
To solve the problem of EEG signal on feature extraction and low recognition rate, we analyze the characteristics of EEG signal and propose an EEG signal analysis method based on common spatial pattern (CSP) and suppo...
详细信息
Rhythmic patternrecognition is crucial in music analysis and information retrieval, involving the identification and classification of rhythmic structures. this paper offers a detailed overview of advanced methods in...
详细信息
the identification of partial discharge types is of great significance for the diagnosis and evaluation of transformer insulation. Withthe rapid advancement of deep learning in fields like machine vision and image pr...
详细信息
ISBN:
(纸本)9798350377040;9798350377033
the identification of partial discharge types is of great significance for the diagnosis and evaluation of transformer insulation. Withthe rapid advancement of deep learning in fields like machine vision and image processing, applying deep learning algorithms to partial discharge type recognition has become a new trend in power transformer fault diagnosis. Firstly, this article established a simulation test platform for partial discharge of oil paper insulation in the laboratory, conducted long-term partial discharge tests using the constant voltage method, and obtained ultrasonic signals under typical discharge defect models. then a two-dimensional time-frequency spectrum was constructed, and a depthwise separable convolutional neural network model was used to classify the spectrum, thereby achieving identification of partial discharge types. the results indicate that there are differences in the time-frequency spectra of ultrasound signals among different defect models;the partial discharge identification model used in this article can ensure high accuracy while effectively reducing the size of the model and improving training efficiency;Compared with traditional machine learning models, the model used in this paper has a higher recognition accuracy, reflecting the superiority of convolutional neural networks in image recognition.
Concrete is widely used in numerous buildings, and timely detection of concrete defects is significant. Currently, the detection of concrete defects mainly relies on manual visual inspection methods. However, manual i...
详细信息
Human activity recognition (HAR) is utilized to monitor human postures and actions in various fields such as human-computer interaction, motion detection and analysis, and medical rehabilitation. Currently, there is a...
详细信息
the proceedings contain 27 papers. the topics discussed include: analysis and characterization of vehicular traffic in a low emission zone;an augmented reality-based proving ground vehicle-in-the-loop test platform;ne...
ISBN:
(纸本)9781643685045
the proceedings contain 27 papers. the topics discussed include: analysis and characterization of vehicular traffic in a low emission zone;an augmented reality-based proving ground vehicle-in-the-loop test platform;next place prediction model: a literature review;driver model using fuzzy logic for virtual validation;risky driving behaviors among motorcyclists in Ecuador: a study of associated factors;high-speed localization estimation method using lighting recognition in tunnels;a machine learning approach for efficient assignment of batch delivery tasks;real-time idling vehicles detection using combined audio-visual deep learning;assessment of methods to estimate the level of service on third-class roads;urban traffic flow predictions with impacts of weather and holidays;and data formats for communication between freight tram operator and forwarder.
暂无评论