Enhancing Vehicle Analysis with fine Estimation on video streams time object detection and tracking, and OCR for license plate recognition. By leveraging YOLO, the project extracts precise vehicle information from vid...
详细信息
With the rapid evolution of AI Generated Content (AIGC), forged images produced through this technology are inherently more deceptive and require less human intervention compared to traditional Computer-generated Grap...
详细信息
ISBN:
(纸本)9798350300673
With the rapid evolution of AI Generated Content (AIGC), forged images produced through this technology are inherently more deceptive and require less human intervention compared to traditional Computer-generated Graphics (CG). However, owing to the disparities between CG and AIGC, conventional CG detection methods tend to be inadequate in identifying AIGC-produced images. To address this issue, our research concentrates on the text-to-image generation process in AIGC. Initially, we first assemble two text-to-image databases utilizing two distinct AI systems, DALL center dot E2 and DreamStudio. Aiming to holistically capture the inherent anomalies produced by AIGC, we develope a robust dual-stream network comprised of a residual stream and a content stream. The former employs the Spatial Rich Model (SRM) to meticulously extract various texture information from images, while the latter seeks to capture additional forged traces in low frequency, thereby extracting complementary information that the residual stream may overlook. To enhance the information exchange between these two streams, we incorporate a cross multi-head attention mechanism. Numerous comparative experiments are performed on both databases, and the results show that our detection method consistently outperforms traditional CG detection techniques across a range of image resolutions. Moreover, our method exhibits superior performance through a series of robustness tests and cross-database experiments. When applied to widely recognized traditional CG benchmarks such as SPL2018 and DsTok, our approach significantly exceeds the capabilities of other existing methods in the field of CG detection.
Memristors, also known as variable resistors, are often used in research in fields such as neural networks, nonlinear systems, and digital circuits due to their memory, resistivity, and nonlinear characteristics. Beca...
详细信息
This paper discusses key challenges of data processing in the field of artificial intelligence (AI), specifically in dealing with unstructured data and adapting to market changes. We propose a novel AI risk assessment...
详细信息
The article studies the adaptation of literary works to animated films. Discuss the importance of the adaptation of literary works to the creation of animated films. A new method of using the Poly (polygonal geometric...
详细信息
Low-light images have low contrast and unclear details, resulting in the reduction of available information for human vision. The current mainstream enhancement algorithms have problems such as noise amplification, co...
详细信息
With the rapid development of computer vision and imageprocessing technology, scene imageprocessing under particular weather conditions has become an important research direction, especially in foggy conditions of t...
详细信息
Most of the existing network slicing methods only consider a single attribute of network nodes and ignore the overall network attributes, however, due to the different characteristics of network resources, complex nod...
详细信息
Due to the improvement in the car manifacture, the rate of road traffic accidents is increasing. To solve these problems, there is loads of attention in research on the development of driver assistance systems, where ...
详细信息
ISBN:
(纸本)9783031538292;9783031538308
Due to the improvement in the car manifacture, the rate of road traffic accidents is increasing. To solve these problems, there is loads of attention in research on the development of driver assistance systems, where the main innovation is traffic sign recognition (TSR). In this article, a special convolutional neural network model with high accuracy compared to traditional models is used for TSR. The Uzbek Traffic Sign Dataset (UTSD) applied in the zone of Uzbekistan was created, consisting of 21.923 images belonging to 56 classes. We proposed a parallel computing method for real-time processing of video haze removal. Our utilization can process the 1920 x 1080 video series with 176 frames per second for the dark channel prior (DCP) algorithm. 8.94 times reduction of calculation time compared to the Central processing Unit (CPU) was achieved by performing the TSR process on the Graphics processing Unit (GPU). The algorithms used to detect traffic signs are improved YOLOv5. The results showed a 3.9% increase in accuracy.
Due to the complexity of their structure and the particularity of their application environments, aircraft High-Voltage Direct Current (HVDC) systems are prone to faults, with inter-module failures complicating fault ...
详细信息
ISBN:
(数字)9798331541460
ISBN:
(纸本)9798331541477
Due to the complexity of their structure and the particularity of their application environments, aircraft High-Voltage Direct Current (HVDC) systems are prone to faults, with inter-module failures complicating fault diagnosis. To address this issue, a fault diagnosis method for HVDC systems has been developed. Feature extraction methods were designed for the rectifier, BUCK converter, and inverter, respectively, with the sum-to-amplitude ratio of signals selected as a feature for the rectifier; multi-scale skewness was proposed for the BUCK converter; and the ratio of the signal's average to peak absolute value was chosen for the inverter. Subsequently, the PSO-LightGBM algorithm was proposed, which employs the LightGBM algorithm for classification and utilizes a particle swarm algorithm to optimize the parameters of the LightGBM, establishing the optimal model. The experimental results demonstrate that the proposed method can accurately achieve fault diagnosis in HVDC systems.
暂无评论