Object detection remains a critical challenge with extensive real-time applications, including autonomous vehicles, medical imaging, and surveillance systems. The field has experienced significant progress, particular...
详细信息
This work evaluates deeplearning segmentation models to propose a deforestation monitoring embedded system. The approach stands for environmental monitoring using remote sensing imagery, edge computing, and a deep le...
详细信息
Accurate geolocation of maritime objects in satellite imagery is challenging due to geometric distortions, atmospheric conditions, and sensor inaccuracies in low-Earth orbit satellites. This study presents a novel aut...
详细信息
Accurate geolocation of maritime objects in satellite imagery is challenging due to geometric distortions, atmospheric conditions, and sensor inaccuracies in low-Earth orbit satellites. This study presents a novel automatic identification system (AIS) data-guided geolocation correction method that integrates real-time AIS data with satellite imagery to rectify geolocation errors. The approach utilizes the GeoAISNet neural network, which enhances positional accuracy without relying on ground control points. By incorporating a modified YOLOv8 architecture with orientation parameters and the convolutional block attention module, detection performance improved significantly, achieving precision, recall, and F1 scores of 91.82%, 89.56%, and 90.67%, respectively. Ablation studies demonstrated the crucial impact of feature integration and attention mechanisms. Results indicate a mean average precision of 89%, with general cargo ships achieving 99.9% AP50. Localization accuracy saw a notable improvement, with root-mean-squared error reduced from 12 to 3 m, and layer normalization further enhanced stability, increasing precision, recall, and F1 scores to 94.23%, 92.67%, and 93.44%, respectively. The use of differential AIS data decreased maximum positional errors by 30%, achieving errors around 2 m. Computational efficiency was also enhanced, with processingtime reduced from 2 to 0.5 s per image. This method effectively addresses oil spills and non-AIS vessel detection, expanding maritime surveillance capabilities. The global training dataset, validated with data from the South China Sea, ensures the method's applicability across diverse conditions.
Accurate structural behavior interpretation via finite element models is often disrupted by uncertainties, while data-driven approaches can struggle with long datasets, complex fluctuations, and the omission of essent...
详细信息
Accurate structural behavior interpretation via finite element models is often disrupted by uncertainties, while data-driven approaches can struggle with long datasets, complex fluctuations, and the omission of essential spatio-temporal features. Additionally, these methods are limited by their reliance on interpolative predictions. This paper introduces a novel, model-free approach that integrates Moving Principal Component Analysis (MPCA), bidirectional gated recurrent units (biGRU), and attention mechanisms (AM) within an encoder-decoder (ED) architecture. MPCA reduces dimensional complexity, extracts spatial features, and consolidates them into new time-series data for subsequent analysis. The biGRU module captures past and future dependencies, while AM emphasizes most relevant information. Validated on a full-scale pedestrian bridge dataset, the presented MPCA-biGRU-AM model converges 19% faster than MPCA-GRU and reduces anomaly detection lag by 46-78%. Although its per-step processingtime (8 ms) slightly exceeds that of MPCA-GRU (3 ms), the model demonstrates greater robustness across diverse damage scenarios. These results highlight its potential for real-time structural health monitoring by effectively capturing spatio-temporal patterns with computational efficiency.
Detecting cracks are a great significance for the maintenance of the man-made buildings, and deeplearning methods such as semantic segmentation have greatly boosted this process in recent years. However, the existing...
详细信息
Detecting cracks are a great significance for the maintenance of the man-made buildings, and deeplearning methods such as semantic segmentation have greatly boosted this process in recent years. However, the existing crack segmentation methods often sacrifice feature resolution to achieve real-time inference speed which leads to poor performance, or use complex network module to improve the accuracy which leads to lower inference speed. In this paper, we propose a novel deep Crack Segmentation Network (DcsNet) that incorporates two feature extraction branches to achieve the balance of speed and accuracy. We first design a morphology branch (MB) to preserve the morphology information of scale invariance that consists of a lightweight convolution network, a pyramid pooling module (PPM), and an attention module (CSA). Meanwhile, a shallow detail branch (DB) with a small stride is constructed to supplement detailed information. Extensive experiments are conducted on five challenging datasets (Crack500, deepcrack, Gaps384, Structure, and Damcrack), and the results demonstrated that the proposed network achieves a good trade-off between accuracy and inference speed and outperforms state-of-the-art methods.
Wearable devices such as headphones are increasingly popular in people's lives, and there is an increasing focus on how to achieve continuous and reliable information input using these devices. However, due to con...
详细信息
ISBN:
(纸本)9798350363999;9798350364002
Wearable devices such as headphones are increasingly popular in people's lives, and there is an increasing focus on how to achieve continuous and reliable information input using these devices. However, due to constraints in computing power, low power consumption, and low operating frequencies, such devices often record and transmit signals at lower sampling rates, the resultant lower-quality signals often have catastrophic implications for system performance. Efficient real-time conversion of low-resolution speech signals to full-resolution high-quality signals using low-cost wearable sensors on edge devices is a challenging research endeavor. To address this, this paper designs TransFiLM, a mobile deeplearning network. It allows users to obtain full-resolution high-quality audio signals using low-cost wearable sensors on edge devices. TransFiLM integrates residual learning and super-resolution networks and employs effective signal processing strategies to achieve audio upscaling and noise reduction, significantly improving audio quality. We implement a prototype on commercial devices and conduct a series of experiments to evaluate its performance. Using signal-to-noise ratio (SNR) and log-spectral distance (LSD) as evaluation metrics, TransFiLM exhibits superior performance compared to other time-domain methods in cross-user, cross-corpus, and cross-noise environment testing. Additionally, our TransFiLM network handles 8192 samples with a response time of 181 ms, which meets the requirement to run in real-time on edge devices.
In streaming computing systems, flexible resource allocation for time-varying data is the key to ensure application and system performance. Traditional resource allocation methods and existing intelligent methods have...
详细信息
Four-dimensional magnetic resonance imaging (4D-MRI) is an emerging technique for tumor motion management in image-guided radiation therapy (IGRT). However, current 4D-MRI suffers from low spatial resolution and stron...
详细信息
Four-dimensional magnetic resonance imaging (4D-MRI) is an emerging technique for tumor motion management in image-guided radiation therapy (IGRT). However, current 4D-MRI suffers from low spatial resolution and strong motion artifacts owing to the long acquisition time and patients' respiratory variations. If not managed properly, these limitations can adversely affect treatment planning and delivery in IGRT. In this study, we developed a novel deeplearning framework called the coarse-super-resolution-fine network (CoSF-Net) to achieve simultaneous motion estimation and super-resolution within a unified model. We designed CoSF-Net by fully excavating the inherent properties of 4D-MRI, with consideration of limited and imperfectly matched training datasets. We conducted extensive experiments on multiple real patient datasets to assess the feasibility and robustness of the developed network. Compared with existing networks and three state-of-the-art conventional algorithms, CoSF-Net not only accurately estimated the deformable vector fields between the respiratory phases of 4D-MRI but also simultaneously improved the spatial resolution of 4D-MRI, enhancing anatomical features and producing 4D-MR images with high spatiotemporal resolution.
The Automatic UIDAI Details Extraction System extracts details from Aadhaar cards and stores them into a document. The system can be utilized in banking, in government agencies, at vaccination centers. The presented s...
详细信息
Lung cancer must be detected as early as possible, but it may be difficult with existing methods since they often depend on human judgment and outdated imageprocessing. These techniques take a lot of time and are pro...
详细信息
暂无评论