Accurate classification and identification of vessels in remote sensing satellite imagery is critical for ocean monitoring and resource management. The ability to extract information from remote-sensing data is of par...
详细信息
ISBN:
(纸本)9798350350920
Accurate classification and identification of vessels in remote sensing satellite imagery is critical for ocean monitoring and resource management. The ability to extract information from remote-sensing data is of paramount importance. To exploit the non-stationary characteristics of synthetic aperture radar (SAR) target, a comprehensive SAR ship recognition framework is designed by combing the second-order synchrosqueezing transform (SST), an effective non-stationary signal processing tool, with the histogram of oriented gradient (HOG) feature in this paper. Firstly, the second-order SST is performed on SAR images to describe the non-stationary characteristics of ships at different times and frequencies. Secondly, HOG features are utilized to effectively extract the non-stationary information of SAR ships and provide more discriminative input for the deeplearning network. Then, the optimal ResNet model is selected as the convolutional neural network (CNN) classifier to automatically fuse the non-stationary features and abstract features of SAR ships. Experiments on two open SAR ship datasets (OpenSARShip and FUSAR-Ship) show that the proposed method achieves accurate classification and outperforms the state-of-the-art (SOTA) CNN-based methods in terms of robustness and generalization ability. The positive effect of non-stationary characteristics on SAR ship classification is verified.
Dual-fisheye photos are the most efficient and economical way for 3D image/video display and VR applications. The photos are captured by camera with fisheye lens and stored in left and right views individually, and th...
详细信息
American Sign Language (ASL), a visual language utilizing hand gestures, facial expressions, and body movements, remains less recognized than spoken languages, resulting in communication challenges between deaf and he...
详细信息
The main focus of this paper is to address the obstacle avoidance and path planning challenges in Unmanned Ground Vehicle (UGV) cluster navigation tasks within unknown environments of a certain scale, proposing a Mult...
详细信息
Data imbalance is a common problem in breast cancer diagnosis, to address this challenge, the research explores the use of Generative Adversarial Networks (GANs) to generate synthetic medical data. Various GAN methods...
详细信息
Data imbalance is a common problem in breast cancer diagnosis, to address this challenge, the research explores the use of Generative Adversarial Networks (GANs) to generate synthetic medical data. Various GAN methods, including Wasserstein GAN with Gradient Penalty (WGAN-GP), Cycle GAN, Conditional GAN, and Spectral Normalization GAN (SNGAN), were tested for data augmentation in breast regions of interest (ROIs) using mammography and ultrasound databases. The study employed real, synthetic, and hybrid ROIs (128x128 pixels) to train a Resnet network for classifying as benign (B) or malignant (M) classes. The quality and diversity of the synthetic data were assessed using several metrics: Fre chet Inception Distance (FID), Kernel Inception Distance (KID), Structural Similarity Index (SSIM), Multi -Scale SSIM (MS-SSIM), Blind Reference image Spatial Quality Evaluator (BRISQUE), Naturalness image Quality Evaluator (NIQE), and Perception -based image Quality Evaluator (PIQE).Results revealed that the SNGAN model (FID = 52.89) was most effective for augmenting mammography data, while CGAN (FID = 116.03) excelled with ultrasound data. Cycle GAN and WGAN-GP, though demonstrating lower KID values, did not perform better than SNGAN and CGAN. The lower average MS-SSIM values suggested that SNGAN and CGAN produced a high diversity of synthetic images. However, lower SSIM, BRISQUE, NIQE, and PIQE values indicated poor quality in both real and synthetic images. Classification results showed high accuracy without data augmentation in both US (93.1 %B/94.9 %M) and mammography (80.9 %B/76.9 %M). The research concludes that preprocessing and characterizing ROIs by abnormality type is crucial to generate diverse synthetic data and improve accuracy in the classification process using combined GANs and CNN models.
We study the problem of online learning of optimal offloading policies for imageprocessing tasks, for minimizing a cost that is weighted sum of transmit energy and object recognition error rate. A mobile node generat...
详细信息
ISBN:
(纸本)9781538674628
We study the problem of online learning of optimal offloading policies for imageprocessing tasks, for minimizing a cost that is weighted sum of transmit energy and object recognition error rate. A mobile node generates imageprocessing tasks that involve object recognition. There exist three options: (i) transmit the image to a remote server for processing with a deep-learning (DL) model, (ii) process locally with a simpler model, (iii) apply a lightweight, error-prone technique for object detection, and if objects are detected, then send image to the server. The proper offloading decision requires knowledge of the transmit energy cost and object recognition error rate for each option. However, these processes are non-stationary due to unpredictable object occurrence, mobility and propagation dynamics, and they depend on the object inference result which is unknown at decision time. We cast the problem as an adversarial multi-armed bandit, in which the EXP3 algorithm achieves sublinear regret. For the constrained problem, we propose an algorithm that extends EXP3 and achieves good regret in the objective and constraint, thus asymptotically learning the optimal static randomized offloading policy, while satisfying the error constraint. Performance is validated via numerical experiments informed by real-life object recognition measurements and models.
The majority of tropical and subtropical nations in the world eat rice as their main meal. This involves hectare- sized paddy fields, whose upkeep and care becomes a tiresome Undertaking for the farmers. The caregiver...
详细信息
Understanding the variations in soil fertility and crop growth across time and geography is crucial for understanding the agricultural environment. Satellite and unmanned aerial remote sensing are the two main types o...
详细信息
Surface waves represent low-frequency regular interference waves in onshore seismic exploration, exerting a significant influence on the seismic data processing quality. Despite the classic method for surface wave sup...
详细信息
Container cranes are of key importance for maritime cargo transportation. The uninterrupted and all-day operation of these container cranes, which directly affects the efficiency of the port, necessitates the continuo...
详细信息
Container cranes are of key importance for maritime cargo transportation. The uninterrupted and all-day operation of these container cranes, which directly affects the efficiency of the port, necessitates the continuous inspection of these massive hoisting steel structures. Due to the large size of cranes, the current manual inspections performed by expert climbers are costly, risky, and time-consuming. This motivates further investigations on automated non-destructive approaches for the remote inspection of fatigue-prone parts of cranes. In this paper, we investigate the effectiveness of color space-based and deeplearning-based approaches for separating the foreground crane parts from the whole image. Subsequently, three different ML-based algorithms (k-Nearest Neighbors, Random Forest, and Naive Bayes) are employed to detect the rust and repainting areas from detected foreground parts of the crane body. Qualitative and quantitative comparisons of the results of these approaches were conducted. While quantitative evaluation of pixel-based analysis reveals the superiority of the k-Nearest Neighbors algorithm in our experiments, the potential of Random Forest and Naive Bayes for region-based analysis of the defect is highlighted.
暂无评论