Remote sensing image change detection, as one of the important branches of remote sensing technology, has been greatly developed and improved in recent years. The effectiveness of change detection is unstable, as deep...
详细信息
Remote sensing image change detection, as one of the important branches of remote sensing technology, has been greatly developed and improved in recent years. The effectiveness of change detection is unstable, as deep learning components exhibit a high degree of sensitivity to hyperparameters, necessitating substantial fine-tuning to achieve optimal outcomes. Additionally, large and intricate deep learning models are not appropriate for smaller-scale datasets. In this paper, a remote sensing image change detection method based on general deep forest module (GDFM) is proposed. In the proposed GDFM, the common remote sensing image change detection method is firstly used for preliminary detection to obtain the detection result to be optimized, and the optimized change detection result is then obtained by using the classification function of the deep forest. This work aims to achieve a general enhancement of the performance of existing change detection methods by utilizing the characteristics of multi-grained scanning in deep forest and the data classification capabilities of cascaded forests. At the same time, this model alleviates the problems of parameter tuning complexity and inapplicability to small datasets in additional deep learning modules. It is worth noting that this general method is not a direct splicing process, but combines the advantages of the two parts and uses the characteristics of the deep forest to achieve the goal. Experiments are conducted on multiple common remote sensing change detection models. The results show that the proposed GDFM has significant improvements in metrics, which include varying degrees of improvement in F1 score by around 1% -14%.
Crop diseases, as one of the major problems in global agricultural production, lead to crop yield reduction, death, and even total extinction, with serious impacts on farmers and the food supply. Traditionally, crop d...
详细信息
Crop diseases, as one of the major problems in global agricultural production, lead to crop yield reduction, death, and even total extinction, with serious impacts on farmers and the food supply. Traditionally, crop diseases are identified by visual inspection and based on the experience of farmers and agricultural experts, a method that not only consumes human resources but also has a certain degree of subjectivity and inaccuracy. The development of artificial intelligence technology successfully achieves real-time monitoring, automatic identification, and intelligent decision by combining the Internet of Things (IoT) technology and cloud computing technology. Herein, we proposed an EfficientNet Convolutional Group-Wise Transformer (EGWT) architecture. The local features of crop leaf images are extracted by EfficientNet convolution and then input into a group-wise transformer architecture. In the group-wise transformer process, the input features are divided into multiple groups. An attention mechanism is used within each group to calculate correlations between features. After calculating the intra-group attention, the output features of each group are stitched together to form the final output features. Our proposed model achieves 99.8% accuracy on the PlantVillage dataset, 86.9% accuracy on the cassava dataset, and 99.4% accuracy on the Tomato leaves dataset, with the least number of parameters 23.04M in the state-of-the-art convolutional combinatorial transformer hybrid model. The experimental results indicate that the proposed model has the best accuracy and optimal model complexity so far compared to other neural networks based on CNN, transformer, and the hybrid structure of CNN and transformer.
Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this article, we propose a new approach for the detection of data drift in black-box mode...
详细信息
Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this article, we propose a new approach for the detection of data drift in black-box models, which is based on Hellinger distance and feature extraction methods. The proposed approach is aimed at detecting data drift without knowing the architecture of the model to monitor, the dataset on which it was trained, or both. The article analyzes three different use cases to evaluate the effectiveness of the proposed approach, encompassing a variety of tasks including document segmentation, classification, and handwriting recognition. The use cases considered for the drift are adversarial assaults, domain shifts, and dataset biases. The experimental results show the efficacy of our drift detection approach in identifying changes in distribution under various training settings.
Deep learning-based computer vision technology has been extensively utilized to assist medical microscopic smear diagnoses, such as complete blood count, detecting mycobacterium tuberculosis in sputum samples, and ide...
详细信息
Deep learning-based computer vision technology has been extensively utilized to assist medical microscopic smear diagnoses, such as complete blood count, detecting mycobacterium tuberculosis in sputum samples, and identifying plasmodium in blood smears. Effective medical microscopic smear detection necessitates balancing both detection accuracy and model parameters, facilitating the application of the model on low-resource computing platforms and ensuring real-time detection capabilities. Furthermore, the model should demonstrate superior generalization capabilities on microscopic smears to meet various detection tasks. In this study, we propose YOLO-FMS, a lightweight and efficient model based on YOLOv5 for medical microscopic smear detection with a compact weight of 15.06M, which can resolve the challenge of the balance between detection accuracy and model parameters. Firstly, YOLO-FMS improves the performance of small-scale platelet and mycobacterium tuberculosis detection by adding a small target detection head. Secondly, A lightweight convolutional technique, GSConv, was introduced to make the symbolic ability of the lightweight convolutional as close to the vanilla convolutional as possible and to reduce the computational cost. Thirdly, the feature extraction ability of YOLO-FMS is enhanced by C3-B-CBAM and Tiny-SPPCSPC modules constructed by our proposed method. Comprehensive experiments prove that YOLO-FMS shows high detection accuracy, with 92.5% mAP on the BCCD dataset and 87.6% mAP on the Tuberculosis-Phonecamera dataset. Additionally, numerous verification experiments are conducted on the BCDD and BBBC041 datasets, and the results confirm the effectiveness and generalization capability of YOLO-FMS in the medical microscopic smear detection field. The codes of YOLO-FMS are available at https://***/GefionP/YOLO-FMS.
Intelligent detection of pavement damage is crucial to road maintenance. Timely identification of cracks and potholes helps prolong the road service life. Current detection models fail to balance accuracy and speed. I...
详细信息
Intelligent detection of pavement damage is crucial to road maintenance. Timely identification of cracks and potholes helps prolong the road service life. Current detection models fail to balance accuracy and speed. In this study, we propose a fast damage detection algorithm named FPDDN to achieve real-time and high-accuracy pavement damage detection. FPDDN integrates the deformable transformer, D2f block, and SFB module to predict pavement damage of different sizes in multiple branches. The deformable transformer allows the FPDDN to exhibit adaptability to geometric variations in road defects, thereby improving the detection accuracy of irregular defects such as cracks. D2f block is mainly used to lightweight the network and increase the inference speed. The SFB module can significantly decrease the loss of information during downsampling of small-sized objects. This integration enhances the model's ability to extract global damage features, reduces the loss of information on small-scale defects, and improves the synergy between deep and shallow feature layers. The model's performance was evaluated using the RDD2022 dataset, focusing on inference speed and detection accuracy. When compared to state-of-the-art models such as YOLO v8, FPDDN has a parameter count that is only one-fifth of that of YOLO v8x, yet it surpasses YOLO v8x in detection accuracy. The FPDDN achieved an F1 score of 0.601 and a mAP50 of 0.610 on the RDD2022 dataset, outperforming the compared models. Additionally, the algorithm achieved a balance between accuracy and speed with an inference speed of 1.8ms for pavement damage detection.
The widespread use of social media platforms has led to an increase in the dissemination of fake news with the intention of manipulating public opinion and causing chaos and panic among the population. To address this...
详细信息
The widespread use of social media platforms has led to an increase in the dissemination of fake news with the intention of manipulating public opinion and causing chaos and panic among the population. To address this issue, we focus on detecting the organized groups that participate together in fake news campaigns without prior knowledge of the news content or the profiles of social accounts. To this end, we propose a spatial-temporal similarity graph, a novel graph structure that connects social accounts that participate in the early stage of similar fake news campaigns. A community detection algorithm is applied on the similarity graph to cluster the users into communities. We propose a community labeling algorithm to label the communities as benign or malicious based on the output of a fake news classifier. Evaluation results show that the community labeling algorithm can correctly label the communities with an accuracy of $99.61\%$ . In addition, we perform a statistical comparison analysis to identify the structural community features that are statistically significant between benign and malicious communities.
Image online monitoring technology has been widely used in transmission line inspection, but intelligent and efficient foreign object detection still has a gap with the ideal. The focus of this study is to develop an ...
详细信息
Image online monitoring technology has been widely used in transmission line inspection, but intelligent and efficient foreign object detection still has a gap with the ideal. The focus of this study is to develop an advanced system using the Multi-Fusion Mixed Attention Module-You Only Look Once (MFMAM-YOLO) algorithm for real-time detection of foreign objects on transmission lines. The main objective is to improve the safety and reliability of power transmission systems by swiftly identifying and removing any foreign objects that may pose hazards. The results demonstrate the effectiveness and efficiency of the proposed approach in accurately detecting foreign objects, thereby providing a valuable tool for maintaining the integrity of transmission lines.
Effective ship detection in synthetic aperture radar (SAR) imagery is crucial for maritime safety and surveillance. Despite the advancements in deep learning for SAR ship detection, significant challenges remain, part...
详细信息
Effective ship detection in synthetic aperture radar (SAR) imagery is crucial for maritime safety and surveillance. Despite the advancements in deep learning for SAR ship detection, significant challenges remain, particularly in large scenes. These challenges are twofold: the detection of extremely small ships is often hindered by inadequate feature extraction, and the presence of inshore ships is obscured by pronounced land-based interference, both of which lead to reduced detection accuracy. To address these issues, we present a novel deep learning framework that integrates constant false alarm rate (CFAR) processing with dual-polarization data, termed the CFAR-guided dual-polarization fusion framework (CFAR-DP-FW). The integration is designed to enhance the detection sensitivity for small-scale maritime targets by utilizing dual-polarization's rich feature representation, and CFAR's strength in suppressing background noise, highlighting potential targets. The proposed CFAR-DP-FW consists of three core components: the CFAR dual-polarization detector provides initial target indication;the CFAR field generator constructs a probabilistic ship presence map, reducing reliance on CFAR's hard thresholds;and the CFAR guidance dual-polarization network incorporates a novel feature extractor and enhancement module, tailored to amplify relevant features, and suppress noise. This strategic fusion within our framework markedly improves the detection of small and inshore ships. Evaluated on the large-scale SAR ship detection dataset-v1.0, our framework demonstrates superior performance, surpassing 20 state-of-the-art models. It achieves a 3.28% increase in mean average precision for inshore ships over the next best-performing model, validating its efficacy in tackling the intricate challenges of large-scale SAR ship detection.
Quotations are essential in lending credibility to news articles. A direct quote, typically enclosed in quotation marks, not only stands out visually but also indicates a reliable source. However, there is a practice ...
详细信息
Quotations are essential in lending credibility to news articles. A direct quote, typically enclosed in quotation marks, not only stands out visually but also indicates a reliable source. However, there is a practice known as 'contextomizing,' where words are extracted from their original context, changing the speaker's intended meaning. This results in a headline quote that semantically diverges from any other quote in the main article. This misrepresentation can lead to misunderstandings, especially in online environments where information is often consumed solely through headlines. To address this issue, this paper introduces QuoteCSE++, a data-centric contrastive embedding framework designed for the representation of quote semantics. Utilizing knowledge about the data and the news domain, QuoteCSE++ enhances a BERT-like transformer encoder to represent the complex semantics of news quotes and enables the detection of articles with contextomized headline quotes accurately. Our evaluation experiments demonstrate the superiority of the proposed method over both general-purpose embedding and domain-adapted methods in terms of detection accuracy. Remarkably, the proposed method exhibits a few-shot detection capability, achieving the performance level of SimCSE with just 200 training samples. We also test the ability of this framework for more general tasks of retrieving relevant quotes, implying its potential contribution to relevant fields. We release a dataset of 3,000 examples with high-quality manual annotations to support future research endeavors. Code and dataset are available at https://***/ssu-humane/contextomized-quotes-access.
Cervical spine fractures are a medical emergency that can cause permanent paralysis and even death. Traditional fracture detection techniques, such as manual radiography image interpretation, are time-consuming and pr...
详细信息
Cervical spine fractures are a medical emergency that can cause permanent paralysis and even death. Traditional fracture detection techniques, such as manual radiography image interpretation, are time-consuming and prone to human error. Deep learning algorithms have shown promising results in various medical imaging applications i.e., disease diagnosis, including fracture detection of bones. In this study, we propose a two-stage approach for detecting cervical spine fractures. The first stage employs a convolutional neural network (CNN) model to determine the presence or absence of a fracture in the cervical spine, using a dataset of cervical spine Computed Tomography (CT) scan images as well as Grad-CAM for enhanced visualization and interpretation. In the second stage, our focus shifts to specific vertebrae within the cervical spine. To accomplish this task, we trained and evaluated the performance of the YOLOv5 and YOLOv8 models with 9170 images consisting of seven vertebrae. The detection results of both YOLO versions are compared and evaluated. The precision, recall, mAP50, and mAP50-90 were 0.900, 0.890, 0.935, 0.872, respectively. The results of this research demonstrate the potential of deep learning-based approaches for cervical spine fracture detection. By automating the detection process, these algorithms can assist radiologists and healthcare professionals in making accurate and timely diagnoses, leading to improved patient outcomes.
暂无评论