Using machine vision and imageprocessing methods has an important role in the identification of defects of agricultural products, especially potatoes. The applications of imageprocessing and artificial intelligence ...
详细信息
Using machine vision and imageprocessing methods has an important role in the identification of defects of agricultural products, especially potatoes. The applications of imageprocessing and artificial intelligence in agriculture in identifying and classifying pests and diseases of plants and fruits have increased and research in this field is ongoing. In this paper, we use the convolution neural network (CNN) methods, also, we examined 5 classes of potato diseases with the names: Healthy, Black Scurf, Common Scab, Black Leg, Pink Rot. We used a database of 5000 potato images. We compared the results of potato defect classification our methods with other methods such as Alexnet, Googlenet, VGG, R-CNN, Transfer Learning. The results show that the accuracy of the deep learning proposed method is higher than other existing works. We get 100% and 99% accuracy in some of the classes, respectively.
Detecting small objects in drone-captured images or aerial videos is challenging due to their minimal representation. As data traverses deep learning networks, the information about small objects can diminish, making ...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Detecting small objects in drone-captured images or aerial videos is challenging due to their minimal representation. As data traverses deep learning networks, the information about small objects can diminish, making high-resolution images essential for enhanced detection performance. However, high-resolution images increase computational load undesirably. Leveraging this fact, we propose a streamlined neural network designed specifically for small object detection in high-resolution images. The proposed network encompasses three main components: i) Enhanced High-Resolution processing Module (EHRPM), ii) the Small Object Feature Amplified Feature Pyramid Network (SOFA-FPN) with its Edge Enhancement Module (EEM), Cross Lateral Connection Module (CLCM), and Dual Bottom-up Convolution Module (DBCM), and iii) the Sigmoid Re-weighting Module (SRM). Compared to several state-of-the-art networks, our method delivers superior performance with fewer parameters and a lower computational demand. The source code is available at https://***/datu0615/EHRPM.
Medical image segmentation is challenging because the boundaries of lesions are often hazy or unclear. As a result, how to better distinguish the lesion boundaries to improve segmentation accuracy has become the main ...
详细信息
ISBN:
(纸本)9798350359329;9798350359312
Medical image segmentation is challenging because the boundaries of lesions are often hazy or unclear. As a result, how to better distinguish the lesion boundaries to improve segmentation accuracy has become the main focus of research in recent years. The diffusion model, a potent visual image generation model, has extended applications in medical image segmentation. In this work, we propose a Poisson-distribution diffusion model (PDDM), a novel conditional diffusion model based on the Poisson distribution, to fully use the diffusion model's enormous potential in medical image segmentation. To increase the segmentation diversity of diffusion models in discrete spaces and produce more precise segmentation maps, we first explore utilizing Poisson noise as the diffusion kernel rather than Gaussian noise. Then, by making use of the diffusion model's stochastic nature, our model repeatedly samples the original Poisson noise and intermediate latent variables at random to generate a variety of segmentation masks. Moreover, we include a multi-step weighted fusion (MSWF) module in the reasoning process to merge the output of the diffusion model at each step, to improve the robustness of the PDDM's prediction results. Finally, we validate the effectiveness and generalization of our model on three medical image segmentation datasets of different imaging modalities.
Soil pollution resulting from petroleum hydrocarbons (PHCs) arising from industrialization and human activities has emerged as a progressively severe global concern. Establishing an accurate spatial distribution predi...
详细信息
Soil pollution resulting from petroleum hydrocarbons (PHCs) arising from industrialization and human activities has emerged as a progressively severe global concern. Establishing an accurate spatial distribution prediction model for PHCs through limited sampling data play an important role in understanding the migration characteristics of PHCs and effectively preventing soil pollution. This article employs soil samples within 8 m of a chemical plant, in conjunction with hydrogeological data, to model the spatial distribution of PHC content using a feedforward neural network (FNN). The prediction outcomes are characterized through three-dimensional visualization. The findings indicate that FNN demonstrates superior estimation accuracy compared to traditional interpolation method. Regarding the horizontal distribution within surface soil, there is pronounced lateral migration of PHC content in both the storage area and manufacturing shop, with migration aligning following the direction of groundwater. Vertically, PHC content exhibits a consistent pattern of increasing and then decreasing with greater depth. It is predominantly enriched in the lower section of the aeration zone and the upper part of the saturated zone, particularly within 4 m, under influence of groundwater. In this study, the prediction model offers an original approach to the spatial distribution of soil pollutants.
Cancer is one of the deadliest diseases in the present days. Its survivability is mostly corelated to early detection and treatment, which means that it is of utmost importance to successfully diagnose the patients. U...
详细信息
Cancer is one of the deadliest diseases in the present days. Its survivability is mostly corelated to early detection and treatment, which means that it is of utmost importance to successfully diagnose the patients. Unfortunately, even with years of experience human errors can happen which leads to the death of many individuals being misdiagnosed. Throughout the years there have been several applications created which could possibly aid doctors in the diagnosis. neuralnetworks have always been a powerful tool which can be used in different applications that require an accurate model and the complexity of these models exceeds a human's computational capabilities. In imageprocessing for example, a convolutional neural network can analyze each particular pixel and determine through the convolution function the common properties of different pictures. The objective of this study is to analyze different types of cancer diagnosing methods that have been developed and tested using imageprocessing methods. The analyzed factors are training parameters, imageprocessing technique and the obtained performances. This survey/review can be of significant value to researchers and professionals in medicine and computer science, highlighting areas where there are opportunities to make significant new contributions.
Real-time imageprocessing is a key area of focus, but computationally intensive. neuralnetworks effectively address classification tasks, but they are not always a viable option, particularly in environments where h...
详细信息
ISBN:
(纸本)9781510673199;9781510673182
Real-time imageprocessing is a key area of focus, but computationally intensive. neuralnetworks effectively address classification tasks, but they are not always a viable option, particularly in environments where high power consumption or computational requirements are limiting factors. Hardware devices such as Field-Programmable Gate Arrays (FPGAs) offer significant parallelization capabilities that can be fully exploited when the implemented circuit is composed solely of logic gates. In addition, FPGAs are also interesting alternatives to traditional GPU-based implementations in terms of power consumption and reconfiguration capabilities. They can be used as a demonstration platform to validate a hardware design that can be later manufactured, creating the final Application-Specific Integrated Circuit (ASIC). This paper introduces a practical demonstration platform based on an FPGA that highlights the great capabilities of logic neuralnetworks, a type of neural network constructed exclusively with logic gates. By harnessing FPGA parallelization and logic gates, we have achieved a balance between computational power and real-time performance. This approach ensures that image classification occurs at speeds on the order of nanoseconds. This ultra-fast processing is well-suited for real-time image analysis applications across various domains. Industries that rely on quality control, such as manufacturing, will benefit from rapid and precise assessments. In the field of medical imageprocessing, where quick diagnoses are crucial, this technology promises transformative advancements. The demonstration platform developed serves as a proof of concept for logic neuralnetworks, offering a solution to the challenge of real-time imageprocessing and representing the first step towards the implementation of future architectures of logic networks in hardware.
Convolution operations that are consecutively applied in a typical CNN architecture, cause the loss of original details in input image signals at the cost of extracting new features. Among these details are the coarse...
详细信息
ISBN:
(纸本)9798350388978;9798350388961
Convolution operations that are consecutively applied in a typical CNN architecture, cause the loss of original details in input image signals at the cost of extracting new features. Among these details are the coarse patterns the network model tries to capture in deeper layers. However, those coarse details can be easily detected in lower image resolutions and incorporated into the higher level features. Based on this hypothesis, in this study we propose a novel multi-scale multi-input recursive context aggregation network which works on semantic segmentation tasks and show that it outperforms baseline U-Net model by 2% in mIoU on Oxford-IIIT Pet dataset.
Currently, image generation and synthesis have remarkably progressed with generative models. Despite photo-realistic results, intrinsic discrepancies are still observed in the frequency domain. The spectral discrepanc...
详细信息
ISBN:
(纸本)1577358872
Currently, image generation and synthesis have remarkably progressed with generative models. Despite photo-realistic results, intrinsic discrepancies are still observed in the frequency domain. The spectral discrepancy appeared not only in generative adversarial networks but in diffusion models. In this study, we propose a framework to effectively mitigate the disparity in frequency domain of the generated images to improve generative performance of both GAN and diffusion models. This is realized by spectrum translation for the refinement of image generation (STIG) based on contrastive learning. We adopt theoretical logic of frequency components in various generative networks. The key idea, here, is to refine the spectrum of the generated image via the concept of image-to-image translation and contrastive learning in terms of digital signal processing. We evaluate our framework across eight fake image datasets and various cutting-edge models to demonstrate the effectiveness of STIG. Our framework outperforms other cutting-edges showing significant decreases in FID and log frequency distance of spectrum. We further emphasize that STIG improves image quality by decreasing the spectral anomaly. Additionally, validation results present that the frequency-based deepfake detector confuses more in the case where fake spectrums are manipulated by STIG.
Federated learning enhanced with Differential Privacy (DP) is a powerful privacy-preserving strategy to protect individuals sharing their sensitive data for processing in fields such as medicine and healthcare. Many m...
详细信息
ISBN:
(纸本)9783031776090;9783031776106
Federated learning enhanced with Differential Privacy (DP) is a powerful privacy-preserving strategy to protect individuals sharing their sensitive data for processing in fields such as medicine and healthcare. Many medical applications, for example magnetic resonance imaging (MRI), rely on complex-valued signal processing techniques for data acquisition and analysis. However, the appropriate application of DP to complex-valued data is still underexplored. To address this issue, from the theoretical side, we introduce the complex-valued Gaussian mechanism, whose behaviour we characterise in terms of f-DP, (epsilon, delta)-DP and Renyi-DP. Moreover, we generalise the fundamental algorithm DP stochastic gradient descent to complex-valued neuralnetworks and present novel complex-valued neural network primitives compatible with DP. Experimentally, we showcase a proof-of-concept by training federated complex-valued neuralnetworks with DP on a real-world task (MRI pulse sequence classification in k-space), yielding excellent utility and privacy. Our results highlight the relevance of combining federated learning with robust privacy-preserving techniques in the MRI context.
Deep learning (DL) computing has emerged as the Gold Standard in the machine learning (ML) community in recent years. There are numerous recommender systems (RS) being used to treat information over-load problems in e...
详细信息
ISBN:
(纸本)9789819984756;9789819984763
Deep learning (DL) computing has emerged as the Gold Standard in the machine learning (ML) community in recent years. There are numerous recommender systems (RS) being used to treat information over-load problems in e-commerce, entertainment, and social media today. This review presents a comprehensive and comparative analysis of deep learning models based on Convolutional neuralnetworks (CNNs), Long Short-Term Memory (LSTM), Recurrent neuralnetworks (RNNs), Rectified Linear Unit-Deep neural Network (ReLU-DNN), Adaptive Deep Learning-based method for the Recommendation System (ADRS) model, and several other hybrid DL methodologies in the context of RS. Open research problems in the domain, such as collection and feedback of user data, cold start, data sparsity, and scalability have been explored and studies to tackle these issues have been highlighted. It also assesses potential areas for future research and development of hybrid methods combining strengths of a multitude of architectures and incorporating attention mechanisms to enhance the recommendation quality metrics. We analyze the strengths and limitations of each architecture and find that hybrid techniques, like LSTM-RNN, demonstrated outstanding efficacy in problems of text classification. Overall, this review provides a comprehensive understanding of the applications and capabilities of CNNs, LSTM, and RNNs in recommendation systems, serving as a valuable resource for researchers and practitioners in the field. Further, the study will be comprehended by identifying an efficient model with prominent accuracy reports.
暂无评论