While image segmentation is crucial in various computer vision applications, such as autonomous driving, grasping, and robot navigation, annotating all objects at the pixel-level for training is nearly impossible. The...
详细信息
While image segmentation is crucial in various computer vision applications, such as autonomous driving, grasping, and robot navigation, annotating all objects at the pixel-level for training is nearly impossible. There-fore, the study of unsupervised image segmentation methods is essential. In this paper, we present a pixel-level clustering framework for segmenting images into regions without using ground truth annotations. The proposed framework includes feature embedding modules with an attention mechanism, a feature statistics computing module, image reconstruction, and superpixel segmentation to achieve accurate unsupervised segmentation. Additionally, we propose a training strategy that utilizes intra-consistency within each superpixel, inter-similarity/dissimilarity between neighboring superpixels, and structural similarity between images. To avoid potential over-segmentation caused by superpixel-based losses, we also propose a post-processing method. Furthermore, we present an extension of the proposed method for unsupervised semantic segmentation. We conducted experiments on three publicly available datasets (Berkeley segmentation dataset, PASCAL VOC 2012 dataset, and COCO-Stuff dataset) to demonstrate the effectiveness of the proposed framework. The experimental results show that the proposed framework outperforms previous state-of-the-art methods.
imageprocessing is a vigorous area of study that utilizes various algorithms to manipulate, analyze, and enhance digital images. image denoising is one of the crucial applications of imageprocessing. Still, the occu...
详细信息
ISBN:
(纸本)9783031686528;9783031686535
imageprocessing is a vigorous area of study that utilizes various algorithms to manipulate, analyze, and enhance digital images. image denoising is one of the crucial applications of imageprocessing. Still, the occurrence of image noise is inevitable due to various sources, including low light conditions, high ISO settings, and transmission artifacts, necessitating the availability of denoising techniques to significantly improve visual image quality. This is particularly important in fields such as computer vision, medical imaging and remote sensing. Not only does it facilitate image analysis by retaining important details, but it also optimizes the performance of compression algorithms, improves storyteller detection. In this project, we propose an in-depth study of image denoising, focusing on the use of convolutional neuralnetworks (CNNs). The problem of Gaussian noise will be treated by applying different levels of s (low sigma = 15, medium sigma = 25, and high sigma = 50). During this project, a full comparative analysis will be made with the three mainCNNarchitectures: DnCNN, RIDNet, and IRCNN, illustrative of the quantitative and qualitative experimental results obtained by these different approaches. In fact, these approaches have shown impressive performance in imageprocessing tasks, including image denoising, since they used different techniques that can be adopted in CNN, such as regularization methods, batch normalization, and residual learning.
Agriculture is often known as the art and science of nurturing soil. It involves preparing plants and animals for use in products. Agriculture is the process of growing crops and rearing animals for human consumption,...
详细信息
Accurate diagnosis of plant diseases by the assessment of pathogen presence to reduce disease-related production loss is one of the most fundamental issues for farmers and specialists. This will improve product qualit...
详细信息
Accurate diagnosis of plant diseases by the assessment of pathogen presence to reduce disease-related production loss is one of the most fundamental issues for farmers and specialists. This will improve product quality, increase productivity, reduce the use of fungicides, and reduce the final cost of agricultural production. Today, new technologies such as imageprocessing, artificial intelligence, and deep learning have provided reliable solutions in various fields of precision agriculture and smart farm management. In this research, microscopic imageprocessing and machine learning have been used to identify the spores of four common tomato fungal diseases. A dataset including 100 microscopic images of spores for each disease was developed, followed by the extraction of the texture, color, and shape features from the images. The classification results using random forest revealed an accuracy higher than 98%. Besides, as a reliable feature selection algorithm, the butterfly optimization algorithm (BOA) was used to detect the effective image features to identify and classify diseases. This algorithm recognized image textural features as the most effective features in the diagnosis and classification of disease spores. Considering only the eight most effective features selected with BOA resulted in an accuracy of 95% in disease detection. To further investigate the performance of the proposed method, its accuracy was compared with the accuracies of convolutional neuralnetworks and EfficientNet as two reliable deep learning algorithms. Not only the prediction accuracy of these methods was not favorable (65 and 83.55%, respectively), they were very time-consuming. According to the findings, the proposed framework has high reliability in disease diagnosis and can help in the management of tomato fungal diseases.
Approximate computing is a promising paradigm for improving the performance parameters of electronic systems at the expense of accuracy in error-resilient tasks such as multimedia processing, image multiplication, and...
详细信息
ISBN:
(纸本)9798331522452;9798331522445
Approximate computing is a promising paradigm for improving the performance parameters of electronic systems at the expense of accuracy in error-resilient tasks such as multimedia processing, image multiplication, and neuralnetworks. While approximate circuits utilizing CMOS technology have been extensively studied, integrating approximate computing with emerging technologies like memristors offers further performance enhancements. HyCMAx investigates a hybrid CMOS-memristor approach for designing approximate circuits. In this paper, an approximate subtractor has been proposed, which was subsequently used to implement a restoring divider using the hybrid CMOS-memristor approach. HyCMAx dividers implemented in 28nm CMOS technology node gave up to similar to 43.8% dynamic power reduction and similar to 31.3% transistor count reduction as compared to only-CMOS implementation. Different levels of approximation were introduced in the divider to study the limits of approximation, which would give acceptable results. The proposed designs were then evaluated in the context of neuralnetworks and imageprocessingapplications. This study highlights the potential of combining CMOS and memristor technologies to create high-performance, power-efficient approximate circuits suitable for various error-resilient computational tasks.
Abnormal electrical activities due to brain tumor, developmental anomaly, neural-atrophy in cortical/sub-cortical brain regions cause an epileptic seizure. Electroencephalography (EEG) is an important diagnostic test ...
详细信息
ISBN:
(纸本)9798350343557
Abnormal electrical activities due to brain tumor, developmental anomaly, neural-atrophy in cortical/sub-cortical brain regions cause an epileptic seizure. Electroencephalography (EEG) is an important diagnostic test used for observing waveforms such as epileptic brain activities. In this study, a new method which detects epileptic seizure from EEG signals automatically is proposed. Discrete wavelet transform and time dependent entropy based statistical features of the EEG signal are used to train artificialneuralnetworks. The proposed method has been applied on EEG signals obtained from healthy individuals and epileptic patients for epileptic seizure detection, and accuracy of 100% has been achieved. This method has also been applied on EEG signals containing normal, interictal and ictal states, and accuracy, sensitivity and specificity of 98.6%, 96.0% and 99.3% have been achieved, respectively.
Although attention mechanisms have been widely applied in natural language processing (NLP) tasks, there are still limitations in their utilization within the field of computer vision. To integrate the advantages of c...
详细信息
Brain cancer is one of the most deadly illnesses. It causes abnormal cells to grow in the brain. Planning for treatment and the prognosis of patients with brain tumors depend greatly on early diagnosis. Brain tumors c...
详细信息
In the past years, machine learning (ML) and deep learning (DL) have led to the advancement of several applications, including computer vision, natural language processing, and audio processing. These complex tasks re...
详细信息
ISBN:
(纸本)9798400716164
In the past years, machine learning (ML) and deep learning (DL) have led to the advancement of several applications, including computer vision, natural language processing, and audio processing. These complex tasks require large models, which is a challenge to deploy in devices with limited resources. These resource-constrained devices have limited computation power and memory. Hence, the neuralnetworks must be optimized through network acceleration and compression techniques. This paper proposes a novel method to compress and accelerate neuralnetworks from a small set of spatial convolution kernels. Firstly, a novel pruning algorithm is proposed based on the density-based clustering method that identifies and removes redundancy in CNNs while maintaining the accuracy and throughput tradeoff. Secondly, a novel pruning algorithm based on the grid-based clustering method is proposed to identify and remove redundancy in CNNs. The performance of the three pruning algorithms (density-based, grid-based, and partitional-based clustering algorithms) is evaluated against each other. The experiments were conducted using the deep CNN compression technique on the VGG-16 and ResNet models to achieve higher accuracy on image classification than the original model at a higher compression ratio and speedup.
This research proposes an innovative method for correcting banding errors in satellite images based on Generative Adversarial networks (GAN). Small satellites are frequently launched into space to obtain images that c...
详细信息
This research proposes an innovative method for correcting banding errors in satellite images based on Generative Adversarial networks (GAN). Small satellites are frequently launched into space to obtain images that can be used in scientific or military research, commercial activities, and urban planning, among other applications. However, its small cameras are more susceptible to radiometric, geometric errors, and other distortions caused by atmospheric interference. The proposed method was compared to the conventional correction technique using experimental data, showing the similar performance (92.64% and 90.05% accuracy, respectively). These experimental results suggest that generative models utilizing artificial Intelligence (AI) techniques, specifically Deep Learning, are getting closer to achieving automatic correction close to conventional methods. Advantages of the GAN models include automating the task of correcting banding in satellite images, reducing the required time, and facilitating the processing without requiring prior technical knowledge in handling Geographic Information Systems (GIS). Potentially, this technique could represent a valuable tool for satellite imageprocessing, improving the accuracy of the results and making the process more efficient. The research is particularly relevant to the field of remote sensing and can have practical applications in various industries.
暂无评论