The fusion of visible light and infrared images has garnered significant attention in the field of imaging due to its pivotal role in various applications, including surveillance, remote sensing, and medical imaging. ...
详细信息
Diabetic Retinopathy (DR) is a health condition caused due to Diabetes Mellitus (DM). It causes vision problems and blindness due to disfigurement of human retina. According to statistics, 80% of diabetes patients bat...
详细信息
Diabetic Retinopathy (DR) is a health condition caused due to Diabetes Mellitus (DM). It causes vision problems and blindness due to disfigurement of human retina. According to statistics, 80% of diabetes patients battling from long diabetic period of 15 to 20 years, suffer from DR. Hence, it has become a dangerous threat to the health and life of people. To overcome DR, manual diagnosis of the disease is feasible but overwhelming and cumbersome at the same time and hence requires a revolutionary method. Thus, such a health condition necessitates primary recognition and diagnosis to prevent DR from developing into severe stages and prevent blindness. Innumerable machine Learning (ML) models are proposed by researchers across the globe, to achieve this purpose. Various feature extraction techniques are proposed for extraction of DR features for early detection. However, traditional ML models have shown either meagre generalization throughout feature extraction and classification for deploying smaller datasets or consumes more of training time causing inefficiency in prediction while using larger datasets. Hence Deep Learning (DL), a new domain of ML, is introduced. DL models can handle a smaller dataset with help of efficient data processing techniques. However, they generally incorporate larger datasets for their deep architectures to enhance performance in feature extraction and image classification. This paper gives a detailed review on DR, its features, causes, ML models, state-of-the-art DL models, challenges, comparisons and future directions, for early detection of DR.
Satellite image classification is crucial in various applications such as urban planning,environmental monitoring,and land use *** this study,the authors present a comparative analysis of different supervised and unsu...
详细信息
Satellite image classification is crucial in various applications such as urban planning,environmental monitoring,and land use *** this study,the authors present a comparative analysis of different supervised and unsupervised learning methods for satellite image classification,focusing on a case study in Casablanca using Landsat 8 *** research aims to identify the most effective machine-learning approach for accurately classifying land cover in an urban *** methodology used consists of the pre-processing of Landsat imagery data from Casablanca city,the authors extract relevant features and partition them into training and test sets,and then use random forest(RF),SVM(support vector machine),classification,and regression tree(CART),gradient tree boost(GTB),decision tree(DT),and minimum distance(MD)*** a series of experiments,the authors evaluate the performance of each machine learning method in terms of accuracy,and Kappa *** work shows that random forest is the best-performing algorithm,with an accuracy of 95.42%and 0.94 Kappa *** authors discuss the factors of their performance,including data characteristics,accurate selection,and model influencing.
Redundancy elimination techniques are extensively investigated to reduce storage overheads for cloud-assisted health *** eliminates the redundancy of duplicate blocks by storing one physical instance referenced by mul...
详细信息
Redundancy elimination techniques are extensively investigated to reduce storage overheads for cloud-assisted health *** eliminates the redundancy of duplicate blocks by storing one physical instance referenced by multiple *** compression is usually regarded as a complementary technique to deduplication to further remove the redundancy of similar blocks,but our observations indicate that this is disobedient when data have sparse duplicate *** addition,there are many overlapped deltas in the resemblance detection process of post-deduplication delta compression,which hinders the efficiency of delta compression and the index phase of resemblance detection inquires abundant non-similar blocks,resulting in inefficient system ***,a multi-feature-based redundancy elimination scheme,called MFRE,is proposed to solve these *** similarity feature and temporal locality feature are excavated to assist redundancy elimination where the similarity feature well expresses the duplicate ***,similarity-based dynamic post-deduplication delta compression and temporal locality-based dynamic delta compression discover more similar base blocks to minimise overlapped deltas and improve compression ***,the clustering method based on block-relationship and the feature index strategy based on bloom filters reduce IO overheads and improve system *** demonstrate that the proposed method,compared to the state-of-the-art method,improves the compression ratio and system throughput by 9.68%and 50%,respectively.
This paper explores the utilization of OpenCV (Open-Source Computer vision Library) in artificial intelligence (AI) systems, elucidating its pivotal role in advancing various applications across diverse domains. OpenC...
详细信息
The fusion of edge computing and artificial intelligence, known as Edge AI, represents a paradigm shift that facilitates the direct execution of AI algorithms on edge devices. As these devices become increasingly powe...
详细信息
ISBN:
(纸本)9798350360875;9798350360868
The fusion of edge computing and artificial intelligence, known as Edge AI, represents a paradigm shift that facilitates the direct execution of AI algorithms on edge devices. As these devices become increasingly powerful, their role in developing and deploying AI systems becomes more significant. By eliminating the need to transmit and analyze data at remote machines, Edge AI applications can significantly reduce latency and enhance efficiency by processing data closer to the source. In this study, we thoroughly investigate the performance of our object classification model deployed in a vision inspection system on four types of edge devices (Jetson AGX Orin, Jetson Orin Nano, NUC, and Raspberry Pi). Our object classification models are trained using proprietary industrial datasets provided by industry partners. These models, in FP32, are converted into lower precision processing, being INT8, to evaluate the accuracy variation between FP32 and INT8 precision, and inference speed for different edge devices. In our experiments, we identified that the average accuracy deviation for INT8 models is -2.78%, with some models exhibiting variations exceeding - 10.95%. Most devices have an average inference speed less than 100 ms per image (as requested by industrial partners), except the Raspberry Pi, which records more than 2 seconds of inferencing an image. Intel NUC consumes 107 W, which is averagely comparable with a server PC, while AGX Orin, Orin Nano, and Raspberry Pi consume less than 20 W of power. The outcomes of our evaluations offer valuable insights for selecting appropriate devices for specific scenarios. These detailed observations on the strengths and limitations of different edge devices can guide future research and advancements in Edge AI technology.
Detecting and segmenting fruits in an orchard environment is a vital technique in multiple applications of precision agriculture, such as automated harvesting and yield estimation. This study aims to improve the accur...
详细信息
The classification of an individual as male or female is a significant issue with several practical implications. In recent years, automatic gender identification has garnered considerable interest because of its pote...
详细信息
The classification of an individual as male or female is a significant issue with several practical implications. In recent years, automatic gender identification has garnered considerable interest because of its potential applications in e-commerce and the accumulation of demographic data. Recent observations indicate that models based on deep learning have attained remarkable success in a variety of problem domains. In this study, our aim is to establish an end-to-end model that capitalizes on the strengths of competing convolutional neural network (CNN) and vision transformer (ViT) models. To accomplish this, we propose a novel approach that combines the MobileNetV2 model, which is recognized for having fewer parameters than other CNN models, with the ViT model. Through rigorous evaluations, we have compared our proposed model with other recent studies using the accuracy metric. Our model attained state-of-the-art performance with a remarkable score of 96.66% on the EarVN1.0 dataset, yielding impressive results. In addition, we provide t-SNE results that demonstrate our model's superior learning representation. Notably, the results show a more effective disentanglement of classes.
This research presents a novel computer vision-based attention monitoring system designed for both online and offline contexts. Leveraging advanced imageprocessing and machine learning algorithms, the system analyzes...
详细信息
image captioning is a fascinating and demanding work with applications in many different fields, including image retrieval, organizing and finding user-interested images, etc. It has enormous potential to replace the ...
详细信息
暂无评论