The practical deployment of machinevision presents particular challenges for resource constrained edge devices. With a clear need to execute multiple tasks with variable workloads, there is a need for a robust approa...
详细信息
The practical deployment of machinevision presents particular challenges for resource constrained edge devices. With a clear need to execute multiple tasks with variable workloads, there is a need for a robust approach that can dynamically adapt at runtime and which can maintain the maximum quality of service (QoS) within the available resource constraints. A lightweight approach that monitors the runtime workload constraints and leverages accuracy-throughput trade-offs on a graphics processing unit (GPU), is presented. It includes optimisation techniques that identify the configurations for each task in terms of optimal accuracy, energy and memory and management of the transparent switching between configurations. Using a neural network architecture search that statically generates a range of implementations that target a resource-precision trade-off, we explore the detection of the optimal parameters for the required QoS under specific memory and energy constraints. For an accuracy loss of 1%, we demonstrate that a 1.6x higher frame processing rate can be achieved on GPU with further improvements possible at further relaxed accuracy. In order to further improve the switching between configurations, we enhance the proposed mechanism by employing central processing units (CPUs) for offloading some of the executed frames, which helps to improve the frame rate by further 0.9%.
Coronary artery disease (CAD) is a prevalent cardiovascular condition and a leading cause of mortality. An accurate and timely diagnosis of CAD is crucial for treatment. This study aims to detect stenosis in real-time...
详细信息
Coronary artery disease (CAD) is a prevalent cardiovascular condition and a leading cause of mortality. An accurate and timely diagnosis of CAD is crucial for treatment. This study aims to detect stenosis in real-time and automatically during angiographic imaging for CAD diagnosis, using the YOLOv9c model. A dataset comprising 8325 grayscale images was utilized, sourced from 100 patients diagnosed with one-vessel CAD. To enhance sensitivity and accuracy during the training, testing, and validation phases of stenosis detection, fine-tuning and augmentations were applied. The Python API, utilizing YOLO and Ultralytics libraries, was employed for these processes. The analysis revealed that the YOLOv9c model achieved remarkably high performance in both processing speed and detection accuracy, with an F1-score of 0.99 and mAP@50 of 0.99. The inference time was reduced to 18 ms, fine-tuning time to 3.5 h, and training time to 11 h. When the same dataset was tested using another significant diagnostic algorithm, SSD MobileNet v1, the YOLOv9c model outperformed it by achieving 1.36 x better F1-score and 1.42 x better mAP@50. These results indicate that the developed YOLOv9c algorithm can provide highly accurate and real-time results for stenosis detection.
In order to obtain the low resolution(LR) image's detailed information, super resolution(SR) image reconstruction is essential. From 1D projections, we can reconstruct images in 2D and 3D. The LR images attained h...
详细信息
This research proposes a ground-breaking technique for protecting agricultural fields against animal invasion, addressing a key challenge in the agriculture industry. The suggested system guarantees real-time intrusio...
详细信息
This research proposes a ground-breaking technique for protecting agricultural fields against animal invasion, addressing a key challenge in the agriculture industry. The suggested system guarantees real-time intrusion detection and quick reactions by combining cutting-edge sensor technologies, imageprocessing capabilities, and the Internet of Things (IoT), successfully safeguarding crops and reducing agricultural losses. This study involves a thorough examination of five models-Inception, Xception, vGG16, AlexNet, and Yolov8-against three different datasets. The Yolov8 model emerged as the most promising, with exceptional accuracy and precision, exceeding 99% in both categories. Following that, the Yolov8 model's performance was compared to previous study findings, confirming its excellent capabilities in terms of intrusion detection in agricultural settings. Using the capabilities of the Yolov8 model, an IoT device was designed to provide real-time intrusion alarms on farms. The ESP32cam module was used to build this gadget, which smoothly integrated this cutting-edge model to enable efficient farm security measures. The incorporation of this technology has the potential to transform farm monitoring by providing farmers with timely, actionable knowledge to prevent possible threats and protect agricultural production.
The world's ocean depths conceal a big mystery, and obtaining the information contained therein is a significant challenge that must be overcome. With the advent of computer vision technologies and robotics, the u...
详细信息
The world's ocean depths conceal a big mystery, and obtaining the information contained therein is a significant challenge that must be overcome. With the advent of computer vision technologies and robotics, the underwater environment is explored recently. The vast data collected from numerous underwater sensors have a variety of complications related to inadequate image quality, difficulty in acquiring training samples, and uncontrolled objects in the underwater environment. When these images are processed using machine learning techniques that involve manual intervention, the time taken to process a huge amount of images will be relatively high and prone to errors. To tackle these, we propose a novel hybrid capuchin-based coevolving particle swarm optimization ((HCPSO)-P-2) algorithm with a ResNet model of Convolutional Neural Network (CNN) architecture for underwater object identification. This work mainly aims to explore different underwater objects such as fish, corals, sea urchins, etc. The speckle-reducing anisotropic diffusion (SRAD) filter performs the pre-processing step. The denoising autoencoder (DA) is used for feature extraction which can enhance the partially distorted sample images and offer increased robustness. To overcome the overfitting issue in CNN, the (HCPSO)-P-2 algorithm is used. The experimental works are handled in MATLAB software. Both with and without pre-processing results in terms of SRAD filter are checked and evaluated. The proposed method's effectiveness is evaluated through various measures like accuracy, specificity, sensitivity, false-positive rate, false-negative rates, etc. The accuracy of the (HCPSO)-P-2-CNN classifier is higher when compared to the standard CNN classifier in recognizing the underwater objects when evaluated with different performance metrics.
Diabetic retinopathy (DR) is an impediment of diabetes mellitus, which if not treated early may result in complete loss of vision, even without any preemptive symptoms. DR is caused by high level of glucose in the blo...
详细信息
Diabetic retinopathy (DR) is an impediment of diabetes mellitus, which if not treated early may result in complete loss of vision, even without any preemptive symptoms. DR is caused by high level of glucose in the blood, causing alterations in the microvasculature of retina. However, early screening of diabetic patients through retinal fundus imaging, along with proper diagnosis and treatment can control the prevalence of DR complications. Manual inspection of pathological changes in retinal fundus images is an extremely challenging and tedious task. Therefore, computer-aided diagnosis (CAD) system is an efficient and effective method for early detection of DR and can greatly assist the ophthalmologists. CAD system encompasses DR detection and severity grading that includes detection, classification, localization and segmentation of lesions from the fundus images. Significant contributions have been made in DR severity grading using conventional imageprocessing approaches using hand-engineered features and traditional machine-learning (ML) techniques. In the recent years, significant development of deep learning (DL) methods alleviated by the advancement of hardware computation power and efficient learning algorithms, has triumphed over the traditional ML methods in DR detection and grading tasks. Many researchers have employed the established as well as customized DL models in different DR image repositories and reported their findings. In this paper, we conduct a detailed review of the recent state-of-the-art contributions in the field of DL based DR classification by explaining their methodologies and highlighting their advantages and limitations. A detailed comparative study based on certain statistical parameters has also been conducted to quantitatively evaluate the methods, models and preprocessing techniques. In addition, the challenges in designing an efficient, accurate and robust deep-learning model for DR classification are explored in details to help t
Drill pipe joint’s thread quality directly affects the machining performance and the drill pipe’s service life. machinevision can quickly detect thread parameters to determine the thread processing quality, but thi...
详细信息
Oral cancer primarily affects the oral chamber within the head and neck area, and underscores the critical need for effective classification to initiate timely treatment. Deep learning (DL)-based computer-aided diagno...
详细信息
Robotic harvesting of fruits and vegetables is an advanced technology that leverages Robotics, Artificial Intelligence, and machinevision to harvest the fruits autonomously from plants or trees. This technology aims ...
详细信息
Cocoa hybridisation generates new varieties which are resistant to several plant diseases, but has individual chemical characteristics that affect chocolate production. image analysis is a useful method for visual dis...
详细信息
Cocoa hybridisation generates new varieties which are resistant to several plant diseases, but has individual chemical characteristics that affect chocolate production. image analysis is a useful method for visual discrimination of cocoa beans, while deep learning (DL) has emerged as the de facto technique for imageprocessing . However, these algorithms require a large amount of data and careful tuning of hyperparameters. Since it is necessary to acquire a large number of images to encompass the wide range of agricultural products, in this paper, we compare a Deep Computer vision System (DCvS) and a traditional Computer vision System (CvS) to classify cocoa beans into different varieties. For DCvS, we used a Resnet18 and Resnet50 as backbone, while for CvS, we experimented traditional machine learning algorithms, Support vector machine (SvM), and Random Forest (RF). All the algorithms were selected since they provide good classification performance and their potential application for food classification A dataset with 1,239 samples was used to evaluate both systems. The best accuracy was 96.82% for DCvS (ResNet 18), compared to 85.71% obtained by the CvS using SvM. The essential handcrafted features were reported and discussed regarding their influence on cocoa bean classification. Class Activation Maps was applied to DCvS's predictions, providing a meaningful visualisation of the most important regions of the images in the model.
暂无评论