deeplearning in automatic video colonoscopy processing may result in missing small polyps or detecting them with low confidence. We conducted a study to demonstrate that we can overcome such problems by fitting the d...
详细信息
ISBN:
(数字)9781665485579
ISBN:
(纸本)9781665485586
deeplearning in automatic video colonoscopy processing may result in missing small polyps or detecting them with low confidence. We conducted a study to demonstrate that we can overcome such problems by fitting the dimensions of the input image to the dimensions of the input layer of a trained Mobilenet deeplearning network and by grouping polyps in classes according to their size. The results show that it is possible to substantially improve the confidence rates and we propose a scheme to increase small polyps detection in real-time video colonoscopies based on two trained Mobilenet networks.
On-board object detection is an important requirement for Unmanned Aerial Vehicles (UAVs) when carrying out a variety of tasks such as obstacle avoidance, search and rescue, and automatic target recognition. One of th...
详细信息
ISBN:
(纸本)9781665405355
On-board object detection is an important requirement for Unmanned Aerial Vehicles (UAVs) when carrying out a variety of tasks such as obstacle avoidance, search and rescue, and automatic target recognition. One of the main difficulties of conducting object detection for a UAV is that, because the objects of interest are observed from altitude, this causes them to appear very small in images acquired from an onboard camera. In this work, we attempt to provide a solution to this difficulty, while also seeking to meet two other criteria that are important to the deployment of Artificial Intelligence with UAVs: firstly, the capability to operate in real-time;and secondly, the ability of the user to be able to trust the predictions it makes. To meet the challenge of small object detection we present the use of an image Tile Loader to enhance the capability of deep Neural Network (DNN) style detectors, while minimising the processingtime costs. Furthermore, we also introduce the practice of using Grad-CAM to provide better insights into a detection style architecture as a means to enhance trustworthiness.
Nowadays, the world is experiencing an increasing boom in applications of artificial intelligence, especially deeplearning. This is more and more used in many areas such as industry, medicine and security systems, et...
详细信息
In the beverage manufacturing industry-the process of closing and sealing the bottle caps by an automated machine could cause defects due to imperfect equipment alignment. Small businesses rely on humans as a means to...
详细信息
real-time detection of human activities has become very important in terms of surveillance and security of Bank-Automated Teller Machines (ATMs), public offices because of the day-to-day increase in criminal activitie...
详细信息
Facial recognition technology has gained widespread use in various applications, raising concerns about the weakness of frameworks to confront mocking assaults. This study presents an implementation of face spoofing d...
详细信息
ISBN:
(数字)9798350395327
ISBN:
(纸本)9798350395334
Facial recognition technology has gained widespread use in various applications, raising concerns about the weakness of frameworks to confront mocking assaults. This study presents an implementation of face spoofing detection using machine learning techniques. The exploration utilizes a far-reaching system that envelops data combination, preprocessing, incorporate extraction, and model readiness. A diverse dataset comprising genuine and spoofed facial images, representing various spoofing techniques, is utilized. Feature extraction leverages Convolutional Brain Organizations (CNNs) to catch discriminative facial elements. The selected machine learning model is trained and fine-tuned, with a focus on achieving robustness against evolving spoofing methods. The evaluation of the implemented system involves rigorous testing on a separate dataset, utilizing estimations like precision, exactness, survey, and F1-score. The study investigates post-processing techniques and considerations for real-time deployment, ensuring practical applicability is done by the method convolutional neural network (CNN). Cross-approval is performed to evaluate the model's speculation capacities, and the deployment phase explores integration into real-world scenarios. Ethical considerations, user feedback, and compliance with data privacy regulations are integral components of the study.
Wireless capsule endoscopy (WCE) is an efficient tool to investigate gastrointestinal tract disorders and perform painless imaging of the intestine. Despite that, several concerns make its wide applicability and adapt...
详细信息
Wireless capsule endoscopy (WCE) is an efficient tool to investigate gastrointestinal tract disorders and perform painless imaging of the intestine. Despite that, several concerns make its wide applicability and adaptation challenging like efficacy, tolerance, safety, and performance. Besides, automatic analysis of the WCE provided dataset is of great importance for detecting abnormalities. Imaging of the patient's digestive tract through WCE produces a large dataset that requires a substantial amount of time and a special skill set from a medical practitioner for analysis. Several computer-aided and vision-based solutions have been proposed to resolve these issues, yet, they do not provide the desired level of accuracy and further improvements are still needed. The current study aims to devise a system that can perform the task of automatic analysis of WCE images to identify abnormalities and assist practitioners for robust diagnosis. This study adopts a deep neural network approach and proposes a model name BIR (bleedy image recognizer) that combines the MobileNet with a custom-built convolutional neural network (CNN) model to classify WCE bleedy images. BIR uses the MobileNet model for initial-level computation for its lower computation power requirement and subsequently the output is fed to the CNN for further processing. A dataset of 1650 WCE images is used to train and test the model using the measures of accuracy, precision, recall, F1 score, and Cohen's kappa to evaluate the performance of the BIR. Results indicate the promising outcomes with achieved accuracy, precision, recall, F1 score, and Cohen's kappa of 0.993, 1.000, 0.994, 0.997, and 0.995 respectively. The accuracy of the BIR model is 0.978 with the Google collected WCE image dataset which is better than the state-of-art approaches.
In the critical care unit, bedside monitors track patients’ vital signs to help physicians make choices. With increased storage and analysis capability, huge datasets may be processed faster. Lately, physiological da...
详细信息
ISBN:
(数字)9798350371406
ISBN:
(纸本)9798350371413
In the critical care unit, bedside monitors track patients’ vital signs to help physicians make choices. With increased storage and analysis capability, huge datasets may be processed faster. Lately, physiological data have showed promise for identifying specific consequences and occurrences. At the critical care unit. Our method was aimed to Live bedside vital sign data may be automatically retrieved. Devices that can predict patient mechanical ventilation for more than 4 days in a pediatric critical care unit. 284 ICU patients had their vital signs and clinical history documented continuously from admission. 24 hours of artificial breathing in Boston Children’s Hospital’s pediatric medical-surgical critical care unit. They trained many machine learning models using subsets. Assess the likelihood of each person staying long. We examined our systems’ capacity to anticipate outcomes for unknown persons. Adding vital sign data increased model performance to 90% (area under the curve) from >83% when using just vital sign data. static participant clinical data in electronic health records. For research group representative sample reliability, we employed this approach on 300 individually trained tests employing a variety of sets for training and hold-out validation. Other deeplearning methods failed to forecast like ours. Gaining equivalent work outcomes prediction skills. We may be able to help create scalable ICU-based prediction systems that leverage real-time patient monitor data.
image segmentation is a very popular and important task in computer vision. In this paper, inverse quantum Fourier transform (IQFT) for image segmentation has been explored and a novel IQFT-inspired algorithm is propo...
image segmentation is a very popular and important task in computer vision. In this paper, inverse quantum Fourier transform (IQFT) for image segmentation has been explored and a novel IQFT-inspired algorithm is proposed and implemented by leveraging the underlying mathematical structure of the IQFT. Specifically, the proposed method takes advantage of the phase information of the pixels in the image by encoding the pixels’ intensity into qubit relative phases and applying IQFT to classify the pixels into different segments automatically and efficiently. To the best of our knowledge, this is the first attempt of using IQFT for unsupervised image segmentation. The proposed method has low computational cost comparing to the deeplearning based methods and more importantly it does not require training, thus make it suitable for real-time applications. The performance of the proposed method is compared with K-means and Otsu-thresholding. The proposed method outperforms both of them on the PASCAL VOC 2012 segmentation benchmark and the xVIEW2 challenge dataset by as much as 50% in terms of mean Intersection-Over-Union (mIOU).
In recent times, technological advancement has brought a tremendous change in the field o f c ricket, which is a popular sport in many countries. Technology is being utilized to figure out projected score prediction, ...
详细信息
暂无评论