The objective of this project is to develop an advanced quality human intrusion detection system, integrating IoT hardware with advanced software technologies. This will be done by relying on real-time video footage a...
详细信息
Cloud-based Object Storage Systems (OSS) are known for their scalability, durability, availability, and concurrency. However, there is a noticable vaccum in open-source OSS for a straightforward way for users and admi...
详细信息
Cloud-based Object Storage Systems (OSS) are known for their scalability, durability, availability, and concurrency. However, there is a noticable vaccum in open-source OSS for a straightforward way for users and administrators to conduct data searches within object storage without fully utilizing the cloud infrastructure. In our research, we present Sherlock, a novel Content-Based Searching (CoBS) framework. Sherlock enhances search capabilities by using extra information from images and documents, incorporating this information into an Elasticsearch-powered database to enable content-driven searches. The framework operates through a two-stage process. First, it classifies the incoming data by type, directing images to an object detection model and processing documents for keyword extraction. Then, Elasticsearch catalogs the extracted data, facilitating searches based on content. The effectiveness of our searches is largely dependent on the precision of these models, which we improve by training them on large-scale datasets: the Microsoft COCO Dataset for multimedia content and the SemEval2017 Dataset for text documents. We further test our system's performance by integrating it with the open-source OSS, OpenStack Swift, and conducting real-world experiments with image uploads to evaluate how our model performs within Swift's object storage environments.
Purpose Specular reflections (SRs) are highlight artifacts commonly found in endoscopy videos that can severely disrupt a surgeon's observation and judgment. Despite numerous attempts to restore SR, existing metho...
详细信息
Purpose Specular reflections (SRs) are highlight artifacts commonly found in endoscopy videos that can severely disrupt a surgeon's observation and judgment. Despite numerous attempts to restore SR, existing methods are inefficient and time consuming and can lead to false clinical interpretations. Therefore, we propose the first complete deep-learning solution, SpecReFlow, to detect and restore SR regions from endoscopy video with spatial and temporal coherence. Approach SpecReFlow consists of three stages: (1) an image preprocessing stage to enhance contrast, (2) a detection stage to indicate where the SR region is present, and (3) a restoration stage in which we replace SR pixels with an accurate underlying tissue structure. Our restoration approach uses optical flow to seamlessly propagate color and structure from other frames of the endoscopy video. Results Comprehensive quantitative and qualitative tests for each stage reveal that our SpecReFlow solution performs better than previous detection and restoration methods. Our detection stage achieves a Dice score of 82.8% and a sensitivity of 94.6%, and our restoration stage successfully incorporates temporal information with spatial information for more accurate restorations than existing techniques. Conclusions SpecReFlow is a first-of-its-kind solution that combines temporal and spatial information for effective detection and restoration of SR regions, surpassing previous methods relying on single-frame spatial information. Future work will look to optimizing SpecReFlow for real-time applications. SpecReFlow is a software-only solution for restoring image content lost due to SR, making it readily deployable in existing clinical settings to improve endoscopy video quality for accurate diagnosis and treatment.
The surface plasmon resonance (SPR) sensors are technologically attractive for applications that demand quick and accurate biological substance monitoring. Through its typical SPR image response, the resonance conditi...
详细信息
The surface plasmon resonance (SPR) sensors are technologically attractive for applications that demand quick and accurate biological substance monitoring. Through its typical SPR image response, the resonance condition indicated by the minimum reflectivity values works like an optical signature for changes in the refractive index (RI) of the substance under analysis. Recently, the incorporation of machine and deeplearning methods (MDLMs) on SPR sensors to create intelligent tasks along the signal processing chain employed in SPR biosensing was witnessed. One possible intelligent application is substance identification based on the analysis of SPR responses. Occasionally, this problem is addressed with data from SPR curves, requiring a prior SPR image manipulation for the respective curve generation, leading to extra process steps and time consumption. This article presents the design of an intelligent SPR sensor with analyte identification capabilities directly from its SPR image, offering guidance on the precise moment for substance switching during injection routines. An image-based prediction model with convolutional neural networks (CNNs) was fine-tuned to directly identify individually aqueous solutions with different refractive indices. A new approach was described to generate SPR images (Fresnel images) from calculation with the Fresnel analysis (FA) framework. The proposed CNN architecture was evaluated and compared with seven state-of-the-art CNN architectures. The models were integrated into the experimental setup for real-time identification. The experimental tests demonstrate the viability of the overall pipeline for the model conception, being able to reach more than 96% accuracy in performing the identification task.
Noise decreases image quality in optical coherence tomography (OCT) and can obscure important features in real-time visualizations. In this work, we show that a neural network can be applied to denoise volumetric OCT ...
详细信息
ISBN:
(纸本)9781510658394;9781510658400
Noise decreases image quality in optical coherence tomography (OCT) and can obscure important features in real-time visualizations. In this work, we show that a neural network can be applied to denoise volumetric OCT data for intra-surgical visualization in real-time. We adapt a self-supervised training approach, not requiring any paired data for training. Several optimizations and trade-offs in deployment are required, with which we achieved processingtimes of only few milliseconds. While still being limited by the real-time requirements, denoising in this scenario can enhance surface visibility, and therefore allow guidance for more precise intra-surgical maneuvers.
Artificial intelligence, Machine learning, and deeplearning are increasingly making significant contributions to the field of medicine. Individual patient conditions, disease localization, and various influencing fac...
详细信息
ISBN:
(纸本)9798350373981;9798350373974
Artificial intelligence, Machine learning, and deeplearning are increasingly making significant contributions to the field of medicine. Individual patient conditions, disease localization, and various influencing factors underscore the complexity of disease diagnosis and treatment planning. Introducing new technologies can revolutionize medical diagnostics, facilitating swift and accurate assessments. Among the noninvasive diagnostic methods, Magnetic Resonance Imaging (MRI) stands out, particularly in tumor diagnosis. UNet, renowned for its effectiveness in medical image analysis, serves as a robust model for semantic segmentation, as does deepLabV3+. However, these models are inherently complex, and their inference process can be time-consuming. By leveraging the OpenVINO toolkit, the inference process is significantly reduced. In this study, nearly a 2-fold acceleration is achieved in inference time with the deepLabV3+ model and a roughly 1.2-fold improvement with the UNet model on CPU. Moreover, when employing GPU with FP16 precision, the acceleration reached almost 2.5-fold for UNet and nearly 3-fold for deepLabV3+, showcasing the substantial performance enhancements attainable through optimized hardware utilization.
imageprocessing techniques such as blurring, JPEG compression are applied to natural images to meet different objectives. Additionally, corruptions such as Gaussian and shot noise appear on images due to digital fluc...
详细信息
ISBN:
(纸本)9783031581731;9783031581748
imageprocessing techniques such as blurring, JPEG compression are applied to natural images to meet different objectives. Additionally, corruptions such as Gaussian and shot noise appear on images due to digital fluctuations. Unfortunately, standard vision models tend to perform quite poorly under such unavoidable corruptions, i.e., these models are not robust to the distribution shifts induced by these corruptions at test time. The standard approach for overcoming this issue for a known corruption is by augmenting the training data with images perturbed using the corruption of interest. Motivated by settings where the corruption might not be known during training, Gaussian noise is used as an augmentation strategy to gain robustness to high-frequency corruptions. In this paper, we try to understand its properties from a Fourier lens. However, we show that Gaussian augmentation fails to maintain robustness to few high-frequency corruptions at high severity levels. Analyzing the Fourier signature of those corruptions reveal a change in behavior - at high severity they corrupt low frequencies as well. A Gaussian-trained model loses its performance due to this change. Current augmentation strategies for low-frequency corruptions are discussed at the end.
This paper proposes a deep joint source-channel coding (DJSCC) to minimize the age of information (AoI) for image transmission. A new content-based AoI metric called age of misclassified information (AoMI) is introduc...
详细信息
The real-time assessment and prognostication of food texture during mastication are paramount for comprehending and emulating the chewing process, bearing significant ramifications for the food industry. This study un...
详细信息
ISBN:
(纸本)9780791887639
The real-time assessment and prognostication of food texture during mastication are paramount for comprehending and emulating the chewing process, bearing significant ramifications for the food industry. This study unveils a deeplearning-oriented recognition framework utilizing Mask R-CNN to analyze food texture throughout the mastication process. We create an extensive food bolus image compendium and train a deeplearning model proficient in texture discrimination and processing parameter estimation. A vision system, incorporating an Intel realSense D435i camera, is harnessed to capture high-resolution images of the food bolus. The dataset encompasses images of masticated peanut samples, with the number of chews varying from 1 to 12 cycles. Although the model's predictions occasionally diverge from the actual data points, with a maximum discrepancy of +/- 2 chewing instances chews except for predicting peanuts chewed 11 cycles, the proposed system lays the groundwork for real-time evaluation and prediction of food texture during mastication. Future inquiries could concentrate on ameliorating the model's precision, broadening its applicability to diverse food categories, and refining the training dataset. This investigation holds the potential to influence the advancement of delectable and nourishing food products, bestowing benefits upon both the food industry and consumers.
This paper explores the integration of Digital Twin (DT) technology and machine learning (ML) techniques for predictive maintenance in electric vehicles (EVs), particularly focusing on fault diagnosis in electric moto...
详细信息
ISBN:
(纸本)9798331540845;9789887581598
This paper explores the integration of Digital Twin (DT) technology and machine learning (ML) techniques for predictive maintenance in electric vehicles (EVs), particularly focusing on fault diagnosis in electric motors (EMs). DT technology, which creates a digital replica of a physical entity, combined with real-time sensor data analytics, offers significant potential for improving the reliability and operational efficiency of EMs under variable and intense conditions. By incorporating advanced ML models like Transformers, which are adept at capturing long-range dependencies, into the DT framework, the study addresses critical challenges in fault diagnosis under non-stationary conditions typical in EVs. The research introduces a novel approach that combines Transformers with convolutional layers to enhance the fault diagnosis capabilities by effectively capturing complex patterns in time-series data. Furthermore, incremental learning is proposed within the DT framework to adapt to evolving environments and maintain high diagnostic performance, aiming for domain invariance in fault diagnosis models. The paper evaluates various deeplearning models, including Transformers, FCNs, CNNs, Inception, ResNet, and MLP, across metrics such as accuracy, training duration, and inference time, identifying CNNs and FCNs as optimal due to their balance between performance and processingtime. This study not only demonstrates the feasibility of advanced ML techniques in enhancing DT-based fault diagnosis but also provides insights into selecting appropriate models for predictive maintenance applications in EVs, highlighting the potential for future research in adapting these technologies to different datasets and operational scenarios.
暂无评论