With the rapid development of deeplearning in the last decade, generating and processingreal-timeimages have become one of critical methods in intelligent driving systems for new energy vehicles. However, the real-...
详细信息
With the rapid development of deeplearning in the last decade, generating and processingreal-timeimages have become one of critical methods in intelligent driving systems for new energy vehicles. However, the real-timeimages captured by sensors are susceptible to variations in various environments, including different weather and lighting conditions. To enhance the real-timeimage generation performance for new energy vehicles in complex environments, and improve real-time visual imageprocessing capabilities, this study proposes an energy-efficient real-time visual image adversarial generation and processing algorithm, called as ENV-GAN. It hypothesizes a shared latent domain among mixed image domains after analyzing driving situations under various weather and lighting conditions. Mappings are established between different image domains. Besides, a multi-encoder weight-sharing technique is utilized to enhances the generative adversarial network model. Additionally, the algorithm integrates an attention module to enhance the model & acirc;(TM) s image generation. Experimental results and analysis demonstrate that the new algorithm outperforms existing algorithms in tasks such as defogging, rain removal, and lighting enhancement, offering high energy efficiency and low energy consumption.
deeplearning methods have emerged as powerful tools for analyzing histopathological images, but current methods are often specialized for specific domains and software environments, and few open-source options exist ...
详细信息
deeplearning methods have emerged as powerful tools for analyzing histopathological images, but current methods are often specialized for specific domains and software environments, and few open-source options exist for deploying models in an interactive interface. Experimenting with different deeplearning approaches typically requires switching software libraries and reprocessing data, reducing the feasibility and practicality of experimenting with new architectures. We developed a flexible deeplearning library for histopathology called Slideflow, a package which supports a broad array of deeplearning methods for digital pathology and includes a fast whole-slide interface for deploying trained models. Slideflow includes unique tools for whole-slide image data processing, efficient stain normalization and augmentation, weakly-supervised whole-slide classification, uncertainty quantification, feature generation, feature space analysis, and explainability. Whole-slide imageprocessing is highly optimized, enabling whole-slide tile extraction at 40x magnification in 2.5 s per slide. The framework-agnostic data processing pipeline enables rapid experimentation with new methods built with either Tensorflow or PyTorch, and the graphical user interface supports real-time visualization of slides, predictions, heatmaps, and feature space characteristics on a variety of hardware devices, including ARM-based devices such as the Raspberry Pi.
deeplearning (DL) drives academics to create models for cancer diagnosis using medical imageprocessing because of its innate ability to recognize difficult-to-detect patterns in complex, noisy, and massive data. The...
详细信息
deeplearning (DL) drives academics to create models for cancer diagnosis using medical imageprocessing because of its innate ability to recognize difficult-to-detect patterns in complex, noisy, and massive data. The use of deeplearning algorithms for realtime cancer diagnosis is explored in depth in this work. real-time medical diagnosis determines the illness or condition that accounts for a patient's symptoms and outward physical manifestations within a predetermined time frame. With a waiting period of anywhere between 5 days and 30 days, there are currently several ways, including screening tests, biopsies, and other prospective methods, that can assist in discovering a problem, particularly cancer. This article conducts a thorough literature review to understand how DL affects the length of this waiting period. In addition, the accuracy and turnaround time of different imaging modalities is evaluated with DL-based cancer diagnosis. Convolutional neural networks are critical for real-time cancer diagnosis, with models achieving up to 99.3% accuracy. The effectiveness and cost of the infrastructure required for real-timeimage-based medical diagnostics are evaluated. According to the report, generalization problems, data variability, and explainable DL are some of the most significant barriers to using DL in clinical trials. Making DL applicable for cancer diagnosis will be made possible by explainable DL.
Driver drowsiness is among the strong causes of global road accidents. To counter such risk, we introduce a sophisticated detection system based on deeplearning and imageprocessing to counteract that type of risk. W...
详细信息
ISBN:
(数字)9798331506452
ISBN:
(纸本)9798331506469
Driver drowsiness is among the strong causes of global road accidents. To counter such risk, we introduce a sophisticated detection system based on deeplearning and imageprocessing to counteract that type of risk. We visualize designing an end-to-end solution about the precise model to recognize drowsiness from facial images effectively. In the detailed literature review of published works based on drowsiness detection, we identified critical gaps and improvements. Our approach combines the use of CNNs to extract relevant features and classify states of alertness with special techniques applied for imageprocessing to identify facial and eye states. In developing the system, we carefully set up a detailed procedure for data acquisition, model training, and testing. The performance results indicate tremendous strides in detection accuracy; the model yields strong results in controlled experiments and real-world use cases. This work outlines opportunities for enhanced driver-assistance technologies in improving safety on the roads, and we describe future research directions and application scenarios to further extend the development of drowsiness detection systems.
Recent advances in camera design and imaging technology have enabled the capture of high-quality images using smartphones. However, due to the limited dynamic range of digital cameras, the quality of photographs captu...
详细信息
Recent advances in camera design and imaging technology have enabled the capture of high-quality images using smartphones. However, due to the limited dynamic range of digital cameras, the quality of photographs captured in environments with highly imbalanced lighting often results in poor-quality images. To address this issue, most devices capture multi-exposure frames and then use some multi-exposure fusion method to merge those frames into a final fused image. Nevertheless, most traditional and current deeplearning approaches are unsuitable for real-time applications on mobile devices due to their heavy computational and memory requirements. We propose MobileMEF, a new method for multi-exposure fusion based on an encoder-decoder deeplearning architecture with efficient building blocks tailored for mobile devices. This efficient design makes MobileMEF capable of processing 4K resolution images in less than 2 s on mid-range smartphones. MobileMEF outperforms state-of-the-art techniques regarding full-reference quality measures and computational efficiency (runtime and memory usage), making it ideal for real-time applications on hardware-constrained devices. Our code is available at: https://***/LucasKirsten/MobileMEF.
In this paper, we introduce a physics-guided deeplearning approach for high-quality, real-time Fourier-domain optical coherence tomography (FD-OCT) image reconstruction. Unlike traditional supervised deeplearning me...
详细信息
In this paper, we introduce a physics-guided deeplearning approach for high-quality, real-time Fourier-domain optical coherence tomography (FD-OCT) image reconstruction. Unlike traditional supervised deeplearning methods, the proposed method employs unsupervised learning. It leverages the underlying OCT imaging physics to guide the neural networks, which could thus generate high-quality images and provide a physically sound solution to the original problem. Evaluations on synthetic and experimental datasets demonstrate the superior performance of our proposed physics-guided deeplearning approach. The method achieves the highest image quality metrics compared to the inverse discrete Fourier transform (IDFT), the optimization-based methods, and several state-of-the-art methods based on deeplearning. Our method enables real-time frame rates of 232 fps for synthetic images and 87 fps for experimental images, which represents significant improvements over existing techniques. Our physics-guided deeplearning-based approach could offer a promising solution for FD-OCT image reconstruction, which potentially paves the way for leveraging the power of deeplearning in real-world OCT imaging applications. (c) 2024 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement
Twitter spam needs to be detected and arrested quickly. The paper examines methods for classification of spam in terms of determination of important features, comparative performance of classification models, and impr...
详细信息
Twitter spam needs to be detected and arrested quickly. The paper examines methods for classification of spam in terms of determination of important features, comparative performance of classification models, and improvement in time performance for classification. It presents a conceptualization of several novel rich, deep, and na & iuml;ve features. The extraction processes for rich and deep features increase the time complexity of spam classification. To address this, the proposed model selectively segregates and combines features to enable near real-timeprocessing. This supersedes the time performance of standard machine learning and deeplearning models, with no compromise on the quality of classification.
One of the most important occupations in India is agriculture. Out of all the crops, cotton is the best and is crucial to the agricultural economy of the country. In India, 40-50 million people work in the cotton trad...
详细信息
ISBN:
(数字)9798350319019
ISBN:
(纸本)9798350319026
One of the most important occupations in India is agriculture. Out of all the crops, cotton is the best and is crucial to the agricultural economy of the country. In India, 40-50 million people work in the cotton trade and processing, while six million farmers directly depend on the crop. The cotton leaf disease has grown in importance over the last few decades, resulting in losses to crops, farming operations, and financial resources. To achieve this aim, we first need to acquire different images of cotton plants. We can use imageprocessing techniques to analyze dead leaf images and extract features like color, texture, and other characteristics with the deep CNN model’s assistance. In addition to being less expensive and more straightforward, automatic disease detection supports machine vision, which offers image-based automated process control and inspection. To properly train the algorithm, we will be using a dataset of approximately 1752(approximately 440 images in each class) images classified into different categories according to the diseases. This model will be developed using tools present in Anaconda such as Jupyter Notebook, Spyder etc. The results of this project will demonstrate whether using it in real-time applications is feasible and whether traditional or manual disease and pest identification could benefit from the use of IT- based solutions.
The increasing integration of intelligent systems in metropolitan environments emphasizes the important role effective image and video processing performs in applications including the traffic management, the monitori...
详细信息
ISBN:
(数字)9798350357530
ISBN:
(纸本)9798350357547
The increasing integration of intelligent systems in metropolitan environments emphasizes the important role effective image and video processing performs in applications including the traffic management, the monitoring of safety, and the urban planning. When handling dynamic scenarios, that is, changes in traffic conditions, occlusions, and the need for real-time analysis, conventional signal processing methods sometimes fail. Strong and adaptable solutions able to manage both these complexity as well as others are thus much needed. This work intends to introduce a deeplearning-based smart signal system designed to process real-time video feeds for uses connected to urban traffic and safety. The proposed system uses recurrent neural networks (RNNs) for temporal sequence comprehension; conversely, convolutional neural networks (CNNs) are used for spatial data analysis. Two approaches used in order to improve the performance of the model are data augmentation and transfer learning, which increase the accuracy of decision-making using multi-modal data sources, video streams, sensor inputs, environmental parameters. This helps one to get beyond the challenge of small datasets. System performance is discovered using benchmark datasets and video from actual traffic events. Better than the methods now in use, the results show object detection has 95% accuracy and anomaly detection has 92% accuracy. Promising for use in smart cities, the proposed framework exhibits scalability and flexibility over a wide range of events. Its characteristics, traffic optimization, safety monitoring, and data-driven urban planning, which comprise traffic control, safety monitoring, and data-driven urban planning, have the ability to drastically change urban transportation infrastructure management.
In this paper, an urban object detection system via unmanned aerial vehicles (UAVs) is developed to collect real-time traffic information, which can be further utilized in many applications such as traffic monitoring ...
详细信息
In this paper, an urban object detection system via unmanned aerial vehicles (UAVs) is developed to collect real-time traffic information, which can be further utilized in many applications such as traffic monitoring and urban traffic management. The system includes an object detection algorithm, deeplearning model training, and deployment on a real UAV. For the object detection algorithm, the Mobilenet-SSD model is applied owing to its lightweight and efficiency, which make it suitable for real-time applications on an onboard microprocessor. For model training, federated learning (FL) is used to protect privacy and increase efficiency with parallel computing. Last, the FL-trained object detection model is deployed on a real UAV for real-time performance testing. The experimental results show that the object detection algorithm can reach a speed of 18 frames per second with good detection performance, which shows the real-time computation ability of a resource-limited edge device and also validates the effectiveness of the developed system.
暂无评论