Automated and precise segmentation of breast lesions can facilitate early diagnosis of breast cancer. Recent research studies employ deeplearning for automatic segmentation of breast lesions using ultrasound imaging....
详细信息
Automated and precise segmentation of breast lesions can facilitate early diagnosis of breast cancer. Recent research studies employ deeplearning for automatic segmentation of breast lesions using ultrasound imaging. Numerous studies introduce somewhat complex modifications to the well adapted segmentation network, U-Net for improved segmentation, however, at the expense of increased computational time. Towards this aspect, this study presents a low complex deeplearning network, i.e., dense multiplicative attention enhanced encoder decoder network, for effective breast lesion segmentation in the ultrasound images. For the first time in this context, two dense multiplicative attention components are utilized in the encoding layer and the output layer of an encoder-decoder network with depthwise separable convolutions, to selectively enhance the relevant features. A rigorous performance evaluation using two public datasets demonstrates that the proposed network achieves dice coefficients of 0.83 and 0.86 respectively with an average segmentation latency of $19 ms$ . Further, a noise robustness study using an in-clinic recorded dataset without pre-processing indicates that the proposed network achieves dice coefficient of 0.72. Exhaustive comparison with some commonly used networks indicate its adeptness with low time and computational complexity demonstrating feasibility in realtime.
deeplearning-based models have recently shown a strong potential in Underwater image Enhancement (UIE) that are satisfying and have the right colors and details, but these methods significantly increase the parameter...
详细信息
deeplearning-based models have recently shown a strong potential in Underwater image Enhancement (UIE) that are satisfying and have the right colors and details, but these methods significantly increase the parameters and complexity of the imageprocessing models and therefore cannot be deployed directly to the edge devices. Vision Transformers (ViT) based architectures have recently produced amazing results in many vision tasks such as image classification, super-resolution, and image restoration. In this study, we introduced a lightweight Context-Aware Vision Transformer (CAViT), based on the Mean Head tokenization strategy and uses a self-attention mechanism in a single branch module that is effective at simulating long-distance dependencies and global features. To further improve the image quality we proposed an efficient variant of our model which derived results by applying White Balancing and Gamma Correction methods. We evaluated our model on two standard datasets, i.e., Large-Scale Underwater image (LSUI) and Underwater image Enhancement Benchmark Dataset (UIEB), which subsequently contributed towards more generalized results. Overall findings indicate that our real-time UIE model outperforms other deeplearning based models by reducing the model complexity and improving the image quality (i.e., 0.6 dB PSNR improvement while using only 0.3% parameters and 0.4% float operations).
deep-learning inversion has recently drawn attention in geological carbon storage research due to its potential of imaging and monitoring carbon storage in realtime, significantly improving efficiency and safety of c...
详细信息
deep-learning inversion has recently drawn attention in geological carbon storage research due to its potential of imaging and monitoring carbon storage in realtime, significantly improving efficiency and safety of carbon storage operations. We present a deep-learning full waveform inversion method that after the neural network has been trained can image CO2 saturation and its uncertainty in realtime. Our deep-learning inversion method is based on the U-Net architecture with the neural network trained on pairs of synthetic seismic data and CO2 saturation models. Accordingly, our training establishes a mapping relationship between seismic data and CO2 saturation models and once fully trained directly estimates CO2 saturation as a function of subsurface location. We further quantify uncertainties of CO2 saturation estimates using the Monte Carlo dropout method and a bootstrap aggregating method. For this proof-of-concept study, the CO2 training models and data are derived from the Kimberlina 1.2 model, a hypothetical 3D geological carbon storage model that is constructed based on various geological and hydrological data from the Southern San Joaquin Basin, California. We perform deep-learning inversion experiments using noise-free and noisy training and test data sets and compare the results. Our modelling experiments show that (1) the deep-learning inversion can estimate 2D distributions of CO2 fairly well even in the presence of Gaussian random noise and (2) both CO2 saturation imaging and uncertainty quantification can be done in realtime. Our results suggest that the deep-learning inversion method can serve as a robust real-time monitoring tool for geological carbon storage and/or other time-varying reservoir/aquifer properties that result from injection, extraction, and/or other subsurface transport phenomena.
Modern digital color cameras depend on Color Filter Arrays (CFA) for capturing color information. The majority of the commercial CFAs are designed by hand with different physical and application-specific consideration...
详细信息
ISBN:
(纸本)9781510673878;9781510673861
Modern digital color cameras depend on Color Filter Arrays (CFA) for capturing color information. The majority of the commercial CFAs are designed by hand with different physical and application-specific considerations. The available machine learning (ML)-based CFA learning architectures dismiss the considerations of a physical camera device. This study aims to develop an alternative approach to jointly learn binary Color Filter Arrays (CFA) in a deeplearning-based filtering-demosaicing pipeline. The proposed approach provides higher reconstruction performance than the compared hand-designed filters while learning physically applicable CFAs. This paper includes the learned binary CFAs for various color configurations and training data size, their analysis with common reconstruction metrics, and a short discussion on future works.
In a world of electronic data and communication, there is an urgent need for safe, fast, and automatic review on documents. With this in mind, this project is meant to develop an all-encompassing system that can accur...
详细信息
ISBN:
(数字)9798350373783
ISBN:
(纸本)9798350373790
In a world of electronic data and communication, there is an urgent need for safe, fast, and automatic review on documents. With this in mind, this project is meant to develop an all-encompassing system that can accurately detect text information in submitted documents as well as verify signatures. The system incorporates a set of external records supplied by consumers like the document under consideration and reference signatures. The system has two main objectives: the anchor model is used in the first case while authenticating signatures in the second one. The objective is to establish if the signature is genuine or contains defects pointing at falsification or illegal tampering. Generally, these processes give rise to comprehensive reports. These reports give credence and authenticity of the submitted documents while providing clarification whether a submitted document is valid or not through determination of match between document’s signature and reference signatures, as well as any inconsistencies noted throughout the evaluation process. This paper will help ensure safe and faster authentication of documents at a time when the world depends on accuracy and speed of document verification.
Motion compensation in radiation therapy is a challenging scenario that requires estimating and forecasting motion of tissue structures to deliver the target dose. Ultrasound offers direct imaging of tissue in real-ti...
详细信息
Motion compensation in radiation therapy is a challenging scenario that requires estimating and forecasting motion of tissue structures to deliver the target dose. Ultrasound offers direct imaging of tissue in real-time and is considered for image guidance in radiation therapy. Recently, fast volumetric ultrasound has gained traction, but motion analysis with such high-dimensional data remains difficult. While deeplearning could bring many advantages, such as fast data processing and high performance, it remains unclear how to process sequences of hundreds of image volumes efficiently and effectively. We present a 4D deeplearning approach for real-time motion estimation and forecasting using long-term 4D ultrasound data. Using motion traces acquired during radiation therapy combined with various tissue types, our results demonstrate that long-term motion estimation can be performed markerless with a tracking error of 0.35 +/- 0.2 mm and with an inference time of less than 5 ms. Also, we demonstrate forecasting directly from the image data up to 900 ms into the future. Overall, our findings highlight that 4D deeplearning is a promising approach for motion analysis during radiotherapy.
image encryption is one of the most widely used techniques for securing images in trusted and unrestricted public media. However, the drawback is weak security in chaotic encryption algorithms, small key space and har...
详细信息
image encryption is one of the most widely used techniques for securing images in trusted and unrestricted public media. However, the drawback is weak security in chaotic encryption algorithms, small key space and hardware implementation complexities. Various appropriate encryption algorithms have been carried out for secure image transmission;it becomes a critical issue to protect image integrity, confidentiality and authenticity. To overcome these issues, this article examines several existing memristor-based image encryption algorithms with various chaotic maps. As a result, this research aims to deliver comprehensive literature on image encryption techniques through memristor to help the researchers. Finally, this survey provides all vital literature on the current image encryption techniques with their benefits, drawbacks, developments, and future directions. We also give a general guideline about cryptography. We conclude that all methods are helpful for real-timeimage encryption. A comparison has been made between many imageprocessing approaches (conventional and memristor) based on metrics such as the number of pixels change rate (NPCR) with the value of 0.9986, histogram, entropy (7.9993), unified average varying intensity (UACI) achieves the value as 0.4996, energy consumption as 0.32pJ and time complexity as 0.9s. The experimental results of various approaches, key sensitivity and statistical analysis revealed that the survey of memristor-based image encryption schemes provides an effective way to secure image transmission in real-time application.
deeplearning has the potential to enhance speech signals and increase their intelligibility for users of hearing aids. deep models suited for real-world application should feature a low computational complexity and l...
详细信息
deeplearning has the potential to enhance speech signals and increase their intelligibility for users of hearing aids. deep models suited for real-world application should feature a low computational complexity and low processing delay of only a few milliseconds. In this paper, we explore deep speech enhancement that matches these requirements and contrast monaural and binaural processing algorithms in two complex acoustic scenes. Both algorithms are evaluated with objective metrics and in experiments with hearing-impaired listeners performing a speech-in-noise test. Results are compared to two traditional enhancement strategies, i.e., adaptive differential microphone processing and binaural beamforming. While in diffuse noise, all algorithms perform similarly, the binaural deeplearning approach performs best in the presence of spatial interferers. Through a post-analysis, this can be attributed to improvements at low SNRs and to precise spatial filtering.
This study analyzes the phenomenon of electromagnetic (EM) leakage that occurs through cables and explores the potential for information forensics using deeplearning-based image-processing algorithms. We focus on the...
详细信息
This study analyzes the phenomenon of electromagnetic (EM) leakage that occurs through cables and explores the potential for information forensics using deeplearning-based image-processing algorithms. We focus on the transition-minimized differential signaling (TMDS) interface to analyze information leakage caused by the inherent differential signal synchronization errors in video graphics controllers (VGC). Our analysis includes detailed mathematical modeling of the EM leakage phenomena from the video display unit (VDU) interface that uses the TMDS protocol. Furthermore, this study presents mathematical models for distortions and alterations caused by the VDU characteristics and its associated RF front-end system. Utilizing mathematical models of EM phenomena, this paper presents a method for creating training datasets for deeplearning-based signal processing algorithms by generating and augmenting pseudo leakage signals (PLS) that closely resemble actual leakage signals. This study confirms the practical utility of signal enhancement models trained with generated and augmented PLS in real-world scenarios. Validation involves applying the trained model to measured actual VDU leakage signals and evaluating the results using image quality metrics: peak signal-to-noise ratio (PSNR), signal-to-noise ratio (SNR), and the structural similarity index measure (SSIM). Ultimately, this study demonstrates the potential to develop deeplearning models using theoretically generated PLS for VDU-targeted side-channel attacks, where collecting real training data poses a challenge. This suggests the potential for expanding into high-performance deeplearning algorithms in future developments.
The quality of welds is critical to the safety and reliability of steel structure connections, underscoring the importance of accurate inspection during the welding process. To enhance inspection effectiveness, deep l...
详细信息
The quality of welds is critical to the safety and reliability of steel structure connections, underscoring the importance of accurate inspection during the welding process. To enhance inspection effectiveness, deeplearning methods have gained popularity in weld defect detection for their ability to automatically learn and refine image features. However, the complex multi-stage training and inference process of these methods often fails to meet the requirements of real-time performance and accuracy. To address this problem, a framework based on the real-time DEtection TRansformer (RT-DETR) for deeplearning-based welding defect detection is proposed. This framework improves the Transformer backbone by eliminating the most time-consuming non-maximum suppression (NMS) step, achieving real-time detection without sacrificing accuracy. A diverse welding dataset with 1,134 images from real-world manufacturing and construction environments was developed for model training and validation. In addition, three data enhancement algorithms were explored to enhance the model's generalization ability. The model achieved detection accuracy scores of mAP@0.5 at 0.996 and mAP@0.5:0.95 at 0.801, with a detection speed of 67 frames per second (FPS). Compared to the previous Faster R-CNN, SSD, YOLOv5, YOLOv11 and DETR models, the proposed RT-DETR model demonstrates superior efficiency and accuracy. The proposed framework was further validated in the on-site inspections of metal additive manufacturing, and the results confirmed that the RT-DETR-based model meets the stringent requirements for real-time inspection in metal additive manufacturing.
暂无评论