image Quality Assessment (IQA) has got importance in the computer vision applications as it provides tool to evaluate and rate different imageprocessingalgorithms. image Fusion is a process in which information from...
详细信息
image Quality Assessment (IQA) has got importance in the computer vision applications as it provides tool to evaluate and rate different imageprocessingalgorithms. image Fusion is a process in which information from multiple images is combined into a single image. Due to specific nature of fused images present IQA methods have limitations for evaluation of image Fusion algorithms. With the recent development of Deep Convolutional Neural Networks (Deep CNNs), No- reference image quality assessment is becomes reality. This article has proposed the pre-trained Deep CNNs based image fusion classification using Alexnet, vGG19, Inception v3 and ResNet-50. Four states–of–the-art image fusion algorithms used for image fusion are Laplacian Pyramid (LP), Shift Invariant DWT (SIDWT), Discrete Wavelet Transform (DWT) and Ratio Pyramid (RP). To achieve the effective IQA, sufficiently large dataset of synthetically fused images is created and same is evaluated by using Deep CNNs. The results show that recent deep CNN methods correctly classify the fused images into corresponding categories based on its fusion algorithms. The results are consistent with FR-IQA methods. ResNet-50 provides best classification accuracy with less number of epochs and time to converge due to sparse network connections.
Breast cancer continues to be a major health concern worldwide. Early and accurate prediction is crucial for effective treatment and improving survival rates. Computer Aided Diagnosis system serves as an invaluable to...
详细信息
Breast cancer continues to be a major health concern worldwide. Early and accurate prediction is crucial for effective treatment and improving survival rates. Computer Aided Diagnosis system serves as an invaluable tool for radiologists, aiming to reduce diagnostic errors and enhance the accuracy of diagnosis. These systems incorporate various processing techniques, including pre-processing, segmentation, feature extraction, and classification. Moreover, deep learning methods frequently suffer from sub optimal performance and demand substantial computational resources. This study focuses on developing an automated classification model for mammography images to aid in breast cancer diagnosis. Our proposed model initiates with noise removal using median filters, followed by the removal of the pectoral muscle in images through the Canny-edge detection method. On these preprocessed images, we applied data augmentation using a two-point crossover technique, addressing issues of small datasets and class imbalances common in medical image analysis. The images then undergo multi-scale representation via the fourth-order complex diffusion algorithm. Feature extraction is conducted on these multi-scaled images using a Hierarchical variational Auto-encoders and then classified using a Support vector Machine. Employing fourth-order complex diffusion for initial multi-scale representation significantly enhances the accuracy of feature extraction resulting in robust classification performance. The training process involves two different datasets like MIAS and the KAU-BCMD. Test results for the KAU-BCMD dataset include: accuracy of 99.80%, Area Under the Curve of 99.30%, F1-score of 99.20%, balanced accuracy of 99.80%, and Matthews correlation coefficient of 99.20%. For the MIAS dataset, test results show accuracy of 99.30%, Area Under the Curve of 99.10%, F1-score of 98.30%, balanced accuracy of 99.00%, and Matthews correlation coefficient of 99.00%. Our validation results clearl
Nowadays, video analysis applications are gaining popularity given the rise of CCTvsystems and the availability of video cameras to the general public, such as cameras in mobile devices. Many image analysis and proce...
详细信息
Colorectal cancer is the third-most common cancer in the Western Hemisphere. The segmentation of colorectal and colorectal cancer by computed tomography is an urgent problem in medicine. Indeed, a system capable of so...
详细信息
Smart cities are planned to have millions of Internet-connected sensors and devices. Sensors can create a huge amount of data in a range of applications. In modern urban environments, quality of life in a Smart City i...
详细信息
The significance of high-speed machine vision in scientific and technological fields is growing, especially with the era of Industry 4.0 technologies. There are several pattern-matching algorithms that have various in...
详细信息
The significance of high-speed machine vision in scientific and technological fields is growing, especially with the era of Industry 4.0 technologies. There are several pattern-matching algorithms that have various intriguing applications in ultralow-latency machine vision processing. However, the low frame rate of image sensors—which usually operate at tens of hertz—fundamentally limits the processing rate. The paper will conceptualize and develop the computerized pattern recognition technique that can be applied to investigate light beam profiles and extract the desired information according to the purpose required in this case study. In the current work, the automatic detection and inspection of laser spots were designed to perform analysis and alignment for the laser beam in comparison with the electron spot beam using the LabvIEW graphical programming environment, especially when the laser and electron beams overlap. This is one of the important steps for realizing the fundamental aim of test-FEL to produce short wavelengths with the second, third, and fifth harmonics at 131.5, 88, and 53 nm, respectively. The tentative version of the program achieved the elementary purpose, which fulfilled the accurate transversal alignment of the ultrashort laser pulses with the electron beam in the system of the FEL test facility at MAX-Lab, in addition to studying the beam’s stability and jittering range.
Ardabil is well-known for offering the ideal environment for a good, cheap medicinal herb. various plant parts are used as essential components in producing natural medicines. According to IUCN (International Union fo...
详细信息
We show an experimental method of quantifying the effect of light scattering by liquid crystals (LCs) and then apply rather simple imageprocessingalgorithms (Wiener deconvolution and contrast-limited adaptive histog...
详细信息
We show an experimental method of quantifying the effect of light scattering by liquid crystals (LCs) and then apply rather simple imageprocessingalgorithms (Wiener deconvolution and contrast-limited adaptive histogram equalization) to improve the quality of obtained images when using electrically tunable LC lenses (TLCLs). Better contrast and color reproduction have been achieved. We think that this approach will allow the use of thicker LC cells and thus increase the maximum achievable optical power of the TLCL without a noticeable reduction of image quality. This eliminates one of the key limitations for their use in various adaptive imaging applications requiring larger apertures. (C) 2020 Optical Society of America
While generative models such as text-to-image, large language models and text-to-video have seen significant progress, the extension to text-to-virtual-reality remains largely unexplored, due to a deficit in training ...
详细信息
ISBN:
(数字)9798350376876
ISBN:
(纸本)9798350376883
While generative models such as text-to-image, large language models and text-to-video have seen significant progress, the extension to text-to-virtual-reality remains largely unexplored, due to a deficit in training data and the complexity of achieving realistic depth and motion in virtual environments. This paper proposes an approach to coalesce existing generative systems to form a stereoscopic virtual reality video from text. Carried out in three main stages, we start with a base text-to-image model that captures context from an input text. We then employ Stable Diffusion on the rudimentary image produced, to generate frames with enhanced realism and overall quality. These frames are processed with depth estimation algorithms to create left-eye and right-eye views, which are stitched side-by-side to create an immersive viewing experience. Such systems would be highly beneficial in virtual reality production, since filming and scene building often require extensive hours of work and post-production effort. We utilize image evaluation techniques, specifically Fréchet Inception Distance and CLIP Score, to assess the visual quality of frames produced for the video. These quantitative measures establish the proficiency of the proposed method. Our work highlights the exciting possibilities of using natural language-driven graphics in fields like virtual reality simulations.
Deep Neural Network (DNN) belongs to an important class of machine learning algorithms generally used to classify digital data in the form of image and speech recognition. The computational complexity of a DNN-based i...
详细信息
暂无评论