image fusion is a technique used to merge two or more source images into a single image that incorporates more details than the originals and still offering an accurate depiction about the captured information. Result...
详细信息
image fusion is a technique used to merge two or more source images into a single image that incorporates more details than the originals and still offering an accurate depiction about the captured information. Resultant fused images are more accurate and provide comprehensive information for both the human and machinevision perception for further processing of the image. image fusion provides better performance in the areas like pattern recognition, imageprocessing, computer vision, machine learning and artificial intelligence. In the recent years image fusion has moved out of the laboratories and used in the real time applications. This paper provides the insight of various techniques for image fusion like primitive fusion (Simple averaging, Maxima and Minima, etc.), Discrete Wavelet Transform (DWT) based fusion, Principal Component Analysis (PCA) based fusion, Curvelet transform based fusion etc. On-going through various literatures, it is found that image fusion in spatial domain provides high resolution images, although the fusion algorithms are dependent on the nature of image and also depends on the application for which the image is to be fused. Hence, spectral domain fusion and hybrid fusion techniques are introduced and it is proven to be better than the spatial domain fusion. Comparison of all the techniques along with recent approaches are done to find the best approach towards future research to provide new direction to the researchers in medical sector.
Change detection in multi-temporal remote sensing data enables crucial urban analysis and environmental monitoring applications. However, complex factors like illumination variance and occlusion make robust automated ...
详细信息
ISBN:
(纸本)9798350350494;9798350350500
Change detection in multi-temporal remote sensing data enables crucial urban analysis and environmental monitoring applications. However, complex factors like illumination variance and occlusion make robust automated change interpretation challenging. We propose MaskChanger - a novel deep learning paradigm tailored for satellite image change detection. Our method adapts the segmentation-specialized Mask2Former architecture by incorporating Siamese networks to extract features separately from bi-temporal images, while retaining the original mask transformer decoder. To our knowledge, this is the first study in which change detection is converted from the existing per-pixel classification approach into a mask classification approach. Evaluated on the LEVIR-CD benchmark of over 600 very high-resolution image pairs exhibiting real-world rural and urban changes, MaskChanger achieves F1-Score of 91.96%, outperforming prior transformer-based change detection approaches.
Artificial vision systems will be essential in intelligent machine-visionapplications such as autonomous vehicles, bionic eyes, and humanoid robot eyes. However, conventional digital electronics in these systems face...
详细信息
Artificial vision systems will be essential in intelligent machine-visionapplications such as autonomous vehicles, bionic eyes, and humanoid robot eyes. However, conventional digital electronics in these systems face limitations in system complexity, processing speed, and energy consumption. These challenges have been addressed by biomimetic approaches utilizing optoelectronic synapses inspired by the biological synapses in the eye. Nano- materials can confine photogenerated charge carriers within nano-sized regions, and thus offer significant potential for optoelectronic synapses to perform in-sensor image-processing tasks, such as classifying static multicolor images and detecting dynamic object movements. We introduce recent developments in optoelectronic synapses, focusing on use of photosensitive nanomaterials. We also explore applications of these synapses in recognizing static and dynamic optical information. Finally, we suggest future directions for research on optoelectronic synapses to implement neuromorphic artificial vision.
The mitigation of material defects from additive manufacturing (AM) processes is critical to reliability in their fabricated parts and is enabled by modeling the complex relations between available build monitoring si...
详细信息
The mitigation of material defects from additive manufacturing (AM) processes is critical to reliability in their fabricated parts and is enabled by modeling the complex relations between available build monitoring signals and final mechanical performance. To this end, the present study investigates a machine learning approach for predicting mechanical properties for Ti-6Al-4V fabricated through laser powder bed fusion (PBF-LB) AM using in situ photodiode processing signals. Samples were fabricated under different processing parameters, varying laser powers and scan speeds for the purpose of probing a wide range of microstructure and property variations. Photodiode data were collected during fabrication, later to be arranged in image format and extracted to information-dense vectors by the transferal of deep convolutional neural network (DCNN) structures and weights pretrained on a large computer vision benchmark image database. The extracted features were then used to train and test a newly designed regression model for mechanical properties. Average cross-validation accuracies were found to be 98.7% (r(2) value of 0.89) for the prediction of ultimate tensile strength, which ranged from 900 to 1150 MPa in the samples studied, and 93.1% (r(2 )value of 0.96) for the prediction of elongation to fracture, which ranged from 0 to 17%. Thus, with high accuracy and hardware-accelerated inference speeds, we demonstrate that a transfer learning framework can be used to predict strength and ductility of metal AM components based on processing signals in PBF-LB, illustrating a potential route toward real-time closed-loop control and process optimization of PBF-LB in industrial applications.
Visual content is increasingly being processed by machines for various automated content analysis tasks instead of being consumed by humans. Despite the existence of several compression methods tailored for machine ta...
详细信息
Visual content is increasingly being processed by machines for various automated content analysis tasks instead of being consumed by humans. Despite the existence of several compression methods tailored for machine tasks, few consider real-world scenarios with multiple tasks. In this paper, we aim to address this gap by proposing a task-switchable pre-processor that optimizes input images specifically for machine consumption prior to encoding by an off-the-shelf codec designed for human consumption. The proposed task-switchable pre-processor adeptly maintains relevant semantic information based on the specific characteristics of different downstream tasks, while effectively suppressing irrelevant information to reduce bitrate. To enhance the processing of semantic information for diverse tasks, we leverage pre-extracted semantic features to modulate the pixel-to-pixel mapping within the pre-processor. By switching between different modulations, multiple tasks can be seamlessly incorporated into the system. Extensive experiments demonstrate the practicality and simplicity of our approach. It significantly reduces the number of parameters required for handling multiple tasks while still delivering impressive performance. Our method showcases the potential to achieve efficient and effective compression for machinevision tasks, supporting the evolving demands of real-world applications.
The interactivity of tourism product design improves the user experience and promotes tourism development. However, it faces challenges in technology realization, user experience, data processing, differentiated desig...
详细信息
In the machinevision-based online monitoring of the flotation process, froth images acquired in real-time are subject to color distortion and excessive bright spots caused by inconsistent illumination, which hinders ...
详细信息
In the machinevision-based online monitoring of the flotation process, froth images acquired in real-time are subject to color distortion and excessive bright spots caused by inconsistent illumination, which hinders the effectiveness of image analysis and further online measurement for operating performance indicators. Current imageprocessing methods struggle to correct color distortion and remove excess bright spots in froth images simultaneously. Therefore, in this article, an illumination domain signal-guided unsupervised generative adversarial network (IDS-GUGAN) is proposed for illumination consistency processing of flotation froth images. First, considering the varying effects of inconsistent illumination on froth images, the illumination domain signal-guided image generation (IDS-GIG) mechanism based on the theory of unsupervised disentangled representation learning is designed to achieve adaptive correction of froth images with varying degrees of distortion. Moreover, a novel lightweight double-closed-loop network architecture is introduced to support unsupervised learning utilizing unpaired froth images and improve computational efficiency, which makes the proposed approach highly suitable for industrial applications. Comprehensive experiments on a real tungsten cleaner flotation process dataset and two public benchmark datasets related to image illumination processing tasks consistently endorse the superiority of IDS-GUGAN.
In the field of art and design, different artistic styles endow works with unique charm and expressive power. Computer-aided design (CAD) model processing in art and design refers to the stage of using computer techno...
详细信息
Photoadaptive synaptic devices enable in-sensor processing of complex illumination scenes, while second-order adaptive synaptic plasticity improves learning efficiency by modifying the learning rate in a given environ...
详细信息
Photoadaptive synaptic devices enable in-sensor processing of complex illumination scenes, while second-order adaptive synaptic plasticity improves learning efficiency by modifying the learning rate in a given environment. The integration of above adaptations in one phototransistor device will provide opportunities for developing high-efficient machinevision system. Here, a dually adaptable organic heterojunction transistor as a working unit in the system, which facilitates precise contrast enhancement and improves convergence rate under harsh lighting conditions, is reported. The photoadaptive threshold sliding originates from the bidirectional photoconductivity caused by the light intensity-dependent photogating effect. Metaplasticity is successfully implemented owing to the combination of ambipolar behavior and charge trapping effect. By utilizing the transistor array in a machinevision system, the details and edges can be highlighted in the 0.4% low-contrast images, and a high recognition accuracy of 93.8% with a significantly promoted convergence rate by about 5 times are also achieved. These results open a strategy to fully implement metaplasticity in optoelectronic devices and suggest their visionprocessingapplications in complex lighting scenes. Organic heterojunction transistors are designed to integrate light intensity-adaptive threshold sliding and second-order adaptive metaplasticity. The unique dual adaptability enables the highlighting of 0.4% low-contrast images, and the efficient recognition can be achieved benefiting from the learning rate changes in the backpropagation process. image
This study presents a novel approach for pH estimation in buffer solutions using images of solutions prepared with Hibiscus sabdariffa L. as a natural pH indicator. The images of the solutions, each displaying distinc...
详细信息
This study presents a novel approach for pH estimation in buffer solutions using images of solutions prepared with Hibiscus sabdariffa L. as a natural pH indicator. The images of the solutions, each displaying distinctive colours indicative of their pH levels, were transformed into standardized 200x200-pixel images through the application of imageprocessing techniques. Following this, a pH prediction model was constructed using the Adaptive Boosting regressor algorithm. The pH values of the training data used when training the model were distributed irregularly between 0-14. The models were trained with 94 pictures and 1880 experimental values. In addition, a reliable pre-processing part has been placed into the model using imageprocessing techniques, allowing test data to be obtained in any desired environment. The obtained training and test data were separated from noise parameters, affecting the prediction results negatively. A smartphone application based on the model has been developed and made available to everyone. This innovative methodology bridges the gap between traditional pH measurement techniques and computer vision, offering amore accessible and eco-friendly means of pH assessment. The practical applications of this research extend to various fields, including environmental monitoring, agriculture, and educational settings.
暂无评论