Fashion is the way we present ourselves which mainly focuses on vision, has attracted great interest from computer vision researchers. It is generally used to search fashion products in online shopping malls to know t...
详细信息
Reference-based image super-resolution methods, which enhance the restoration of a low-resolution (LR) images by introducing an additional high-resolution (HR) reference image, have made rapid and remarkable progress ...
详细信息
ISBN:
(数字)9783031251986
ISBN:
(纸本)9783031251979;9783031251986
Reference-based image super-resolution methods, which enhance the restoration of a low-resolution (LR) images by introducing an additional high-resolution (HR) reference image, have made rapid and remarkable progress in the field of image super-resolution in recent years. Most of the existing methods use an implicit correspondence matching approach to transfer HR features from the reference image (Ref) to the LR image. However, these methods lack the further judgment and processing of the HR features from Ref, which limits them in challenging cases. In this paper, We propose an image super-resolution method based on mixed attention and feature transfer (MAFT). First, we obtain the deep features of the LR and Ref images through the encoder network, then extract the transferred features from Ref through the attention network, and perform adaptive optimization processing on the features, and finally fuse the transferred features with LR features to achieve a high-quality image reconstruction. The quantitative and qualitative experiments on these benchmarks, i.e., CUFED5, Urban100 and Manga109, show that MAFT outperforms the state-of-the-art baselines with significant improvements.
The importance of speech emotion recognition has increased as a result of the acceptance of intelligent conversational assistant services. The communication between humans and machines may be made better via emotion r...
详细信息
As a kind of noise, speckle seriously affects the imaging quality of optical imaging system. However, the speckle image carries a large amount of information related to the physical characteristics of the object surfa...
详细信息
Facial expression recognition is a popular and challenging area of research in machine learning applications. Facial expressions are critical to human communication and allow us to convey complex thoughts and emotions...
详细信息
Facial expression recognition is a popular and challenging area of research in machine learning applications. Facial expressions are critical to human communication and allow us to convey complex thoughts and emotions beyond spoken language. The complexity of facial expressions creates a difficult problem for computer vision systems, especially edge computing systems. Current Deep Learning (DL) methods rely on large-scale Convolutional Neural Networks (CNN) which require millions of floating point operations (FLOPS) to accomplish similar image classification tasks. However, on edge and IoT devices, large-scale convolutional models can cause problems due to memory and power limitations. The intent of this work is to propose a neural network architecture inspired by deep CNNs which is tuned for deployment on edge devices and small-form-factor edge AI accelerators. This will be carried out by strategically reducing the size of the network while still achieving good discrimination between classes. Additionally, performance metrics such as latency, accuracy, throughput, and power consumption will be captured and compared with several popular deep CNN models. It is expected that there will be trade-offs between network size and performance when the model is deployed and running model inference on edge AI accelerators such as the Intel Movidius Neural Compute Stick ii and the NVIDIA Jetson Nano GPU accelerator. An additional benefit of smaller-scale convolutional models is that they are better suited to be converted into spiking neural networks and deployed on neuromorphic hardware such as the Intel Loihi neuromorphic chip. Furthermore, this work will also examine various imageprocessing techniques across multiple datasets in an effort to increase the performance of the edge-efficient model.
image segmentation is a central topic in imageprocessing and computer vision and a key issue in many applications, e.g., in medical imaging, microscopy, document analysis and remote sensing. According to the human pe...
详细信息
Signal processing has become central to many fields, from coherent optical telecommunications, where it is used to compensate signal impairments, to video imageprocessing. imageprocessing is particularly important f...
详细信息
Signal processing has become central to many fields, from coherent optical telecommunications, where it is used to compensate signal impairments, to video imageprocessing. imageprocessing is particularly important for observational astronomy, medical diagnosis, autonomous driving, big data and artificial intelligence. For these applications, signal processing traditionally has mainly been performed electronically. However these, as well as new applications, particularly those involving real time video imageprocessing, are creating unprecedented demand for ultrahigh performance, including high bandwidth and reduced energy consumption. Here, we demonstrate a photonic signal processor operating at 17 Terabits/s and use it to process video image signals in real-time. The system processes 400,000 video signals concurrently, performing 34 functions simultaneously that are key to object edge detection, edge enhancement and motion blur. As compared with spatial-light devices used for imageprocessing, our system is not only ultra-high speed but highly reconfigurable and programable, able to perform many different functions without any change to the physical hardware. Our approach is based on an integrated Kerr soliton crystal microcomb, and opens up new avenues for ultrafast robotic vision and machine learning.
In the image super-resolution algorithm model, a large receptive field can provide more valuable features, so the Transformer with strong information interaction ability has achieved excellent results in image super-r...
详细信息
The article proposes a fusion technique and an algorithm for combining images recorded in the IR and visible spectrum in relation to the problem of processing products by robotic complexes in dust and fog. Primary dat...
详细信息
ISBN:
(纸本)9781510655461
The article proposes a fusion technique and an algorithm for combining images recorded in the IR and visible spectrum in relation to the problem of processing products by robotic complexes in dust and fog. Primary data processing is based on the use of a multi-criteria processing with complex data analysis and cross-change of the filtration coefficient for different types of data. The search for base points is based on the application of the technique of reducing the range of clusters (image simplification) and searching for transition boundaries using the approach of determining the slope of the function in local areas. As test data used to evaluate the effectiveness, pairs of test images obtained by sensors with a resolution of 1024x768 (8 bit, color image, visible range) and 640x480 (8 bit, color, IR image) are used. images of simple shapes are used as analyzed objects.
Neural volume rendering methods, especially NeRF, have demonstrated remarkable performance in novel view synthesis. However, NeRF relies solely on image data and lacks explicit geometric information, necessitating a l...
详细信息
ISBN:
(纸本)9789819607730;9789819607747
Neural volume rendering methods, especially NeRF, have demonstrated remarkable performance in novel view synthesis. However, NeRF relies solely on image data and lacks explicit geometric information, necessitating a large number of posed images and a computationally intensive ray sampling strategy to learn accurate scene representations. This poses challenges and may result in incomplete or locally optimal scene geometry when views are sparse or incomplete, as the limited views may not provide sufficient constraints to determine a unique geometry solution for complex scenes. Meanwhile, sparse point clouds provide an attractive source of scene information, especially for geometry, to complement images in neural scene representations, particularly when input views are sparse. To overcome these limitations, we propose (SNeRF)-Ne-2, a novel Neural Radiance Field that simultaneously incorporates features from both point clouds and images for volume rendering. Specifically, (SNeRF)-Ne-2 extracts patch-wise point features from point clouds and raywise image features from adjacent views. Then the scene feature volume is constructed by implicitly fusing these point and image features through self-attention. Finally, the volume feature is utilized to render novel views of the scene. Experimental results on the challenging TartanAir dataset demonstrate that, thanks to the integration of feature volume from point clouds and images, (SNeRF)-Ne-2 achieves state-of-the-art performance in novel view synthesis.
暂无评论