In recent years, convolutional neural networks have significantly advanced image segmentation, particularly for brain images, where important edge features are automatically found. However, accurate segmentation of tu...
详细信息
Nowadays, Face Super-Resolution (FSR) models utilize the fusion approach, which combines the attention technique with the super-resolution network. The fusion approach has been proposed and solves the problem of FSR. ...
Nowadays, Face Super-Resolution (FSR) models utilize the fusion approach, which combines the attention technique with the super-resolution network. The fusion approach has been proposed and solves the problem of FSR. Facial attributes have been effectively used to guide low-level information of the face to perform robust face image restoration. Iterative techniques appraised the value of facial landmarks to boost the reconstruction capability of the super-resolution network. Nevertheless, the network parameters in FSR are high, while the learning rate is still low. This paper proposes an attention mechanism combined with the Face Alignment Network (FAN). The proposed attention mechanism consists of channel attention and a non-local module. Our proposed model outperforms at the scale of $\times 16$ compared to the other state-of-the-art models.
In recent years, convolutional neural networks (CNNs) have achieved remarkable advancement in the field of remote sensing image super-resolution due to the complexity and variability of textures and structures in remo...
详细信息
ISBN:
(数字)9798350381559
ISBN:
(纸本)9798350381566
In recent years, convolutional neural networks (CNNs) have achieved remarkable advancement in the field of remote sensing image super-resolution due to the complexity and variability of textures and structures in remote sensing images (RSIs), which often repeat in the same images but differ across others. Current deep learning-based super-resolution models focus less on high-frequency features, which leads to suboptimal performance in capturing contours, textures, and spatial information. State-of-the-art CNN-based methods now focus on the feature extraction of RSIs using attention mechanisms. However, these methods are still incapable of effectively identifying and utilizing key content attention signals in RSIs. To solve this problem, we proposed an advanced feature extraction module called Channel and Spatial Attention Feature Extraction (CSA-FE) for effectively extracting the features by using the channel and spatial attention incorporated with the standard vision transformer (ViT). The proposed method trained over the UCMerced dataset on scales 2, 3, and 4. The experimental results show that our proposed method helps the model focus on the specific channels and spatial locations containing high-frequency information so that the model can focus on relevant features and suppress irrelevant ones, which enhances the quality of super-resolved images. Our model achieved superior performance compared to various existing models.
In recent years, convolutional neural networks (CNNs) have achieved remarkable advancement in the field of remote sensing image super-resolution due to the complexity and variability of textures and structures in remo...
详细信息
Recent Face Super-resolution (FSR) based on iterative collaboration between facial image recovery network and landmark estimation has succeeded in super-resolving facial images. However, the existing noise in coarse f...
Recent Face Super-resolution (FSR) based on iterative collaboration between facial image recovery network and landmark estimation has succeeded in super-resolving facial images. However, the existing noise in coarse features at the low-level feature extraction leads to inaccurate facial priors such as landmarks and component maps, consequently degrading the super-resolved face image on a large scale. This paper proposes, a Non-local technique for deep attentive face super-resolution network (NLDA). A Non-local module has been designed before the residual channel attention block (RCAB) to eliminate noise degradation on coarse features effectively. The proposed model optimizes feature extraction and improves facial landmark fusion to yield higher-quality super-resolved images. This approach facilitates more accurate landmark estimation and boosts the performance of our model on a large scale and various face poses. Quantitative and qualitative experiments over CelebA and Helen face image datasets show that the proposed method outperforms other state-of-the-art FSR methods in recovering high-quality face images in various face poses and at a large scale.
Golf is widely recognized as one of the most popular sports globally. However, one drawback of playing golf is the relatively high cost of equipment and coaching. While numerous training programs are available to assi...
Golf is widely recognized as one of the most popular sports globally. However, one drawback of playing golf is the relatively high cost of equipment and coaching. While numerous training programs are available to assist players in their practice, there is currently no swing analysis program developed by Thai professionals. In this project, advanced deep learning models were employed: SwingNet, capable of predicting the sequence of eight golf swing events in videos and determining the confidence level of each swing, and MoveN et, designed to identify joint positions on the body and represent them as skeletons. These models were integrated into a customized template-matching algorithm that utilized angle-based measurements to analyze the sequence of golf swings. This analysis assessed the similarity score, represented as a percentage, between two individuals for each golf swing event. Furthermore, various techniques were implemented to enhance the efficiency of SwingN et. Through performance evaluation, it was observed that the efficiency of SwingN et surpassed by one percent compared to the pre-trained model.
This study introduces an advanced methodology to overcome the intrinsic challenges associated with Light Detection and Ranging (LiDAR) technology integrated into Apple's devices, specifically focusing on optimizin...
详细信息
ISBN:
(数字)9798350381764
ISBN:
(纸本)9798350381771
This study introduces an advanced methodology to overcome the intrinsic challenges associated with Light Detection and Ranging (LiDAR) technology integrated into Apple's devices, specifically focusing on optimizing depth map accuracy through multiview angle analysis. Despite the extensive utility of LiDAR in generating detailed three-dimensional models for various applications, from autonomous vehicle navigation to environmental conservation, the efficiency of the technology is often compromised by difficulties in accurately rendering complex geometries from multiple perspectives and by its substantial demands on computational resources. To address these limitations, depth map data are systematically collected and evaluated from four distinct viewing angles (0, 90, 180, and 270 degrees) against established ground truth benchmarks. With a threshold of Intersection over Union (IoU) set at 0.8, it is inferred that the predicted shape exhibits over 80% similarity to the actual shape. When only one view is utilized, an IoU of 0.95 is achieved, alongside a size reduction of 73.07%, a Perimeter Accuracy reduction of 1.56%, and an increase in Root Mean Square Error (RMSE) of 29.37%. This comprehensive analysis highlights the critical need for precise geometric representation and marks a significant step toward enhancing the fidelity of LiDAR measurements. The results of this research indicate a substantial improvement in the accuracy of three-dimensional representations, suggesting a potential shift in the utilization of LiDAR technology across various scientific and infrastructural domains. By proposing a novel framework that optimizes data precision and processing efficiency, this paper contributes to the ongoing discourse on improving 3D modeling and analysis techniques.
Stroke is a significant cause of mortality and disability globally, with its occurrence in the human brain and motor function being linked to various parts of the human body. Stroke victims often experience disabiliti...
详细信息
暂无评论