Brain MRI synthesis technology addresses the challenge of missing MRI modalities in the clinical domain. We strongly emphasize harnessing the full potential of multi-modal MRI data and the spatial correlations within ...
详细信息
ISBN:
(数字)9798350387780
ISBN:
(纸本)9798350387797
Brain MRI synthesis technology addresses the challenge of missing MRI modalities in the clinical domain. We strongly emphasize harnessing the full potential of multi-modal MRI data and the spatial correlations within brain structures, so we proposed a method for synthesizing brain MRI images. This method distills latent information from available MRI modalities, providing guidance for synthesizing the missing MRI modality, thereby transcending the constraints of spatial structural relevance within the task of 3D brain MRI image synthesizing. Our experiments demonstrate that our method can generate high-quality 3D brain MRI images.
Face super resolution can greatly improve the performance of various facial analysis applications, such as face recognition and facial expression analysis. This paper introduces a facial super-resolution network model...
详细信息
ISBN:
(数字)9798331506056
ISBN:
(纸本)9798331506063
Face super resolution can greatly improve the performance of various facial analysis applications, such as face recognition and facial expression analysis. This paper introduces a facial super-resolution network model that utilizes residual estimation for enhancing image quality. The model engages a two-stage reconstruction process to generate super resolution face images, which effectively simplifies the learning complexities of the network. Moreover, the model incorporates prior facial information into the loss function to mitigate the influence of the background on the facial region within the image. This incorporation facilitates a more accurate reconstruction of super resolution facial images. The robustness and effectiveness of this facial super-resolution network based on residual estimation are evaluated through both quantitative and qualitative assessments, employing some classical facial image datasets.
In the past two years, the large language model has set off a new wave of research in the field of natural language processing, showing the ability of general-purpose artificial intelligence, which has been widel...
详细信息
Micro-expression (ME) is a subtle change in the face, which can be used to judge human subjective feelings. It has broad application prospects in medical diagnosis and business negotiation. However, due to the complex...
Micro-expression (ME) is a subtle change in the face, which can be used to judge human subjective feelings. It has broad application prospects in medical diagnosis and business negotiation. However, due to the complexity of ME muscle movements and the lack of ME trainable data, the research of micro-expression recognition (MER) still faces a series of challenges. In this paper, we propose a new divided block multiscale convolution network (DBMNet), which could learn from four different optical flow (OF) feature images obtained between the onset and apex frames of ME samples. Through the proposed block-divided multiscale convolution module (BMCM), more detailed and useful multiscale advanced features of ME could be effectively extracted. In order to better address the problem of class imbalance on the ME dataset, this paper uses the weighted cross entropy (CE) loss function, which could obviously alleviate the impact of class imbalance. Finally, 5-class experiment is conducted on the composite dataset to show that the proposed method has superior performance and is comparable to those of the most advanced methods.
Electronic payments have become the primary mode of payment today, and bank card recognition is widely used in industries such as mobile, mobile banking, and third-party payments. To address issues such as low fault t...
详细信息
With the popularity of depth sensors, research on RGB-D salient object detection (SOD) is also thriving. However, given the limitations of the external environment and the sensor itself, depth information is often les...
With the popularity of depth sensors, research on RGB-D salient object detection (SOD) is also thriving. However, given the limitations of the external environment and the sensor itself, depth information is often less credible. To meet this challenge, existing models often purify the depth information using complex convolution and pooling operations. This causes a large amount of useful information besides noise to be dropped as well, and multi-modality interaction chances between RGB and depth become less. Also, with the gradual loss of information, the hidden relationship of features between multi-level is thus ignored. To tackle the aforementioned problems, we propose a Hierarchical Transformer U-Shape Network (HierNet) that include three aspects: 1) With a simple structure, a depth calibration module provides faithful depth information with minimal loss of information, providing conditions for cross-modality cross-layer information interaction; 2) With multi-head attention, a set of global view-based transformer encoders are employed to find the potential coherence between RGB and depth modalities. With weight sharing, several transformer encoder sets comprise the hierarchical transformer embedding module to search long-range dependencies cross-level; 3) Considering the complementary features of U-shape network, we use dual-stream U-shape network as our backbone. Extensive fair experiments on four challenging datasets have demonstrated the outstanding performance of the proposed model compared to state-of-the-art models.
Object localization is utilized as the first step in standard 6D object pose estimation methods to obtain the position information of the objects. However, these object localization methods cannot be directly applied ...
Object localization is utilized as the first step in standard 6D object pose estimation methods to obtain the position information of the objects. However, these object localization methods cannot be directly applied to unseen objects, which is the focus of recent research on 6D object pose estimation. In this paper, an accurate and efficient localization method for unseen object is proposed, based on a template matching strategy. The Hybrid Channel-Spatial Attention Model (HCSAM) is designed to focus on the target object by enhancing the contextual differences between the target object and background. Additionally, The Multi-Scale Integration Transformer (MSIT) module is designed to eliminate noise interference and enhance semantic information in low-dimensional features by integrating multidimensional information. Our method outperforms existing approaches on the complicated occluded dataset LINEMOD, as well as on the challenging generalized pose estimation dataset GenMOP.
Drug screening is an extremely costly and time-consuming process, wherein only small datasets are available in practice. We presented a particular method to estimate values of inhibition constant(Ki) or half-maximal i...
Drug screening is an extremely costly and time-consuming process, wherein only small datasets are available in practice. We presented a particular method to estimate values of inhibition constant(Ki) or half-maximal inhibition concentration(IC50) of unknown compounds through a lightweight mutual information and logistic regression(MI-LR) united model that only needed to be trained on a small dataset. Biologists could then use this model to determine whether the compounds were initially eligible for screening, increasing efficiency of their work. A data augmentation strategy was used to sort independent samples of training datasets and solved the problem of sample shortage caused by the lightweight model, and transform a prediction task into a simpler binary classification task. In addition, we proposed an effective constraint mechanism to deal with the case when the classification results were contrary to the facts. By accurately predicting the interval of its inhibitory effect, we can improve the efficiency and accuracy of drug screening. Numerous evaluations on the Ki and IC50 dataset demonstrated high reliability of the MI-LR united approach to sort compounds according to a selected set of molecular descriptors.
3D single object tracking plays an important role in computer vision and autonomous driving. The mainstream methods mainly rely on point clouds to achieve geometry matching between target template and search area. How...
详细信息
Electroencephalography (EEG), as a non-invasive and convenient method for implementing Brain-Computer Interface (BCI), has been widely used in clinical and research fields. EEG data often requires the acquisition of d...
Electroencephalography (EEG), as a non-invasive and convenient method for implementing Brain-Computer Interface (BCI), has been widely used in clinical and research fields. EEG data often requires the acquisition of dozens or even hundreds of channels. Channel selection can reduce irrelevant and redundant channels, improve computational efficiency, and enhance the quality of EEG signals. This study introduces a filter method for channel selection based on Pearson correlation coefficient (PCC) with the candidate channel and employs topographic maps of EEG channel scores, derived from data collected across all subjects, to visualize the spatial distribution of channels selected by different methods. In addition, a generalized channel selection algorithm is proposed to determine consistent channels across all subjects in the experimental group. The effectiveness of the proposed method was evaluated on two steady-state visual evoked potential (SSVEP) datasets, and the results indicated that this method exhibits superior performance compared to both the all-channel method and other channel selection methods. And the application of the generalized channel algorithm has further improved the classification performance. This study uses selected generalized channels applied to new subjects with low BCI performance, yielding a significant improvement. The selected channels have a wide range of applicability, helping to simplify EEG acquisition and improve EEG data quality.
暂无评论