At present, the biometric template algorithms with state of the art recognition performance cannot guarantee the security of the system, whereas the security biometric template algorithms usually suffer from inaccurat...
At present, the biometric template algorithms with state of the art recognition performance cannot guarantee the security of the system, whereas the security biometric template algorithms usually suffer from inaccurate recognition. In order to improve both the recognition performance and security of feature template for finger vein, a cancelable template algorithm is proposed in this paper. Firstly, a variable curvature Gabor filter is designed for finger vein orientation feature extraction. The maximum and sub-maximum orientation of Gabor filter are combined with differential excitation respectively to generate joint distribution features, which serve as feature vectors of finger vein. Secondly, the extracted feature vectors are reduced dimension by PCA, and the reduced feature vectors are combined with the pseudo random matrix, which is produced by a token to generate a cancelable template to guarantee the security. Finally, cancelable templates in two orientations are fused to increase the irreversibility of templates by using the improved canonical correlation analysis. Extensive experiments on the SDUMLA-FV and PolyU databases demonstrate the superiority of the proposed method in terms of verification performance, diversity, revocability/reusability and irreversibility.
Pilot allocation is one of the effective means to reduce pilot pollution in massive Multiple-Input Multiple-Output (MIMO) systems. The goal of this paper is to improve the uplink achievable sum rates of strong users, ...
详细信息
Pilot allocation is one of the effective means to reduce pilot pollution in massive Multiple-Input Multiple-Output (MIMO) systems. The goal of this paper is to improve the uplink achievable sum rates of strong users, and ensure the quality of service (QoS) requirements of weak users at the same time, so that the sum rates of system can be improved. Combining with the technical advantage of pilot grouping, a low complexity pilot allocation scheme based on matching algorithm is proposed, which divides the users in the target cell into weak user group and strong user group, and adopts the minimum-maximum matching method to allocate pilots in weak user group. Through the introduction of Hungarian algorithm, a pilot allocation method is designed to ensure the fairness of the strong users. The simulation results show that, compared with the smart pilot allocation scheme, the pilot allocation scheme based on Hungarian algorithm, the pilot allocation scheme based on user grouping and the random pilot allocation scheme, the system performance of the proposed scheme has been effectively improved.
Existing Lexical Punctuation Prediction methods are mainly trained on the standard clean data while losing the generalization in practical automatic speech recognition (ASR) system with ubiquitous transcription errors...
详细信息
Existing Lexical Punctuation Prediction methods are mainly trained on the standard clean data while losing the generalization in practical automatic speech recognition (ASR) system with ubiquitous transcription errors. To bridge the gap between clean training data and noisy testing data, we propose three random (3R) data augmentation strategies: random word deletion (RWD), random word substitution (RWS), and random phoneme edition (RPE) in both word and phoneme levels on the training dataset. Specifically, we contribute an acoustically similar vocabulary with phoneme level editions for acoustically similar word substitution. In addition, we first introduce the RoBERTa-large model into a punctuation prediction task to capture the semantics and the long-distance dependencies in language. Extensive experiments on the English dataset IWSLT2011 yield to a new state-of-the-art comparing to the prevalent punctuation prediction methods.
Automatic report generation plays a crucial role in clinical practice by alleviating the heavy workload on doctors and helping to prevent misdiagnoses or missed diagnoses. In the context of radiology report generation...
详细信息
Automatic report generation plays a crucial role in clinical practice by alleviating the heavy workload on doctors and helping to prevent misdiagnoses or missed diagnoses. In the context of radiology report generation, knowledge injection is essential, particularly with the encoder-decoder framework commonly used in image captioning tasks. However, existing studies predominantly rely on expert knowledge, which is both challenging to collect and lacks universality. Additionally, this expert knowledge typically influences single-mode information and overlooks the importance of bridging the visual-to-textual gap. To address these challenges, we propose a hybrid graph-based approach for radiology report generation. Our method integrates two key components: the semantic homogeneous graph (SHG) and the cross-modal heterogeneous graph (CHG). Specifically, the SHG is constructed by mining semantic relationships between keywords across the entire corpus to generate universal knowledge. The CHG, conversely, is built from visual features, textual features, and corresponding knowledge embeddings, enabling knowledge injection during modal interaction. By leveraging graph convolutional networks to enhance graph embeddings, our model improves the quality of generated reports. Experimental results on two widely used benchmark datasets, IU-Xray and MIMIC-CXR, demonstrate the effectiveness of our approach. Notably, our method achieves a BLEU-4 score of 0.185 on the IU-Xray dataset and an F1 score of 0.418 on the MIMIC-CXR dataset, significantly outperforming existing methods.
It is fundamental to detect seismic events reliably and efficiently when processing continuous waveform data recorded by seismic stations. Recently, convolutional neural network (CNN) based detecting methods have been...
详细信息
ISBN:
(数字)9781728169262
ISBN:
(纸本)9781728169279
It is fundamental to detect seismic events reliably and efficiently when processing continuous waveform data recorded by seismic stations. Recently, convolutional neural network (CNN) based detecting methods have been proposed for seismic events detection and obtained great success in this area, where the learning of seismic event detecting network of all seismic stations is considered as one learning task and numerous labeled data need to be collected for training the detecting network. However, they tend to ignore the differences between seismic stations caused by geographic position. Moreover, due to a few seismic activities and high cost of manual data labeling, in some areas, the labeled data for seismic event detecting tasks is insufficient. Under this condition, these methods always encounter over-fitting problem leading to bad detection performance. In this paper, we propose a multi-task based framework based on convolutional neural network for accurate seismic event detection under the condition of insufficient labeled data. Specifically, we first cluster the seismic stations into several station clusters and treat the learning of seismic event detecting network of every station cluster as a learning task, and then we propose a deep multi-task network named detectMTIA among multiple tasks. Experimental results on a real-world seismic dataset with nine stations demonstrate the effectiveness of the proposed framework, especially when the labeled data is insufficient.
Log-structured merge tree (i.e., LSM-tree) based key-value stores, which are widely used in big-data applications, provide high performance. NAND Flash-based Solid state disks (i.e., SSDs) become the popular devices t...
详细信息
Double Toeplitz (DT) codes are codes with a generator matrix of the form (I, T) with T a Toeplitz matrix, that is to say constant on the diagonals parallel to the main. When T is tridiagonal and symmetric we determine...
详细信息
Reversible data hiding in encrypted images (RDHEI) receives growing attention because it protects the content of the original image while the embedded data can be accurately extracted and the original image can be rec...
详细信息
暂无评论