Converting whisper to normal vocalized speech has been a hot research topic in speech signalprocessing area. A complete and large scale whisper database is a major basis for this task. In this paper, we propose a mul...
详细信息
作者:
Xiao, YunZhang, ChunleiJiang, BoChen, YuanTang, JinAnhui University
Key Laboratory of Intelligent Computing & Signal Processing (Anhui University) Anhui Provincial Key Laboratory of Security Artificial Intelligence School of Artificial Intelligence Hefei 230601 China Anhui University
Information Materials and Intelligent Sensing Laboratory of Anhui Province Anhui Provincial Key Laboratory of Multimodal Cognitive Computation School of Computer Science and Technology Hefei 230601 China Anhui University
School of Internet Hefei 230601 China
Multi-modal remote sensing images registration ensures that images from different sensors or modalities are spatial and informational consistent for effective comparison and analysis. However, due to the non-linear mo...
详细信息
In order to improve the uplink channel estimation performance and reduce pilot contamination(PC),this paper proposes a pilot allocation method of jointing cell grouping and alliance game(JCG-AG) for massive multiple-i...
详细信息
In order to improve the uplink channel estimation performance and reduce pilot contamination(PC),this paper proposes a pilot allocation method of jointing cell grouping and alliance game(JCG-AG) for massive multiple-input multiple-output(MIMO) cellular *** can be divided into two ***,by properly dividing all cells into different cell-groups,the users in different cell-groups are allocated different orthogonal pilot ***,the users in the same cell-group are further divided into several mutually disjoint *** belonging to different sub-alliances are allocated different pilots,and users belonging to the same sub-alliance reuse the same *** the second step,the alliance formation algorithm is used to suppress *** JCG-AG method takes practical application as the starting point,focuses on the random distribution of users,and uses the idea of alliance *** these make it more flexible and *** demonstrated in the simulation results,the JCG-AG method can greatly mitigate the PC and reduce the average mean square error(MSE) for all channel estimations in the uplink.
In this paper,we propose a new user grouping and power allocation scheme based on beamforming for downlink non-orthogonal multiple access *** proposed user grouping scheme can effectively reduce the interference from ...
详细信息
In this paper,we propose a new user grouping and power allocation scheme based on beamforming for downlink non-orthogonal multiple access *** proposed user grouping scheme can effectively reduce the interference from the other user and other beams as well,and can effectively improve the weak user rate especially when the SNR is not *** addition,a power allocation scheme that can maximize the sum capacity while satisfying a certain fairness index is *** the simulation results,the user grouping and power allocation method proposed in this paper can not only improve the overall system throughput performance,but also improve the fairness of weak users.
Automated neural network design has received ever-increasing attention with the evolution of deep convolutional neural networks (CNNs), especially involving their deployment on embedded and mobile platforms. One of th...
详细信息
Automated neural network design has received ever-increasing attention with the evolution of deep convolutional neural networks (CNNs), especially involving their deployment on embedded and mobile platforms. One of the biggest problems that neural architecture search (NAS) confronts is that a large number of candidate neural architectures are required to train, using, for instance, reinforcement learning and evolutionary optimisation algorithms, at a vast computation cost. Even recent differentiable neural architecture search (DNAS) samples a small number of candidate neural architectures based on the probability distribution of learned architecture parameters to select the final neural architecture. To address this computational complexity issue, we introduce a novel architecture parameterisation based on scaled sigmoid function, and propose a general Differentiable Neural Architecture Learning (DNAL) method to optimize the neural architecture without the need to evaluate candidate neural networks. Specifically, for stochastic supernets as well as conventional CNNs, we build a new channel-wise module layer with the architecture components controlled by a scaled sigmoid function. We train these neural network models from scratch. The network optimization is decoupled into the weight optimization and the architecture optimization, which avoids the interaction between the two types of parameters and alleviates the vanishing gradient problem. We address the non-convex optimization problem of neural architecture by the continuous scaled sigmoid method with convergence guarantees. Extensive experiments demonstrate our DNAL method delivers superior performance in terms of neural architecture search cost, and adapts to conventional CNNs (e.g., VGG16 and ResNet50), lightweight CNNs (e.g., MobileNetV2) and stochastic supernets (e.g., ProxylessNAS). The optimal networks learned by DNAL surpass those produced by the state-of-the-art methods on the benchmark CIFAR-10 and ImageN
Distributed compressed sensing theory is applied to many practical problems,ECG signal,color imaging,*** order to improve the reconstruction accuracy of multi-dimensional signals,this paper applies singular value deco...
详细信息
Distributed compressed sensing theory is applied to many practical problems,ECG signal,color imaging,*** order to improve the reconstruction accuracy of multi-dimensional signals,this paper applies singular value decomposition to the multi-measure vector problem in DCS,then distributed compressed sensing reconstruction method based on singular value decomposition is *** method can achieve row orthogonality of the measurement matrix and does not affect the design of the reconstruction *** experiments verify the effectiveness of the proposed method,which can significantly improve the reconstruction quality of the signal and the robustness to noise.
In the scenario of time division duplexing(TDD) massive multiple-input multiple-output(MIMO) system,when there is pilot contamination(PC) in the uplink,different pilot allocation schemes will affect the results of the...
详细信息
In the scenario of time division duplexing(TDD) massive multiple-input multiple-output(MIMO) system,when there is pilot contamination(PC) in the uplink,different pilot allocation schemes will affect the results of the uplink channel *** the channel estimation results are used for downlink transmission precoding,which will further affects the downlink *** paper analyses the impact of different pilot allocation(PA) schemes on downlink *** theoretical analysis,the system downlink signal to interference plus noise ratio(SINR) and spectral efficiency expressions of different pilot allocation schemes are obtained *** simulation results show that the pilot allocation schemes with cell grouping is better than the schemes without cell *** SINR increases as the number of cell grouping increases,and the spectrum efficiency increases first and then decreases as the number of cell grouping increases.
The noncontact detection methods of blood volume pulse (BVP) based on facial videos have become a hot spot in recent years. However, these kinds of methods are highly sensitive to face movement. To address this proble...
The noncontact detection methods of blood volume pulse (BVP) based on facial videos have become a hot spot in recent years. However, these kinds of methods are highly sensitive to face movement. To address this problem, a novel BVP detection method based on kanade-lucas-tomasi (KLT) and independent component analysis (ICA) was proposed, in which, KLT method was employed for tracking and locating the region of interest (ROI) in facial images, and ICA method was used to improve the signal to noise ratio (SNR) of BVP signal. Based on 120 recorded facial videos, we carried out the comparative study of the proposed method and other commonly-used methods, the experimental results show that our method has obvious advantages in eliminating motion interference. In the implementation of the blind source separation method, we also conducted four separation methods. The results show that the second order blind identification (SOBI) algorithm is the best one to separate the BVP signal.
The multilevel characteristic basis function method(MLCBFM)with the adaptive cross approximation(ACA)algorithm for accelerated solution of electrically large scattering problems is studied in this *** the conventional...
详细信息
The multilevel characteristic basis function method(MLCBFM)with the adaptive cross approximation(ACA)algorithm for accelerated solution of electrically large scattering problems is studied in this *** the conventional MLCBFM based on Foldy-Lax multiple scattering equations,the improvement is only made in the generation of characteristic basis functions(CBFs).However,it does not provide a change in impedance matrix filling and reducing matrix calculation procedure,which is *** reality,all the impedance and reduced matrix of each level of the MLCBFM have low-rank property and can be calculated ***,ACA is used for the efficient generation of two-level CBFs and the fast calculation of reduced matrix in this *** results are given to demonstrate the accuracy and efficiency of the method.
In this paper, we propose a method of enhancing whisper, using whisper without any pretreatment combined with Wavenet. Our method is end-to-end, that is, inputing noised whisper to get clean whisper. The input to our ...
In this paper, we propose a method of enhancing whisper, using whisper without any pretreatment combined with Wavenet. Our method is end-to-end, that is, inputing noised whisper to get clean whisper. The input to our method is the original whisper without any processing, reducing the loss of features caused by other operations. We use speech denoising Wavenet to enhance whisper. Wavenet can not only enhance whisper well, but also tackle the issue of intelligibility. Specifically, use symmetric dilated convolution to obtain noisy speech context, help the model to enhance the speech for better denoising effect. Experimental results show that the enchanced whisper gains better performance both in the aspect of speech quality and intelligibility.
暂无评论