ISBN (digital): 9798350390155
ISBN (print): 9798350390162
In this paper, we propose a novel loss that integrates a deep clustering (DC) loss at the frame level and a speaker recognition loss at the segment level into a single network, without requiring additional data or exhaustive computation. The DC loss implicitly generates soft pseudo-phoneme labels for each frame-level feature, which helps extract more discriminative speaker representations by suppressing phonetic content information. We study the DC loss not only on acoustic features, but also on features extracted by pre-trained models such as wav2vec 2.0, HuBERT and WavLM. Experimental results on the VoxCeleb dataset show that overall system performance with pre-trained model features is better than with acoustic features. The proposed loss is significantly effective for systems based on acoustic features and yields a marginal improvement for systems based on pre-trained model features.
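The abstract does not give the form of the DC loss, so the following is only a minimal sketch of how a frame-level clustering term and a segment-level speaker term could be summed into one objective. Every name here (`frame_clustering_loss`, `segment_speaker_loss`, `dc_weight`), the distance-based soft assignment, the mean pooling, and the linear classifier are assumptions made for illustration, not the authors' method.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def squared_distances(frames, centroids):
    """Squared Euclidean distance from each frame (T, D) to each centroid (K, D)."""
    diff = frames[:, None, :] - centroids[None, :, :]
    return (diff ** 2).sum(axis=-1)  # shape (T, K)


def soft_assignments(frames, centroids):
    """Soft cluster labels per frame (assumed stand-in for pseudo-phoneme labels)."""
    return softmax(-squared_distances(frames, centroids), axis=1)


def frame_clustering_loss(frames, centroids):
    """Assumed frame-level term: assignment-weighted distance to the centroids."""
    d2 = squared_distances(frames, centroids)
    q = softmax(-d2, axis=1)
    return float((q * d2).sum(axis=1).mean())


def segment_speaker_loss(frames, classifier_weights, speaker_id):
    """Assumed segment-level term: cross-entropy on a mean-pooled embedding."""
    embedding = frames.mean(axis=0)                   # (D,)
    probs = softmax(classifier_weights @ embedding)   # (S,)
    return float(-np.log(probs[speaker_id]))


def joint_loss(frames, centroids, classifier_weights, speaker_id, dc_weight=0.1):
    """Weighted sum of the segment-level and frame-level terms."""
    return (segment_speaker_loss(frames, classifier_weights, speaker_id)
            + dc_weight * frame_clustering_loss(frames, centroids))


# Toy data: 50 frames of dimension 8, 4 clusters, 5 speakers (all hypothetical).
rng = np.random.default_rng(0)
frames = rng.normal(size=(50, 8))
centroids = rng.normal(size=(4, 8))
weights = rng.normal(size=(5, 8))
print(joint_loss(frames, centroids, weights, speaker_id=2))
```

Setting `dc_weight=0.0` reduces the objective to the segment-level speaker term alone, which gives a simple baseline for judging what the frame-level term contributes.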
Electromagnetic Inverse Scattering Problems (EISP) have gained wide applications in computational imaging. By solving EISP, the internal relative permittivity of the scatterer can be non-invasively determined based on...
Three-dimensional tissue cytometry is an important technique for quantitative analysis of cell structures in large fluorescence microscopy volumes. Accurate nuclei detection and segmentation is an important step for 3D tissue cytometry. Deep learning methods have shown promising results for nuclei detection and segmentation. However, manually annotating ground truth for training deep learning methods is labor-intensive and not practical for large 3D volumes. In this paper, we propose a 3D nuclei synthesis method, known as 3DSpCycleGAN, for generating 3D ground truth volumes along with corresponding synthetic microscopy volumes. Experimental results using fluorescence microscopy volumes demonstrate that our method generates more realistic 3D volumes than previously reported methods when evaluated both visually and quantitatively. We also show that using the synthetic volumes generated by 3DSpCycleGAN as training data improves segmentation accuracy for deep learning segmentation techniques.
Multiple Instance Learning (MIL) has been widely applied to medical imaging diagnosis, where bag labels are known and instance labels inside bags are unknown. Traditional MIL assumes that instances in each bag are ind...
Association rule mining is intended for searching for the relationships between attributes in transaction databases. The whole process of rule discovery is very complex, and involves pre-processing techniques, a rule ...
This paper presents a new adaptive algorithm for joint beamforming and antenna selection in mobile communication systems. Such an algorithm is of particular interest for massive multiple-input multiple-output (mMIMO) ...
This work confirms the performance gain achieved by deploying a simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) in covert communication. To maximize the system covert rate, we ...
In this work, we propose a small-size and low-cost phase shifter based on the defective microstrip structure (DMS) technique, with a modified reconfigurable unit cell (MRDMS) for WLAN applications at 5.2 GHz. The phase shifter is made up of an MRDMS unit, created by etching slots on a simple microstrip line with an outline of three bridges. To further increase the phase shift, a T-shaped slot is introduced in the central bridge and PIN diodes are placed on the edges of the outer bridges. By switching the diodes on and off, the MRDMS unit cell can operate in several different modes, giving a phase shift of 37.71° at 5.2 GHz. Additionally, MRDMS unit cells can be cascaded for a higher phase shift as required. Cascaded two- and four-cell phase shifters were designed and simulated in HFSS, and a parametric study of various parameters was conducted to show the detailed operating properties of the proposed phase shifter. The cascaded two- and four-cell phase shifters introduce phase shifts of 75.08° and 151.75°, respectively. To the best of our knowledge, this is the maximum phase shift obtained by a phase shifter based on MRDMS.
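As a quick sanity check on the figures in this abstract, the sketch below compares the reported cascaded phase shifts with what simple linear addition of the single-cell value would predict. The assumption that identical cells add linearly is ours, not the authors'; the function and constant names are hypothetical.

```python
# Values quoted in the abstract (degrees, at 5.2 GHz).
SINGLE_CELL_DEG = 37.71
REPORTED_DEG = {2: 75.08, 4: 151.75}


def ideal_cascade_shift(cells, single_cell_deg=SINGLE_CELL_DEG):
    """Phase shift if each cascaded cell contributed exactly the single-cell value."""
    return cells * single_cell_deg


def deviation_from_ideal(cells):
    """Reported shift minus the idealized linear sum, in degrees."""
    return REPORTED_DEG[cells] - ideal_cascade_shift(cells)


for cells in sorted(REPORTED_DEG):
    print(f"{cells} cells: reported {REPORTED_DEG[cells]:.2f} deg, "
          f"ideal {ideal_cascade_shift(cells):.2f} deg, "
          f"deviation {deviation_from_ideal(cells):+.2f} deg")
```

Under this assumption the reported two-cell value sits slightly below the ideal sum and the four-cell value slightly above it, both within about one degree, so the cascaded results are broadly consistent with the single-cell figure.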
Physical Human-Human Interaction (pHHI) involves the use of multiple sensory modalities. Studies of communication through spoken utterances and gestures are well established, but communication through force signals is...
ISBN (digital): 9781728190549
ISBN (print): 9781728190556
Terahertz (THz) frequencies play a crucial role in the advancement of next-generation wireless systems, primarily owing to their substantial available bandwidths. Their inherently limited range, caused by high attenuation at these frequencies, can be addressed by densely deployed heterogeneous networks, complemented by Unmanned Aerial Vehicles (UAVs) within a three-dimensional hyperspace. Yet the success of THz communications relies on the precise alignment of beams. Inadequate beam alignment can diminish signal strength at the receiver, and it affects THz signals significantly more than their conventional counterparts. This research underscores the importance of careful alignment in THz communication systems. The impact of proper alignment is substantiated through measurements conducted with a state-of-the-art measurement setup that allows accurate data collection across the 240 GHz to 300 GHz spectrum. These measurements cover varying angles and distances within an anechoic chamber to eliminate reflections. By analyzing the channel frequency and impulse responses derived from these measurements, this study provides quantifiable results that assess the effects of beam misalignment at THz frequencies.
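To give a sense of the attenuation the abstract refers to, the sketch below evaluates the standard free-space path loss formula, FSPL(dB) = 20 log10(4πdf/c), at the two ends of the measured band. This is only the idealized free-space term: it ignores molecular absorption, antenna gains, and misalignment, and it is not the measurement model used in the study. The function name and the example distances are assumptions.

```python
import math

SPEED_OF_LIGHT_M_S = 299_792_458.0


def free_space_path_loss_db(distance_m, frequency_hz):
    """Free-space path loss in dB between isotropic antennas (Friis formula)."""
    if distance_m <= 0 or frequency_hz <= 0:
        raise ValueError("distance and frequency must be positive")
    ratio = 4.0 * math.pi * distance_m * frequency_hz / SPEED_OF_LIGHT_M_S
    return 20.0 * math.log10(ratio)


# Band edges from the abstract; the distances are hypothetical examples.
for frequency_hz in (240e9, 300e9):
    for distance_m in (0.5, 1.0, 2.0):
        loss_db = free_space_path_loss_db(distance_m, frequency_hz)
        print(f"{frequency_hz / 1e9:.0f} GHz, {distance_m:.1f} m: {loss_db:.2f} dB")
```

Because the loss grows with 20 log10 of both distance and frequency, doubling the distance adds about 6 dB at any frequency, which is why narrow, well-aligned beams are needed to recover link budget at these frequencies.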