A wavelength-tunable, silicon photon-pair source based on spontaneous four-wave mixing, integrated with a pump rejection filter in a single, flip-chip packaged CMOS chip, is demonstrated with a coincidence-to-accident...
详细信息
This paper presents a novel system architecture that integrates blind source separation with joint beat and downbeat tracking in musical audio signals. The source separation module segregates the percussive and non-pe...
详细信息
We propose a simple and generic construction of the variational tensor network operators to study the quantum spin systems by the synergy of ideas from the imaginary-time evolution and the variational optimization of ...
详细信息
We propose a simple and generic construction of the variational tensor network operators to study the quantum spin systems by the synergy of ideas from the imaginary-time evolution and the variational optimization of trial wave functions. By applying these operators to simple initial states, accurate variational ground-state wave functions with extremely few parameters can be obtained. Furthermore, the framework can be applied to spontaneously study symmetry-breaking, symmetry-protected topological, and intrinsic topologically ordered phases, and we show that symmetries of the local tensors associated with these phases can emerge directly after the optimization without any gauge fixing. This provides a universal way to identify quantum phase transitions without prior knowledge of the system.
The scale of modem Artificial Intelligence systems has been growing and entering more research territories by incorporating Deep Learning (DL) and Deep Reinforcement Learning (DRL) methods. More specifically, multi-ag...
详细信息
ISBN:
(数字)9781728187082
ISBN:
(纸本)9781728187099
The scale of modem Artificial Intelligence systems has been growing and entering more research territories by incorporating Deep Learning (DL) and Deep Reinforcement Learning (DRL) methods. More specifically, multi-agent DRL methods have been widely applied to address the problems of high-dimensional computation, which interpret the conditions that real-world systems mainly encounter and the issues that require resolving. However, the current approaches of DL and DRL are often challenged for their untransparent and time-consuming modeling processes in their attempt to achieve a practical and applicable inference based on human-level perspective and acceptance. This paper presents an explainable and adaptable augmented knowledge attention network for multi-agent DRL systems, which uses game theory simulation to tackle the problem of non-stationarity at the beginning, while improving the learning exploration built upon the strategic ontology to achieve the learning convergence more efficiently for autonomous agents. We anticipate that our approach will facilitate future research studies and potential research inspections of emerging multi-agent DRL systems for increasingly complex and autonomous environments.
An oblique helicoidal cholesteric liquid crystal ChOH represents a unique optical material with a single-harmonic periodic modulation of the refractive index and a pitch that can be tuned by an electric or magnetic fi...
详细信息
An oblique helicoidal cholesteric liquid crystal ChOH represents a unique optical material with a single-harmonic periodic modulation of the refractive index and a pitch that can be tuned by an electric or magnetic field in a broad range from submicrometers to micrometers. In this work, we demonstrate that the oblique helicoidal cholesteric doped with azoxybenzene molecules can be tuned by both the electric field and light irradiation. The tuning mechanism is explained by the kinetics of trans-cis photoisomerization of the azoxybenzene molecules. At a fixed voltage, UV irradiation causes a redshift of the reflection peak by more than 200 nm. The effect is caused by an increase of the bend elastic constant of ChOH under irradiation. The demonstrated principle has the potential for applications such as smart windows, sensors, tunable lasers, and filters.
In this study, deep embedding of acoustic and articulatory features are combined for speaker identification. First, a convolutional neural network (CNN)-based universal background model (UBM) is constructed to generat...
ISBN:
(数字)9781509066315
ISBN:
(纸本)9781509066322
In this study, deep embedding of acoustic and articulatory features are combined for speaker identification. First, a convolutional neural network (CNN)-based universal background model (UBM) is constructed to generate acoustic feature (AC) embedding. In addition, as the articulatory features (AFs) represent some important phonological properties during speech production, a multilayer perceptron (MLP)-based AF embedding extraction model is also constructed for AF embedding extraction. The extracted AC and AF embeddings are concatenated as a combined feature vector for speaker identification using a fully-connected neural network. This proposed system was evaluated by three corpora consisting of King-ASR, LibriSpeech and SITW, and the experiments were conducted according to the properties of the datasets. We adopted all three corpora to evaluate the effect of AF embedding, and the results showed that combining AF embedding into the input feature vector improved the performance of speaker identification. The LibriSpeech corpus was used to evaluate the effect of the number of enrolled speakers. The proposed system achieved an EER of 7.80% outperforming the method based on x-vector with PLDA (8.25%). And we further evaluated the effect of signal mismatch using the SITW corpus. The proposed system achieved an EER of 25.19%, which outperformed the other baseline methods.
This paper aims to improve speaker embedding representation based on x-vector for extracting more detailed information for speaker verification. We propose a statistics pooling time delay neural network (TDNN), in whi...
ISBN:
(数字)9781509066315
ISBN:
(纸本)9781509066322
This paper aims to improve speaker embedding representation based on x-vector for extracting more detailed information for speaker verification. We propose a statistics pooling time delay neural network (TDNN), in which the TDNN structure integrates statistics pooling for each layer, to consider the variation of temporal context in frame-level transformation. The proposed feature vector, named as statsvector, are compared with the baseline x-vector features on the VoxCeleb dataset and the Speakers in the Wild (SITW) dataset for speaker verification. The experimental results showed that the proposed stats-vector with score fusion achieved the best performance on VoxCeleb1 dataset. Furthermore, considering the interference from other speakers in the recordings, we found that the proposed statsvector efficiently reduced the interference and improved the speaker verification performance on the SITW dataset.
Sum of Absolute Transformed Differences (SATD) is a distortion metric based on the Hadamard Transform. It is used in current video encoders inside the refinement stage of motion estimation, which decides the best bloc...
详细信息
ISBN:
(数字)9781728134277
ISBN:
(纸本)9781728134284
Sum of Absolute Transformed Differences (SATD) is a distortion metric based on the Hadamard Transform. It is used in current video encoders inside the refinement stage of motion estimation, which decides the best block of pixels to be used as prediction for each block to be encoded. To reduce the computational effort incurred by the SATD calculation, this work proposes an approximate SATD hardware accelerator by excluding columns of adders/subtractors of the 8 × 8 Hadamard Transform (HT). The proposed approximate accelerator reduces energy by up to 40.78% compared to the precise SATD accelerator.
Intelligent computing techniques have a paramount importance to the treatment of cybersecurity incidents. In such Artificial Intelligence (AI) context, while most of the algorithms explored in the cybersecurity domain...
详细信息
暂无评论