For content retrieval, analysis and generation of film and television scene images in the field of intelligent editing, fine-grained emotion recognition and prediction of images is of great significance. In this paper, a fusion of traditional perceptual features, art features and multi-channel deep learning features is used to reflect the emotional expression at different levels of the image. In addition, an ensemble learning model with a stacking architecture based on linear regression coefficients and sentiment correlations, called the LS-stacking model, is proposed according to the factor association between multi-dimensional emotions. The experimental results show that the mixed features and the LS-stacking model predict well on the 16 emotion categories of the self-built image dataset. This study improves the fine-grained recognition of image emotion by computers, which helps to increase the intelligence and automation of visual retrieval and post-production systems.
The target detection ability of radar in environments with interference or clutter can be further improved by the adaptive beamforming method. However, most existing research results only address the receiving beam, a...
Music source separation, the process of extracting independent audio streams from a complex mix, has traditionally focused on isolating vocals, drums, bass, and other primary sources. This study tackles the more intri...
Analyzing the timbre characteristics of different singing genres is an important issue in the field of singing acoustics. Formants are an important characteristic parameter for timbre perception, but classical acousti...
As an integral component of modern cultural works, the role of film and television music in artistic expression and emotional communication has become increasingly significant. Timbre, as a crucial attribute of music,...
With the development of intelligent agents pursuing humanisation, artificial intelligence must consider emotion, the most basic spiritual need in human [...]. [...] emotional dialogue systems usually use an external emotional dictionary to select appropriate emotional words to add to the response, or concatenate emotional tags and semantic features in the decoding step, to generate appropriate responses. However, selecting emotional words from a fixed emotional dictionary may result in loss of the diversity and consistency of the responses. We propose a semantic and emotion-based dual latent variable generation model (Dual-LVG) for dialogue systems, which is able to generate appropriate emotional responses without an emotional dictionary. Different from previous work, the conditional variational autoencoder (CVAE) adopts the standard transformer [...]. [...], Dual-LVG regularises the CVAE latent space by introducing a dual latent space of semantics and emotion. The content diversity and emotional accuracy of the generated responses are improved by learning emotion and semantic features [...]. [...], the average attention mechanism is adopted to better extract semantic features at the sequence level, and the semi-supervised attention mechanism is used in the decoding step to strengthen the fusion of emotional features of the [...]. Experimental results show that Dual-LVG can generate different content by controlling emotional factors.
With the acceleration of modernization and the rapid development of popular music, the splendid culture of Guqin, a type of Chinese traditional musical instrument which has lasted for nearly 1,000 years, is facing a s...
Frequency diverse arrays (FDA) offer potential applications for enhancing the RF safety performance of FDA systems. This is due to the time-varying angle-distance coupling characteristics of the transmit beam map, which...
The Mossformer model excels in speech separation but has not been effectively applied to music source separation. Music sources have complex characteristics and higher sampling rates, making separation tasks more chal...
The extensive integration of distributed resources such as distributed photovoltaic systems and electric vehicles greatly increases the uncertainty on the load side. Corresponding demand response strategies can be devel...