Load forecasting is an important guarantee for power system design, planning and operation. In order to further improve the accuracy of power load forecasting, a combined forecasting model based on gating recurrent ne...
详细信息
Multi objective optimization evolutionary algorithms (MOEAs) play a crucial role in addressing multi-objective optimization problems (MOPs) in the field of artificial intelligence. However, MOEAs often struggle to sim...
详细信息
With the development of artificial intelligence technology,various sectors of industry have *** them,the autonomous vehicle industry has developed considerably,and research on self-driving control systems using artifi...
详细信息
With the development of artificial intelligence technology,various sectors of industry have *** them,the autonomous vehicle industry has developed considerably,and research on self-driving control systems using artificial intelligence has been extensively *** on the use of image-based deep learning to monitor autonomous driving systems have recently been *** this paper,we propose an advanced control for a serving robot.A serving robot acts as an autonomous line-follower vehicle that can detect and follow the line drawn on the floor and move in specified *** robot should be able to follow the trajectory with speed *** controllers were used simultaneously to achieve *** neural networks(CNNs)are used for target tracking and trajectory prediction,and a proportional-integral-derivative controller is designed for automatic steering and speed *** study makes use of a Raspberry PI,which is responsible for controlling the robot car and performing inference using CNN,based on its current image input.
Multimodal emotion recognition is one of the mainstream frontier directions in the field of AI, and has potential significant application value and wide application in related application scenarios involving intellige...
Multimodal emotion recognition is one of the mainstream frontier directions in the field of AI, and has potential significant application value and wide application in related application scenarios involving intelligent science and human-computer interaction. Since the connection of semantic information features between contexts is a factor that needs to be focused on in multimodal emotion recognition (ER), although the CNN model can predict the information contained in the time series, it does not fully consider. The special relationship between semantic information has certain limitations. The CMN uses the semantic information that GRU can store in the memory unit. However, the practical application of this model is only to dynamically monitor the emotional changes of both parties over a period of time. There are deficiencies in the dialogue process involved. The recognition model based on variational autoencoder and multi-modal feature fusion constructed in this paper extracts rich contextual semantic information through Bi-LSTM, and uses the attention mechanism to analyze the multi-modal features extracted by multiple speakers. Features are fused to obtain important features, which effectively solves the problems mentioned above. The MFF-VAE recognition model proposed in this article effectively combines the extended feature extraction of horizontal multiple participants with the complementarity and connection of vertical semantic information, providing the next level of research direction for further improving recognition accuracy in the field of multimodal emotion recognition.
Facial reactions convey crucial emotional information and coordinating interpersonal relationships in human dyadic interactions. While existing Multiple Appropriate Facial Reaction Generation (MAFRG) methods focus on ...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Facial reactions convey crucial emotional information and coordinating interpersonal relationships in human dyadic interactions. While existing Multiple Appropriate Facial Reaction Generation (MAFRG) methods focus on generating multiple reasonable facial reactions, none of these approaches combines 2D and 3D facial behaviour information nor account for the influence of individuals’ facial identities, leading to inconsistencies in the generated facial reactions and limited capability in capturing subtle variations in facial depth and expression dynamics. This paper proposes a novel Hierarchical Multimodal Decoupling-Fusion (HMDF) framework that decouples 3D facial identity from expression behaviors, eliminating identity-based interference in the reaction generation process, which are integrated with audio-visual features through a cross-attention mechanism. Experiments show that our framework achieved the enhanced diversity and synchrony in the generated facial reactions.
Robotic dexterous grasping is a challenging problem due to the high degree of freedom (DoF) and complex contacts of multi-fingered robotic hands. Existing deep reinforcement learning (DRL) based methods leverage human...
详细信息
Consensus clustering methods provide better performance by fusing multiple clustering solutions in terms of accuracy, robustness and stability. However, most current methods suffer from different challenges: i) the hi...
Consensus clustering methods provide better performance by fusing multiple clustering solutions in terms of accuracy, robustness and stability. However, most current methods suffer from different challenges: i) the high-dimensional problem; ii) limitations of single clustering method; iii) the optimal number of clusters selecting for a certain validity measure; iv) redundant clustering candidate attributes. To overcome the above limitations, we propose a hybrid clustering solutions fusion method based on gated three-way decision (HCFG) for data analysis. By integrating multiple clustering solutions and executing information fusion, HCFG enjoys four properties: (1) multiple random subspace generation strategy is utilized to generate diverse low-dimensional subspaces effectively; (2) a fusion framework that considers characteristics of both the soft clustering and hard clustering methods is designed, in which potential boundary of feature attribute sets is explored; (3) the optimal number of clusters is set by utilizing multiple clustering validity indices; (4) clustering solutions is considered as attributes and a gated three-way decision method is proposed to adaptively conduct attribute reductions. Extensive comparative experiments on 24 real-world data sets demonstrates the effectiveness and superiority of HCFG. Moreover, nonparametric tests are conducted to compare HCFG with multiple consensus clustering methods.
Multimodal emotion recognition (MER), which relies on its role in processing and analysing comments posted on social media and identifying the corresponding target emotion states, has a very important position in educ...
Multimodal emotion recognition (MER), which relies on its role in processing and analysing comments posted on social media and identifying the corresponding target emotion states, has a very important position in education, social media and especially in the field of HCI. In order to solve the problem of emotion recognition for video, text and other forms of data in social media, we propose a Transformer with feature. The Transformer is able to take into account the combination of different feature relations and extract global keys, so it is introduced for long-range sequential context extraction, and the residual connections are placed after the Transformer to prevent information collapse. In addition, this paper uses a multi-headed attention mechanism to combine the extracted different feature laws, and finally the fused feature vectors are fed into the classifier to classify the sentiment and output it to achieve multimodal sentiment recognition in a multimodal context.
Data poisoning attacks, where adversaries manipulate training data to degrade model performance, are an emerging threat as machine learning becomes widely deployed in sensitive applications. This paper provides a comp...
Data poisoning attacks, where adversaries manipulate training data to degrade model performance, are an emerging threat as machine learning becomes widely deployed in sensitive applications. This paper provides a comprehensive overview of data poisoning including attack techniques, adversary incentives, impacts on security and reliability, detection methods, defenses, and key research gaps. We examine label flipping, instance injection, backdoors, and other attack categories that enable malicious outcomes ranging from IP theft to accidents in autonomous systems. Promising detection approaches include statistical tests, robust learning, and forensics. However, significant challenges remain in translating academic defenses like adversarial training and sanitization into practical tools ready for operational use. With safety and trustworthiness at stake, more research on benchmarking evaluations, adaptive attacks, fundamental tradeoffs, and real-world deployment of defenses is urgently needed. Understanding vulnerabilities and developing resilient machine learning pipelines will only grow in importance as data integrity is fundamental to developing safe artificial intelligence.
This research focuses on exploring the exponential $$H_\infty $$ stability of general conformable nonlinear system. In order to address the nonlinearity inherent in the system, a polynomial fuzzy (PF) method is employ...
This research focuses on exploring the exponential $$H_\infty $$ stability of general conformable nonlinear system. In order to address the nonlinearity inherent in the system, a polynomial fuzzy (PF) method is employed. Modeling a general conformable nonlinear system within the polynomial framework reduces the number of fuzzy rules compared to the classical Takagi–Sugeno fuzzy (TSF). Furthermore, controlling such a complex system, which accounts for perturbations, employs a PF model instead TSF model to describe its nonlinear dynamics, and incorporates a general conformable derivative instead of an integer-order one, is significantly more challenging, and remains unaddressed in previous studies. In this paper, a PF controller is designed in the form of sum of squares (SOS) to enhance the resilience against perturbations and ensure the exponential stability of the proposed model. The proposed SOS can be solved numerically, and partially symbolically, using the recently developed SOSTOOLS. In order to ensure the $$H_\infty $$ performance, a generalized criterion is defined for the general conformable nonlinear system. To demonstrate the effectiveness of the proposed method, a numerical example is provided.
暂无评论