Proton Exchange Membrane Fuel Cells (PEMFCs) play a vital role in the transition to clean energy technologies, requiring accurate and efficient models for their control, optimization, and real-time monitoring. This re...
详细信息
Artificial intelligence Probabilistic Neural Networks are a type of neural network based on the theory of probability density functions, widely applicable in areas such as patternrecognition. Addressing the current i...
详细信息
The next revolutionary innovation in the textile industry is artificially driven by intelligent quality control and next-generation textile materials. This review, based on AI-driven automation and sustainable practic...
详细信息
Accurate and timely detection of gastrointestinal diseases is critical to an effective treatment and the best outcomes for patients. Manual examination of endoscopic images during an endoscopy by trained specialists s...
详细信息
With around 1.6 million deaths yearly, primarily in low-resource environments, abstract-tuberculosis (TB) is a serious worldwide health issue. While human analysis can be error-prone and labor-intensive, early discove...
详细信息
The Irula language, which is predominantly spoken by the Irula tribes in southern India, offers specific challenges because of its linguistic features and limited resources. To address these challenges, this research ...
详细信息
Voice signals convey hidden, valuable information about speakers, such as age, gender, and emotional state. Extracting this kind of information from human speech is significant in human-computer interaction (HCI). It ...
详细信息
ISBN:
(纸本)9783031780134;9783031780141
Voice signals convey hidden, valuable information about speakers, such as age, gender, and emotional state. Extracting this kind of information from human speech is significant in human-computer interaction (HCI). It enables computers to understand human behaviors and develop interactive systems with customized responses, raising the significance of advancements in speech emotion recognition (SER), especially for languages with large numbers of speakers. Despite over 100 million people speak the Egyptian dialect, SER studies that address the Egyptian dialect are extremely scarce and predominantly rely on traditional machine learning models and convolutional neural networks (CNN) for classification. In this context, we proposed an enhanced compact convolution transformer (CCT) that detects the speaker's age, gender, and emotional state, leveraging the strengths of CNNs for capturing spatial features and transformers for modeling long-range dependencies. The proposed approach combines the best of both architectures, marking a novel architecture for the Egyptian emotion recognition task. To the best of our knowledge, this is the first work to address age detection from Egyptian speech, as well as the first to propose a unified model for the recognition of age, gender, and emotion from Egyptian speech. In the context of HCI improvements, the proposed model was applied in a real-world setting by integrating it into a custom-developed Egyptian chatbot to enhance the chatbot's ability to provide emotionally aware responses based on the user's emotional state.
The paper presents the development of a Data Science solution that applies generative AI models in order to create high-quality images to be used in visual marketing and advertising. Using the Python library PIL (Pill...
详细信息
In order to retain as many effective signals in EEG as possible and improve the accuracy of a signal classification and the performance of the brain-computer interface system, a steady-state visual evoked potential EE...
详细信息
Wireless mesh networks are been prominently developing in the past two decades and have received a lot of accomplished tasks in various applications. machine learning is an art of computational intelligence that enhan...
详细信息
暂无评论