In order to encourage carpooling between workplace personnel and college students, this research article gives a unique method that makes use of machine getting to know and synthetic intelligence (AI) algorithms. The ...
详细信息
The working environment of wind turbine gearboxes is complex and variable, with strong noise, which makes traditional fault diagnosis methods inadequate for accurate fault identification. To address this issue, this p...
详细信息
Human-centric Emotional Video Captioning (H-EVC) aims to generate fine-grained, emotion-related sentences for human-based videos, enhancing the understanding of human emotions and facilitating human-computer emotional...
详细信息
Human-centric Emotional Video Captioning (H-EVC) aims to generate fine-grained, emotion-related sentences for human-based videos, enhancing the understanding of human emotions and facilitating human-computer emotional interaction. However, existing video captioning methods primarily focus on overall event content, often overlooking sufficient subtle emotional clues and interactions in videos. As a result, the generated captions frequently lack emotional information. To address this, we propose a novel Emotion-oriented Cross-modal Prompting and Alignment (ECPA) approach for large foundation models to enhance H-EVC accuracy by effectively modeling fine-grained visual-textual emotion clues and interactions. Using large foundation models, our ECPA introduces two learnable prompting strategies: visual emotion prompting (VEP) and textual emotion prompting (TEP), as well as an emotion-oriented cross-modal alignment (ECA) module. In VEP, we develop two-level learnable visual prompts, i.e., emotion recognition (ER)-level and action unit (AU)-level prompting, to assist pre-trained vision-language foundation models to attend to both coarse and fine emotion-related visual information in videos. In TEP, we correspondingly devise two-level learnable textual prompts, i.e., sentence-level emotional tokens, and word-level masked tokens, for obtaining both whole and local textual prompt representations related to emotions. To further facilitate the interaction and alignment of visual-textual emotion prompt representations, our ECA introduces another two levels of emotion-oriented prompt alignment learning mechanisms: the ER-sentence level and the AU-word level alignment losses. Both enhance the model's ability to capture and integrate both global and local cross-modal emotion semantics, thereby enabling the generation of fine-grained emotional linguistic descriptions in video captioning. Extensive experiments not only demonstrate that our ECPA outperforms existing state-of-the-art ap
The fast increase of network traffic in recent times causes significant detection of intrusions in Internet of Things (IoT) environments. Currently, Deep Learning (DL) models play a crucial role in cyber security for ...
详细信息
Research on voice recognition for African languages is limited due to the scarcity of digital resources for training and adaptation, despite its broad usefulness. The Hausa language, spoken by almost fifty million inh...
详细信息
The low-intensity attack flows used by Crossfire attacks are hard to distinguish from legitimate *** methods to identify the malicious flows in Crossfire attacks are rerouting,which is based on *** these existing mech...
详细信息
The low-intensity attack flows used by Crossfire attacks are hard to distinguish from legitimate *** methods to identify the malicious flows in Crossfire attacks are rerouting,which is based on *** these existing mechanisms,the identification of malicious flows depends on the IP ***,the IP address is easy to be changed by *** the IP address,the certificate ismore challenging to be tampered with or ***,the traffic trend in the network is towards *** certificates are popularly utilized by IoT devices for authentication in encryption *** proposed a new way to verify certificates for resource-constrained IoT devices by using the SDN *** on DTLShps,the SDN controller can collect statistics on *** this paper,we proposeCertrust,a framework based on the trust of certificates,tomitigate the Crossfire attack by using SDN for *** goal is ***,the trust model is built based on the Bayesian trust system with the statistics on the participation of certificates in each Crossfire ***,the forgetting curve is utilized instead of the traditional decay method in the Bayesian trust system for achieving a moderate decay ***,for detecting the Crossfire attack accurately,a method based on graph connectivity is ***,several trust-based routing principles are proposed tomitigate the Crossfire *** principles can also encourage users to use certificates in *** performance evaluation shows that Certrust is more effective in mitigating the Crossfire attack than the traditional rerouting ***,our trust model has a more appropriate decay rate than the traditional methods.
In this groundbreaking research endeavor, we present a novel approach to breast cancer assessment, leveraging the power of deep learning and transfer learning techniques. Our methodology involves the fine-tuning of a ...
详细信息
ISBN:
(纸本)9798350384277
In this groundbreaking research endeavor, we present a novel approach to breast cancer assessment, leveraging the power of deep learning and transfer learning techniques. Our methodology involves the fine-tuning of a pre-trained DenseNet201 model using the extensive BreakHis dataset, aiming to achieve precise categorization of breast cancer tumors. The primary objective of our study is to enhance the accuracy and reliability of breast cancer diagnosis through the utilization of state-of-the-art deep learning architectures. Employing transfer learning, we fine-tuned the pre-trained DenseNet201 model on the BreakHis dataset, a comprehensive and diverse collection of breast histopathological images. This dataset encompasses various benign and malignant breast tumor cases, providing a robust foundation for our model to learn intricate patterns and features. During the training phase, our model exhibited remarkable performance, achieving an impressive accuracy of 97.00%. The validation phase further reinforced the model's capabilities, yielding a validation accuracy of 92.00%. These compelling results underscore the efficacy of our approach in accurately categorizing breast tumors, thereby contributing to the advancement of breast cancer diagnostics. This research not only showcases the potential of deep learning in the field of medical image analysis but also emphasizes the importance of leveraging transfer learning to optimize model performance. The ability to discern subtle patterns in histopathological images enables our model to provide clinicians with reliable information for more accurate and timely breast cancer diagnosis. Our study signifies a significant step forward in the ongoing efforts to improve breast cancer assessment methodologies, with potential implications for enhancing patient outcomes through early and precise detection. The integration of advanced technologies, such as deep learning, into medical diagnostics holds promise for revolutionizing the w
Vehicular edge computing (VEC) allows vehicles to process part of the tasks locally at the network edge while offloading the rest of the tasks to a centralized cloud server for processing. A massive volume of tasks ge...
详细信息
Language detection is a crucial preprocessing step in natural language processing (NLP) tasks, especially in a multilingual environment. This paper presents a language detection system utilizing the Naive Bayes classi...
详细信息
Activity and motion recognition using Wi-Fi signals,mainly channel state information(CSI),has captured the interest of many researchers in recent *** research studies have achieved splendid results with the help of ma...
详细信息
Activity and motion recognition using Wi-Fi signals,mainly channel state information(CSI),has captured the interest of many researchers in recent *** research studies have achieved splendid results with the help of machine learning models from different applications such as healthcare services,sign language translation,security,context awareness,and the internet of ***,most of these adopted studies have some shortcomings in the machine learning algorithms as they rely on recurrence and convolutions and,thus,precluding smooth sequential ***,in this paper,we propose a deep-learning approach based solely on attention,i.e.,the sole Self-Attention Mechanism model(Sole-SAM),for activity and motion recognition using Wi-Fi *** Sole-SAM was deployed to learn the features representing different activities and motions from the raw CSI *** were carried out to evaluate the performance of the proposed Sole-SAM *** experimental results indicated that our proposed system took significantly less time to train than models that rely on recurrence and convolutions like Long Short-Term Memory(LSTM)and Recurrent Neural Network(RNN).Sole-SAM archived a 0.94%accuracy level,which is 0.04%better than RNN and 0.02%better than LSTM.
暂无评论