The Internet of Vehicles (IoV) aims to make transportation autonomous, safe, fast, and efficient while reducing resource waste and harmful environmental impacts. To happen this, IoV interconnects pedestrians, cars, an...
详细信息
In recent years, GNNs have becoming one of the hottest topics of deep learning due to their powerful ability in modeling relational data and their wide applications in real world. Besides, reinforcement learning is wi...
详细信息
In this paper, a method to generate speech from a muted video is proposed. Recent advances in computer vision and deep learning have allowed the development of lip-reading technology that can recognize and interpret l...
In this paper, a method to generate speech from a muted video is proposed. Recent advances in computer vision and deep learning have allowed the development of lip-reading technology that can recognize and interpret lip movements in videos. However, one of the main challenges in visual-to-speech synthesis is distinguishing between homophenes, or sets of words that look identical on a person's lips but contain different sounds. To address this challenge, the proposed model utilizes a speech generation framework that uses an external memory called Visual Audio Recall (VA-Recall) to generate speech from silent video. The VA-Recall system allows the system to effectively synthesize speech from a speaker by recalling the most relevant auditory information that matches the visual input context. The proposed method further generates subtitles for the reconstructed audio. Overall, this process allows for a more natural and contextually appropriate speech synthesis system.
This paper is part of undergraduate coursework addressing the ecological impacts of urbanization and industrialization, with a focus on water pollution and diminishing water resources in Bahrain coastal areas. The stu...
详细信息
ISBN:
(数字)9798350369106
ISBN:
(纸本)9798350369113
This paper is part of undergraduate coursework addressing the ecological impacts of urbanization and industrialization, with a focus on water pollution and diminishing water resources in Bahrain coastal areas. The study analyzes water samples from Tubli Bay, Al-Hidd Walkway, and Busaiteen Beach using methodologies such as pH testing, turbidity assessment, conductivity measurement, Total Solids (TS), Total Suspended Solids (TSS), and Total Dissolved Solids (TDS) analysis. Key parameters, including pH, temperature, and turbidity, were assessed against international standards from the World Health Organization and the U.S. Environmental Protection Agency. Results indicated pH levels ranged from 7.15 to 7.44 and turbidity from 20.7 NTU to 22.1 NTU, suggesting suspended particles from sediment runoff and anthropogenic activities. Elevated TDS and conductivity measurements reflect significant salinity in the coastal areas studied. The objectives include monitoring water quality, evaluating compliance with regulatory standards, and assessing pollution impacts on aquatic ecosystems.
Few-Shot Learning (FSL) is a sub-area of machine learning which mainly deals with data where there is a scarcity of training supervised samples. Few shot learning (FSL) more closely resembles the human brain in compar...
Few-Shot Learning (FSL) is a sub-area of machine learning which mainly deals with data where there is a scarcity of training supervised samples. Few shot learning (FSL) more closely resembles the human brain in comparing new concepts to others based on prior experience rather than identifying it exactly. FSL aims to generalize the model across the tasks (in meta learning) opposed to the classical supervised learning which generalizes across the data points. In general the FSL models may suffer from underfitting because of scarcity of supervised samples and at the same time it causes overfitting as it is likely to memorize task specific features of the training set. This work aims to reduce such problems and is presented as a metric based model ”Few Shot Learning with Feature Pairing and Mean Discrepancy” (FL-FPMD). As the title suggests, feature pairing is one among various data augmentations. It is observed that flip augmentation is more suitable in the context of pairing the features within the given task. Memorizing task specific features is reduced by incorporating the discrepancy of mean distributions of the query and the support embedding in the loss function. The training and the evaluation is performed at the miniImageNet dataset and the results indicate that the proposed model outperforms the state-of-the-art models of similar complexity.
The development of autonomous vehicle driving systems and Intelligent Transportation System (ITS) have drawn massive attention since the 1980s. For the development of ITS, road sign detection and identification are co...
详细信息
Finding the top candidates for a position is the aim of the resume screening process. The application must make use of machine learning methodologies as well as natural language processing to rate candidates in real t...
Finding the top candidates for a position is the aim of the resume screening process. The application must make use of machine learning methodologies as well as natural language processing to rate candidates in real time. Natural language processing (NLP) and machine learning methods are used to rate resumes. The output would be the resume of a top candidate, with resumes and job descriptions serving as the input. The output’s results are obtained instantly. String matching, Cosine Similarity, and TF-IDF will all be done with Mon. Existing systems are straightforward and efficient, but they lack precision, efficiency, and processing capacity.
This study presents the development and implementation of a sophisticated Web Application Firewall (WAF) empowered by machine learning techniques to bolster cybersecurity measures. Traditional WAFs primarily rely on r...
详细信息
ISBN:
(数字)9798350330366
ISBN:
(纸本)9798350330373
This study presents the development and implementation of a sophisticated Web Application Firewall (WAF) empowered by machine learning techniques to bolster cybersecurity measures. Traditional WAFs primarily rely on rule-based systems, which may struggle to adapt to the evolving nature of web-based threats. In contrast, our proposed solution leverages machine learning algorithms to dynamically analyze and respond to emerging cyber threats, providing a more proactive and adaptive defense mechanism. The core functionality of the system involves the continuous monitoring of incoming web traffic, extracting relevant features, and utilizing a machine learning model to classify the traffic as either benign or malicious. The model is trained on historical data to recognize patterns and behaviors indicative of various cyber threats, including SQL injection, cross-site scripting, and other common attack vectors. Through this learning process, the system becomes adept.at discerning malicious activities and adapting its defense strategies accordingly. The proposed model helps achieve higher precision in identifying the threat requests from normal requests.
The concept of matrices was developed to represent complex data into understandable and formattable data. There is a huge number of applications in which matrices are used like classification and data size reduction u...
详细信息
The JARVIS AI Support System represents a remarkable fusion of modern technology, blending a sophisticated GUI design, seamless voice control, and inventive features like the captivating “Air Canvas” facilitated by ...
详细信息
ISBN:
(数字)9798350354379
ISBN:
(纸本)9798350354386
The JARVIS AI Support System represents a remarkable fusion of modern technology, blending a sophisticated GUI design, seamless voice control, and inventive features like the captivating “Air Canvas” facilitated by OpenCV. This AI-driven virtual assistant offers users a natural and intuitive experience, allowing them to effortlessly perform tasks such as browsing the web, interacting with a chatbot, and executing dynamic voice- controlled actions. Moreover, the system showcases advanced capabilities including motion detection and facial recognition with an accuracy of 95% in multiple runs. Leveraging the power of computer vision, the Air Canvas feature empowers users to express creativity through fluid hand gestures, while voice commands effortlessly manage diverse tasks. This innovative project presents an approachable way to interact with technology in the world of AI.
暂无评论