Task-oriented dialogue (TOD) system is designed to accomplish user-defined tasks through dialogues. The TOD system has progressed towards end-to-end modeling by leveraging pre-trained large language models. Fine-tunin...
详细信息
As language models are often deployed as chatbot assistants, it becomes a virtue for models to engage in conversations in a user’s first language. While these models are trained on a wide range of languages, a compre...
详细信息
This paper presents a new approach to multiple language learning, with Hindi the language to be learn’t in our case, by using the integration of virtual reality environments and AI enabled tutoring systems using Ope...
详细信息
The proliferation of phishing URLs has experienced rapid growth in recent years, necessitating urgent attention to phishing attack detection in cybersecurity. In response, we introduce an improved predictive model tha...
The proliferation of phishing URLs has experienced rapid growth in recent years, necessitating urgent attention to phishing attack detection in cybersecurity. In response, we introduce an improved predictive model that leverages both machine learning and a Deep Learning Model. Our dataset consists of 88,674 instances, encompassing 112 features, including the class level. Notably, this dataset comprises 58,000 legitimate instances and 30,647 phishing *** proposed method incorporates six distinct algorithms: Decision Tree, Support Vector Machine, K-nearest neighbors, Logistic Regression, Ensemble Classifier, and Neural Network algorithm. we systematically evaluated total 28 models performance with different types of feature changes. Following enhancements, every algorithm demonstrated an accuracy performance surpassing 93.16%. Among them, the Ensemble Classifier emerged as the most effective, boasting an accuracy 97.45%, MCC 94.37%, precision a96.35%, and an F1 Score 96.31%.
This video demo is about automatic authoring of various motion effects that are provided with audiovisual content to improve user experiences. Traditionally, motion effects have been used for simulators, e.g., flight ...
详细信息
Advertising legal compliance reviews have always been time-consuming and labor-intensive, and existing Large Language Models(LLMs) are far worse performing than senior industry experts. In this paper, we propose a nov...
详细信息
ISBN:
(数字)9798331530334
ISBN:
(纸本)9798331530341
Advertising legal compliance reviews have always been time-consuming and labor-intensive, and existing Large Language Models(LLMs) are far worse performing than senior industry experts. In this paper, we propose a novel LLMs-based text-visual question answering method called Adchat-TVQA for advertising legal compliance review. After reading an advertising image and the corresponding review question, it will first understand the content on the image through multimodal learning, then optimize the question with Zero-Shot Chain-of-Thought (CoT) prompting method based on predefined industry expert experience, and finally input the augmented question into the large language model for inquiry. The feasibility of this method is verified through system prototype implementation and end-to-end functional tests. In addition, we summarized the characteristics of advertising images and added image segmentation and text box transpose to the data processing process for system performance improvement. Performance testing of the text visual cognitive question task on the AIWIN2021 dataset shows that the method scores higher than Microsoft's LayoutLM, with an increasement of the score from 51.52% to 52.74%.
Intrusion Detection Systems (IDS) are critical for identifying and mitigating potential security threats within network traffic. However, traditional IDS solutions often struggle with scalability and real-time threat ...
详细信息
Real-time monitoring of vehicle movements within a parking lot is crucial for creating a personalized parking guidance service. This applied research proposes using lane cameras to monitor vehicle dynamics, employing ...
详细信息
ISBN:
(数字)9798331504120
ISBN:
(纸本)9798331504137
Real-time monitoring of vehicle movements within a parking lot is crucial for creating a personalized parking guidance service. This applied research proposes using lane cameras to monitor vehicle dynamics, employing YOLOv8-based license plate object tracking and character recognition to determine the real-time location and movement direction of vehicles, while achieving cross-camera object tracking capabilities. With this critical information, the parking management system can provide tailored parking guidance instructions for each vehicle, reducing the burden on drivers to locate available parking spaces and contributing to lower carbon emissions.
Forensic science demands the precise application of advanced scientific principles and procedures to investigate crimes and ensure justice. This process involves the analysis of vast and complex data. The research exp...
详细信息
ISBN:
(数字)9798331504403
ISBN:
(纸本)9798331504410
Forensic science demands the precise application of advanced scientific principles and procedures to investigate crimes and ensure justice. This process involves the analysis of vast and complex data. The research explores the potential of modern science and technologies, particularly Artificial Intelligence (AI), to develop innovative techniques across various branches of forensic science. It provides a comprehensive examination of AI applications in forensic fields, focusing on image and audio analysis to identify instruments used in crimes and determine causes of death. By integrating AI, the research aims to enhance the accuracy and efficiency of forensic investigations while maintaining ethical standards. The paper also discusses the difficulties with AI in forensic science, highlighting the need for open and moral AI systems. It highlights that forensic errors leading to wrongful convictions often result from factors such as incompetence, spam, weak scientific foundations, or organizational deficiencies, rather than blunders made by forensic scientists. These issues highlight the importance of addressing the root causes of forensic errors to improve the reliability of forensic practices. The study also delves into the specific challenges AI encounters, such as data quality, interpretability of AI models, and the unification of AI into current forensic procedures. By tackling these challenges, the research aims to ensure that AI applications in forensic science are effective and trustworthy. Overall, this work highlights the critical role of AI in advancing forensic science and the importance of addressing the underlying issues that contribute to forensic errors, ultimately striving for a more reliable and just forensic process
To address the challenge of reduced image quality in plane-wave imaging (PWI), coherent plane-wave compounding (CPWC) has been introduced. CPWC combines plane wave images from various directions (i.e., with different ...
详细信息
ISBN:
(数字)9798350371901
ISBN:
(纸本)9798350371918
To address the challenge of reduced image quality in plane-wave imaging (PWI), coherent plane-wave compounding (CPWC) has been introduced. CPWC combines plane wave images from various directions (i.e., with different angles) to enhance image quality. However, the number of angles required to achieve satisfactory image quality directly impacts the maximum attainable frame rate in CPWC. Consequently, there exists a tradeoff between image quality, particularly contrast, and frame rate. In this study, our goal is to mitigate this compromise by enhancing image contrast while simultaneously improving the frame rate of CPWC imaging. Our proposed technique includes skeleton-based spatial filtering of each beamformed radio frequency (RF) image at each angle before coherent compounding. For this purpose, we introduce a novel filtering technique based on the Euclidean distance transform (EDT). EDT assigns the minimum distance from each foreground pixel to the background. Given that lower contrast in CPWC imaging stems from information leakage (i.e., higher sidelobe levels), we propose using the EDT to reduce this effect, thereby improving contrast. We implemented our proposed technique using the Plane Wave Imaging Challenge in Medical Ultrasound (PICMUS) datasets.
暂无评论