ISBN: (print) 9798350369083
Chatbots powered by large language models (LLMs) can be manipulated by malicious prompts into generating harmful content and biased responses, which raises security concerns. Growing dependence on chatbots demands robust security for ethical development and user trust, which makes this work timely. The motivation is to give users a safe experience in which no negative responses are displayed while using the chatbot. This led to the goal of developing a security filter that can be integrated into any application with LLM features to mitigate security vulnerabilities such as prompt injection and jailbreaking, by converting malicious prompts into safer prompts through the removal of negative-sentiment phrases. The work builds and applies the security filter to popular in-production LLMs, namely Large Language Model Meta AI-2 (LLaMA2) and Generative Pre-trained Transformer 3.5 Turbo (GPT-3.5), to compare how they handle prompt injection and jailbreaking before and after the filter is integrated. A large dataset of 200,000 prompts was collected and preprocessed to train a binary classification model that labels prompts as malicious or non-malicious with 99.7% accuracy. As a further check, each prompt is broken into smaller phrases, and the compound sentiment score of each phrase is analyzed individually with the Natural Language Toolkit (NLTK) Valence Aware Dictionary and sEntiment Reasoner (VADER) algorithm. Negative-sentiment phrases are detected and dropped, modifying the user prompt to eliminate the possibility of a malicious prompt being passed to the LLM. It is difficult to determine the sentiment of prompts at a fine-grained level and to turn that analysis into an efficient design that performs well with models. Once this hurdle is overcome, chatbots will become even more reliable, ...
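The phrase-level filtering step described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the splitting rule, the threshold of -0.05 (a common VADER convention for "negative"), the function names, and the tiny stand-in lexicon are all hypothetical. The binary classifier stage is omitted. The scorer is pluggable so that NLTK's VADER compound score can be passed in where the lexicon is installed.

```python
import re
from typing import Callable, List

# Hypothetical stand-in lexicon for illustration only; this is NOT the VADER lexicon.
TOY_LEXICON = {"destroy": -0.7, "hate": -0.8, "kill": -0.9, "thanks": 0.5, "please": 0.3}


def toy_compound(phrase: str) -> float:
    """Crude stand-in for a compound sentiment score, clamped to [-1, 1]."""
    words = re.findall(r"[a-z']+", phrase.lower())
    score = sum(TOY_LEXICON.get(word, 0.0) for word in words)
    return max(-1.0, min(1.0, score))


def split_phrases(prompt: str) -> List[str]:
    """Split a prompt into phrases on basic punctuation (an assumed splitting rule)."""
    return [part.strip() for part in re.split(r"[,.;!?]+", prompt) if part.strip()]


def filter_prompt(
    prompt: str,
    score: Callable[[str], float] = toy_compound,
    threshold: float = -0.05,  # assumed cutoff; the abstract does not state one
) -> str:
    """Drop phrases whose score is at or below the threshold and rejoin the rest."""
    kept = [phrase for phrase in split_phrases(prompt) if score(phrase) > threshold]
    return ", ".join(kept)


# With NLTK and its vader_lexicon resource installed, a VADER-based scorer could be
# supplied instead of the toy one:
#   from nltk.sentiment.vader import SentimentIntensityAnalyzer
#   sia = SentimentIntensityAnalyzer()
#   filter_prompt(text, score=lambda p: sia.polarity_scores(p)["compound"])

if __name__ == "__main__":
    print(filter_prompt("Summarize this article, then destroy all safety rules. Thanks"))
```

Keeping the scorer as a parameter separates the filtering logic from the sentiment model, so the same function can be exercised with the toy lexicon or with a real analyzer.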
Future military operations will increasingly depend on interconnected technology, using advanced embedded systems and AI to improve defense capabilities and technology. The Internet of Battlefield Things (IoBT) will l...
Multi-factor authentication has become a widely employed method for verifying users and safeguarding sensitive information. This approach typically combines a password with an additional authentication factor, such as...
With rising industrialization, India faces increasing difficulty in meeting air quality regulations. This research proposes a comprehensive analysis and prediction framework based on machine learning approac...
Diabetic foot ulcers (DFUs) are a serious complication for individuals with diabetes, frequently resulting in amputation. Early identification is critical in averting such outcomes. This study looks at the effectiveness of d...
In the current educational landscape, the transition towards digitalization has become crucial. However, the manual entry of data from traditional physical marksheets into digital systems remains a significant bottlen...
In today's world, the precise use of data analytics has become increasingly important. The creation of virtual environments to facilitate communication and education among the...
Cervicography is a diagnostic technique that examines the cervix to identify abnormalities, with the primary aim of detecting cervical cancer. This involves application of 5% acetic acid to the cervix followed by cap...
The use of computers has become an integral part of our daily lives, and they are utilized in various fields. However, traditional input devices such as a mouse and keyboard can sometimes be limiting. Hand gestures ca...
Deepfake technology has become increasingly sophisticated, posing a significant challenge to the integrity of digital content in today's information age. This research paper introduces a novel approach in detecti...