More than a year has passed since reports of ChatGPT-3.5’s capability to pass exams sent shockwaves through education circles. These initial concerns led to a multi-institutional and multi-disciplinary study to asses...
详细信息
Structural testing is one of the testing techniques in software testing. It is the testing process of internal structure in the given source code by comparing the expected result and real results and finding out fault...
详细信息
Computational narrative is a complex field. While the computational processing of narratives has been tackled from different perspectives, the literature has focused on the analysis of a main plot, even if it is compo...
详细信息
The use of virtual reality as a tool for professional training enables access to knowledge in a more active way. Applying it in high-risk situations training encourages finding the most effective way to share knowledg...
详细信息
Aortic valve calcium scoring is extensively utilized for diagnosing, treating, monitoring, and assessing the risk of aortic stenosis and coronary artery disease. The gold standard method for determining aortic valve c...
详细信息
In light of this unmistakable exponential data expansion, visual media archiving must be rethought. Human generated meta-data might not be sufficient for efficient data retrieval. Object detection and object recogniti...
详细信息
Considering the increasing and widespread use of chatbots, it is of great importance to provide methods and tools to address ethical concerns and to make users aware of various aspects of a chatbot, including non-func...
详细信息
ISBN:
(数字)9798331517649
ISBN:
(纸本)9798331517656
Considering the increasing and widespread use of chatbots, it is of great importance to provide methods and tools to address ethical concerns and to make users aware of various aspects of a chatbot, including non-functional features. The Generative Pre-Trained Transformer (GPT) is a state-of-the-art Natural Language Generation (NLG) model, enhanced by the latest version of GPT-4. This study evaluates the trustworthiness of the responses of three different chatbot models, BlenderBot with Hugging Face, ChatGPT-3.5 and ChatGPT-4, to 100 randomly selected medical questions. The accuracy and semantic similarity of the answers were measured by BLEU, ROUGE-1 and Cosine Similarity metrics, and response times were also recorded as an important performance factor. The results showed that GPT-4 exhibited superior performance, thus being able to produce more accurate and contextually reliable responses. However, the significantly longer response time of GPT-4 emerged as a disadvantage that may affect real-time utilisation. These findings provide an important reference for the effective use of GPT-4 in the context of health chatbots. The study contributes to the literature on improving the effectiveness of chatbots in healthcare by drawing attention to the importance of balancing speed, accuracy and trustworthiness, as well as the fact that the answers obtained are drawn with Application Programming Interface (API).
Online misinformation poses a significant challenge due to its rapid spread and limited supervision. To address this issue, automated rumour detection techniques are essential for countering the negative impact of fal...
详细信息
Human beings have gone through stages that have made significant contributions to their lives with technological developments. One of the most important of these close to the present day is the introduction of IoTs, a...
详细信息
Due to its importance in studying people's thoughts on various Web 2.0 services, emotion classification is a critical undertaking. Most existing research is focused on the English language, with little work on low...
详细信息
暂无评论