With the rapid spread of epidemic situation, how to quickly analyze health risk factors has become a major challenge in the current public health field. The development of naturallanguageprocessing (NLP) technology ...
详细信息
The rapid growth of the internet has increased people's reliance on it for expressing opinions on products and stores. Text sentiment analysis is now a key research area. Deep learning methods are commonly used fo...
详细信息
This paper presents our error tolerable system for coreference resolution in CoNLL-2011(Pradhan et al., 2011) shared task (closed track). Different from most previous reported work, we detect mention candidates based ...
详细信息
Efficient knowledge management plays a pivotal role in augmenting both the operational efficiency and the innovative capacity of businesses and organizations. By indexing knowledge through vectorization, a variety of ...
详细信息
ISBN:
(纸本)9789819755615;9789819755622
Efficient knowledge management plays a pivotal role in augmenting both the operational efficiency and the innovative capacity of businesses and organizations. By indexing knowledge through vectorization, a variety of knowledge retrieval methods have emerged, significantly enhancing the efficacy of knowledge management systems. Recently, the rapid advancements in generative naturallanguageprocessing technologies paved the way for generating precise and coherent answers after retrieving relevant documents tailored to user queries. However, for enterprise knowledge bases, assembling extensive training data from scratch for knowledge retrieval and generation is a formidable challenge due to the privacy and security policies of private data, frequently entailing substantial costs. To address the challenge above, in this paper, we propose EKRG, a novel Retrieval-Generation framework based on large language models (LLMs), expertly designed to enable question-answering for Enterprise Knowledge bases with limited annotation costs. Specifically, for the retrieval process, we first introduce an instruction-tuning method using an LLM to generate sufficient document-question pairs for training a knowledge retriever. This method, through carefully designed instructions, efficiently generates diverse questions for enterprise knowledge bases, encompassing both fact-oriented and solution-oriented knowledge. Additionally, we develop a relevance-aware teacher-student learning strategy to further enhance the efficiency of the training process. For the generation process, we propose a novel chain of thought (CoT) based fine-tuning method to empower the LLM-based generator to adeptly respond to user questions using retrieved documents. Finally, extensive experiments on real-world datasets have demonstrated the effectiveness of our proposed framework.
Sparse-view computed tomography (CT) is an effective method for reducing radiation dose. The images reconstructed from insufficient data obtained from sparse-view CT suffer from severe star shaped artifacts. Therefore...
详细信息
ISBN:
(数字)9798350349115
ISBN:
(纸本)9798350349122
Sparse-view computed tomography (CT) is an effective method for reducing radiation dose. The images reconstructed from insufficient data obtained from sparse-view CT suffer from severe star shaped artifacts. Therefore, reducing radiation dose will reduce imaging quality. Although deep learning methods used for sparse-view CT reconstruction have achieved impressive success, such as convolutional neural network (CNN), the reconstruction results are still too smooth, i.e. losing many details. In this work, we propose a two-stage deep learning method to reduce the artifacts of sparse-view CT images. We conducted several numerical simulation experiments to test the performance of the proposed network. The results indicate that the denoising method can significantly reduce the artifacts caused by sparse sampling and carry more detailed information than CNN.
Navigating through the University of Ghana campus, like most tertiary campuses, can be very challenging, especially for a freshman, foreign, or an exchange student. There are numerous routes leading to particular buil...
详细信息
Plagiarism detectors are software or apps that are primarily intended for string-level comparisons between texts that raise the suspicion of being plagiarized and texts that have the potential to be original work. By ...
详细信息
Plagiarism detectors are software or apps that are primarily intended for string-level comparisons between texts that raise the suspicion of being plagiarized and texts that have the potential to be original work. By incorporating naturallanguageprocessing (NLP) methods into the currently in use detection methods, this study seeks to increase the accuracy of plagiarism detectors. A framework for plagiarism detectors was proposed, in which sets of naturallanguageprocessing (NLP) techniques were applied to several available documents deemed suspicious and authentic. The work was not limited to analyzing the given documents, but also involved understanding the structure of the text and accounting for text relations. Using the methods on a corpus of short paragraphs revealed the level of improvement that NLP techniques had, as well as their increased accuracy in identifying plagiarism.
YouTube is one of major video-sharing platform in today's world. Every minute hundreds of hours of video content are being added to it. Extracting key information from such massive amount of video content is very ...
详细信息
Large-scale log files are vital for capturing critical information about system operations, network interactions, and user activities, making them essential in areas such as enterprise management, research, and cybers...
详细信息
ISBN:
(数字)9798331528829
ISBN:
(纸本)9798331528836
Large-scale log files are vital for capturing critical information about system operations, network interactions, and user activities, making them essential in areas such as enterprise management, research, and cybersecurity. However, as data volume grows exponentially, traditional text processingmethods struggle to efficiently parse these extensive logs, hindering accurate information extraction and analysis. This study focuses on leveraging existing naturallanguageprocessing techniques, including regular expressions, keyword extraction, and text clustering, to tackle the challenges of parsing large log files. We examine the specific issues posed by such data and introduce a comprehensive approach that incorporates log preprocessing with regular expressions, followed by key information extraction via advanced algorithms, and segmentation through text clustering methods. Our experimental results demonstrate an accuracy rate of 85%, highlighting the effectiveness and potential of our proposed solution for addressing the increasing complexity of log file analysis.
People cannot live without naturallanguage in their daily life, and naturallanguage is also an essential part of the heritage of human civilization. With the rapid development of information technology and the explo...
People cannot live without naturallanguage in their daily life, and naturallanguage is also an essential part of the heritage of human civilization. With the rapid development of information technology and the explosive growth of various data, naturallanguageprocessing (NLP) technology based on deep learning and other technologies in the field of artificial intelligence has emerged as the times require. With the rapid development of deep learning models in recent years, breakthroughs have been made in the field of naturallanguageprocessing. Based on recent research, this paper briefly introduces the development process of deep learning, the concepts of deep learning and NLP, the methods of deep learning used to solve the core problems of NLP, the application of the neural network model in naturallanguage modeling. This paper summarizes the current development achievements and forecasts its future development.
暂无评论