Following language instructions to navigate in unseen environments is a challenging task for autonomous embodied agents. With strong representation capabilities, pretrained vision-and-language models are widely used i...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Following language instructions to navigate in unseen environments is a challenging task for autonomous embodied agents. With strong representation capabilities, pretrained vision-and-language models are widely used in VLN. However, most of them are trained on web-crawled general-purpose datasets, which incurs a considerable domain gap when used for VLN tasks. To address the problem, we propose a novel and model-agnostic Domain-Aware Prompt learning (DAP) framework. For equipping the pretrained models with specific object-level and scene-level cross-modal alignment in VLN tasks, DAP applies a low-cost prompt tuning paradigm to learn soft visual prompts for extracting in-domain image semantics. Specifically, we first generate a set of in-domain image-text pairs with the help of the CLIP model. Then we introduce soft visual prompts in the input space of the visual encoder in a pretrained model. DAP injects in-domain visual knowledge into the visual encoder of the pretrained model in an efficient way. Experimental results on both R2R and REVERIE show the superiority of DAP compared to existing state-of-the-art methods.
This project proposes an NLP-based Intelligent information Summarization System that reduces the effort required for comprehending and summarizing instructional material, maximizing engineering students' study tim...
详细信息
A backbone of knowledge graphs are their class membership relations, which assign entities to a given class. As part of the knowledgeengineering process, we propose a new method for evaluating the quality of these re...
详细信息
ISBN:
(纸本)9783031789519;9783031789526
A backbone of knowledge graphs are their class membership relations, which assign entities to a given class. As part of the knowledgeengineering process, we propose a new method for evaluating the quality of these relations by processing descriptions of a given entity and class using a zero-shot chain-of-thought classifier that uses a naturallanguage intensional definition of a class. We evaluate the method using two publicly available knowledge graphs, Wikidata and CaLiGraph, and 7 large language models. Using the gpt-4-0125-preview large language model, the method's classification performance achieves a macro-averaged F1-score of 0.830 on data from Wikidata and 0.893 on data from CaLiGraph. Moreover, a manual analysis of the classification errors shows that 40.9% of errors were due to the knowledge graphs, with 16.0% due to missing relations and 24.9% due to incorrectly asserted relations. These results show how large language models can assist knowledge engineers in the process of knowledge graph refinement. The code and data are available on Github (https://***/bradleypallen/evaluating-kg-class-memberships-using-llms).
Parameter-efficient, soft, and prompt- based tuning methods have received increasing attention in various downstream tasks due to the high cost of traditional fine-tuning methods in pre-trained language models (PLM). ...
详细信息
ISBN:
(纸本)9798350359329;9798350359312
Parameter-efficient, soft, and prompt- based tuning methods have received increasing attention in various downstream tasks due to the high cost of traditional fine-tuning methods in pre-trained language models (PLM). Prompt tuning (PT) is one such effective mechanism which has achieved remarkable performance in transferring the acquired knowledge of a PLM to perform an unseen task within the same domain using task-specific prompts and informative instructions. Even though most prior work considers in-domain knowledge transfer using PT, much work remains to be done for PT-based out-of-domain knowledge transfer. In this study we propose ProDepDet, a novel framework specifically designed to use a PLM's knowledge about structure and semantic modelling in multi-party conversations to perform the unseen, out-ofdomain task of depression detection. To our knowledge, this study is the first attempt to adapt the acquired knowledge of a PLM for out-of-domain task modelling using PT-based crosstask transferability. Experiments on few-shot and full data settings across multiple benchmark datasets demonstrate the superiority of our PT framework in two downstream tasks including depressed utterance classification and depressed speaker identification.
Large language Models (LLMs) are gaining popularity in the field of naturallanguageprocessing (NLP) due to their remarkable accuracy in various NLP tasks. LLMs designed for coding are trained on massive datasets, wh...
详细信息
In this paper, the studies carried out to detect objectionable expressions in any text will be explained. Experiments were performed with Sentence transformers, supervised machine learning algorithms, and Bert transfo...
详细信息
ISBN:
(纸本)9798350345650
In this paper, the studies carried out to detect objectionable expressions in any text will be explained. Experiments were performed with Sentence transformers, supervised machine learning algorithms, and Bert transformer architecture trained in English, and the results were observed. To prepare the dataset used in the experiments, the naturallanguageprocessing and machine learning methodologies of the toxic and non-toxic contents in the labeled text data obtained from the Kaggle platform are explained, and then the methods and performances of the models trained using this dataset are summarized in this paper.
This research aims to develop a naturallanguageprocessing (NLP) framework to uncover the current demanded knowledge, Skills, and Abilities (KSAs) in the Sri Lankan Information Technology (IT) sector. In the KSA, thi...
详细信息
Writing software requirements in the form of naturallanguage especially Thai language is very challenging. If software engineers do not have good writing skills, this may cause the ambiguity resulting in misunderstan...
详细信息
ISBN:
(纸本)9798350381771;9798350381764
Writing software requirements in the form of naturallanguage especially Thai language is very challenging. If software engineers do not have good writing skills, this may cause the ambiguity resulting in misunderstanding and misinterpretation during the development. To prevent this occurrence, this paper presents an NLP-based approach for detecting ambiguity of Thai software requirements. This approach influences an initiative fundamental of ambiguity detection mechanism at lexical level. The words potentially causing the ambiguity in software requirements are detected and classified into the ambiguity type. The contribution of the approach is demonstrated with the development of a prototype tool, Software Requirement Ambiguity Detector (SRAD). The validation and evaluation results with real software requirements from the various system domains with the practical expert perspective confirm the benefits of the proposed approach and developed tool. In the future, our tool and model will be integrated to our redesigned approach for writing Thai software requirements specification.
The process of converting naturallanguage requirements and visual models into executable software code remains an ongoing challenge in software engineering. We developed an intelligent system that adopts natural Lang...
详细信息
Deriving acceptance tests from high-level, naturallanguage requirements that achieve full coverage is a major manual challenge at the interface between requirements engineering and testing. Conditional requirements (...
详细信息
ISBN:
(纸本)9798350326918
Deriving acceptance tests from high-level, naturallanguage requirements that achieve full coverage is a major manual challenge at the interface between requirements engineering and testing. Conditional requirements (e.g., "If A or B then C.") imply causal relationships which when extracted allow to generate these acceptance tests automatically. This paper presents a tool from the CiRA (Causality In Requirements Artifacts) initiative, which automatically processes conditional naturallanguage requirements and generates a minimal set of test case descriptions achieving full coverage. We evaluate the tool on a publicly available data set of 61 requirements from the requirements specification of the German Corona-Warn-App. The tool infers the correct lest variables in 84.5% and correct variable configurations in 92.3% of all cases, which corroborates the feasibility of our approach.
暂无评论