Recent technologies like ChatGPT inspire ideas of increasing automation for professional tasks in various domains. This includes among others requirements engineering. So far, however, the divide between useful and ha...
详细信息
ISBN:
(纸本)9798350395129;9798350395112
Recent technologies like ChatGPT inspire ideas of increasing automation for professional tasks in various domains. This includes among others requirements engineering. So far, however, the divide between useful and harmful automation has been blurry and in flux. In this keynote, I will discuss foundational concepts with which automation has been analyzed. Using these concepts, I will discuss the opportunities arising for requirements engineering from the advent of naturallanguageprocessing, image processing, and event sequence analysis techniques. Furthermore, I will discuss important pitfalls that have been well documented for other automation technologies in the past.
Having difficulties like being visually impaired, hard of listening to, dumb are a more amount of difficulty. Technology and innovation have motivated humans to grow to be depending on solace but there exists an under...
详细信息
3D open-vocabulary semantic segmentation is a challenge in the task of 3D scene understanding, as most current models trained on closed-set datasets struggle to effectively identify categories that were not seen durin...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
3D open-vocabulary semantic segmentation is a challenge in the task of 3D scene understanding, as most current models trained on closed-set datasets struggle to effectively identify categories that were not seen during training. To address this, we introduce a framework called LSWKD. It distills knowledge from a pre-trained 3D open-world model, thereby enhancing the alignment between visual and semantic features. Furthermore, we employ Point-discriminative Contrastive Learning to compute caption loss in the teacher model instead of CLIP-style Contrastive Loss in order to let each point be supervised with its all related language captions, which improves the teacher model's performance. We conducted experiments on ScanNet and S3DIS datasets. The results demonstrate that our approach achieves better hIoU compared with state-of-the-art models. Code will be released at https://***/wu39848/LSWKD.
Large language models augmented with task-relevant documents have demonstrated impressive performance on knowledge-intensive tasks. However, regarding how to obtain effective documents, the existing methods are mainly...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Large language models augmented with task-relevant documents have demonstrated impressive performance on knowledge-intensive tasks. However, regarding how to obtain effective documents, the existing methods are mainly divided into two categories. One is to retrieve from an external knowledge base, and the other is to utilize large language models to generate documents. We propose an iterative retrieval-generation collaborative framework. It is not only able to leverage both parametric and non-parametric knowledge, but also helps to find the correct reasoning path through retrieval-generation interactions, which is very important for tasks that require multi-step reasoning. We conduct experiments on four question answering datasets, including single-hop QA and multi-hop QA tasks. Empirical results show that our method significantly improves the reasoning ability of large language models and outperforms previous baselines.
Ancient medical texts, which reflect traditional cultural practices, can play an important role in the promotion of medical traditions. Nevertheless, the complexity of extracting information from classical Chinese med...
详细信息
Replication studies help mature our knowledge and attempt to validate the findings of a prior piece of research. However, these studies are still rare in the Requirements engineering field. Additionally, the rapidly a...
详细信息
ISBN:
(纸本)9798350326918
Replication studies help mature our knowledge and attempt to validate the findings of a prior piece of research. However, these studies are still rare in the Requirements engineering field. Additionally, the rapidly advancing realm of naturallanguageprocessing (NLP) is creating new opportunities for efficient, machine-assisted workflows application which can bring new perspectives and results to the forefront. Thus, in this paper, we replicate and extend a previous study (baseline), a tool, VVikiDoMiner, which automatically generated domain-specific corpora by crawling Wikipedia. In this study, we investigated and executed the implementation of WikiDoMiner (open-sourced code from the original paper) to recreate the results. This allowed us to strengthen the external validity of the original study. We extended the baseline to evaluate additional data sets and generated nuanced results using state-of-the-art NLP techniques such as Bidirectional Encoder Representations from Transformers (BERT). Results showed that due to the growing content in Wikipedia, the corpus generated for the Railways and Networks domains did not precisely match the results from the baseline. However, utilizing the state-of-the-art KeyBERT library from the Huggingface AI community enhanced the results, eventually generating a meaningful corpus compared to the baseline.
Abductive naturallanguage commonsense reasoning is a task aiming at inferring the most plausible explanation in narrative text for observed events. Previous works mostly concentrate on utilizing powerful pre-trained ...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Abductive naturallanguage commonsense reasoning is a task aiming at inferring the most plausible explanation in narrative text for observed events. Previous works mostly concentrate on utilizing powerful pre-trained language models and making better use of excess training data to learn abundant event commonsense knowledge. However, the utilization of causal effect is hidden in the language reasoning process and the explicit constraint of the causal effect between events has not been explored, resulting in biased inference. The model may focus on one observed event and make the wrong prediction while ignoring the other helpful events. To reveal the problem we modify the original task by appending unrelated text to the context which won't change the causal relation. And typical methods get worse in the new task as they are not good at utilizing the complementary between the two observations. Motivated by eliminating the shortcut from incomplete observation and utilizing the complementarity of the two observations, we propose an incomplete observation bias suppression method to guide the training process. Results show our approach can ease the problem revealed in the new task. Based on the proposed method and the new task, our method also get competitive result on the original task.
A synonym mining method is proposed by combining the character vector graph and noise robust learning method. The model uses paired word vectors pre-trained by ChatGPT to enhance entity semantic representation. Classi...
详细信息
ISBN:
(纸本)9798400718144
A synonym mining method is proposed by combining the character vector graph and noise robust learning method. The model uses paired word vectors pre-trained by ChatGPT to enhance entity semantic representation. Classify marks with noise. Then the cross optimal processing is carried out to identify the true and false marks. The two-layer construction system of knowledge extraction and knowledge fusion is constructed to realize the independent construction and answer of software engineering questions. The system effectively improves the efficiency of software project understanding and software reuse.
This paper presents the development process of a naturallanguage to SQL model using the T5 model as the basis. The models, developed in August 2022 for an online transaction processing system and a data warehouse, ha...
详细信息
ISBN:
(纸本)9798350358810;9798350358803
This paper presents the development process of a naturallanguage to SQL model using the T5 model as the basis. The models, developed in August 2022 for an online transaction processing system and a data warehouse, have a 73% and 84% exact match accuracy respectively. These models, in conjunction with other work completed in the research project, were implemented for several companies and used successfully on a daily basis. The approach used in the model development could be implemented in a similar fashion for other database environments and with a more powerful pre-trained language model.
Today's world data is about 80-90% unstructured, which makes it very complex to analyze, interpret and understand. Meaningful insights, patterns are hidden in the data, discovering knowledge and converting to stru...
详细信息
暂无评论