naturallanguage Interfaces (NLIs) are a viable, human-readable alternative to complex, formal query languages like SPARQL, which are typically used for accessing semantically structured data (e.g. RDF and OWL reposit...
详细信息
ISBN:
(纸本)9781479938346
naturallanguage Interfaces (NLIs) are a viable, human-readable alternative to complex, formal query languages like SPARQL, which are typically used for accessing semantically structured data (e.g. RDF and OWL repositories). However, in order to cope with naturallanguage ambiguities, NLIs typically support a more restricted language. A major challenge when designing such restricted languages is habitability- how easily, naturally and effectively users can use the language to express themselves within the con-strains imposed by the system. In this paper, we investigate two methods for improving the habitability of a naturallanguage Interface: feedback and clarification dialogues. We model feedback by showing the user how the system interprets the query, thus suggesting repair through query reformulation. Next, we investigate how clarification dialogues can be used to control the query interpretations generated by the system. To reduce the cognitive overhead, clarification dialogues are coupled with a learning mechanism. Both methods are shown to have a positive effect on the overall performance and habitability.
Pre-trained vision and language models such as CLIP (Radford et al., 2021) have witnessed remarkable success in connecting images and texts with a primary focus on English texts. Despite recent efforts to extend CLIP ...
详细信息
ISBN:
(纸本)9798891760615
Pre-trained vision and language models such as CLIP (Radford et al., 2021) have witnessed remarkable success in connecting images and texts with a primary focus on English texts. Despite recent efforts to extend CLIP to support other languages, disparities in performance among different languages have been observed due to uneven resource availability. Additionally, current cross-lingual transfer methods of those pre-trained models would consume excessive resources for a large number of languages. Therefore, we propose a new parameter-efficient cross-lingual transfer learning framework that utilizes a translation-based alignment method to mitigate multilingual disparities and explores parameterefficient fine-tuning methods for parameterefficient cross-lingual transfer. Extensive experiments on XTD (Aggarwal and Kale, 2020) and Multi30K (Elliott et al., 2016) datasets, covering 11 languages under zero-shot, few-shot, and full-dataset learning scenarios, show that our framework significantly reduces the multilingual disparities among languages and improves cross-lingual transfer results, especially in low-resource scenarios, while only keeping and fine-tuning an extremely small number of parameters compared to the full model (e.g., Our framework only requires 0.16% additional parameters of a full-model for each language in the few-shot learning scenario). The codes are available at https://***/eric-ai-lab/PECTVLM.
The stable operation of power grid system is essential for our daily lives. The on-site patrol and examining play an important rule to detect the potential damage and stabilize the power grid system. In this paper, we...
详细信息
ISBN:
(纸本)9781665423137
The stable operation of power grid system is essential for our daily lives. The on-site patrol and examining play an important rule to detect the potential damage and stabilize the power grid system. In this paper, we propose a framework to analyze possible errors in electric system by naturallanguageprocessing method so that the on-site workers can directly describe the phenomenon and get the instruction to deal with the situation. The framework applies the word2vec algorithm to calculate the similarity between the words, and Earth Movers' Distance method is adopted to compute the overall semantic similarity between the on-site situation and the history record. Based on the similar records, the next step operation is generated to guide the on-site workers. We implement the proposed framework for electric distribution network and the experimental results indicate that the proposed method has better accuracy (93%) compared with existing methods (84%) and can be used to improve the management of power grid system.
We have created the NLP4NLP corpus to study the content of scientific publications in the field of speech and naturallanguageprocessing. It contains articles published in 34 major conferences and journals in that fi...
详细信息
ISBN:
(纸本)9781538633335
We have created the NLP4NLP corpus to study the content of scientific publications in the field of speech and naturallanguageprocessing. It contains articles published in 34 major conferences and journals in that field over a period of 50 years (1965-2015). comprising 65.000 documents. gathering 50.000 authors. including 325.000 references and representing approximately 270 million words. Most of these publications are in English. some are in French. German or Russian. Some are open access. others have been provided by the publishers. In order to constitute and analyze this corpus several tools have been used or developed. Some of them use naturallanguageprocessingmethods that have been published in the corpus. hence its name. Numerous manual corrections were necessary. which demonstrated the importance of establishing standards for uniquely identifying authors. publications or resources. We have conducted various studies: evolution over time of the number of articles and authors. collaborations between authors. citations between papers and authors. evolution of research themes and identification of the authors who introduced them. measure of innovation and detection of epistemological ruptures. use of language resources. reuse of articles and plagiarism in the context of a global or comparative analysis between sources.
Reports of human-like behaviors in foundation models are growing, with psychological theories providing enduring tools to investigate these behaviors. However, current research tends to directly apply these human-orie...
详细信息
ISBN:
(纸本)9798891760615
Reports of human-like behaviors in foundation models are growing, with psychological theories providing enduring tools to investigate these behaviors. However, current research tends to directly apply these human-oriented tools without verifying the faithfulness of their outcomes. In this paper, we introduce a framework, RealBehavior, which is designed to characterize the humanoid behaviors of models faithfully. Beyond simply measuring behaviors, our framework assesses the faithfulness of results based on reproducibility, internal and external consistency, and generalizability. Our findings suggest that a simple application of psychological tools cannot faithfully characterize all human-like behaviors. Moreover, we discuss the impacts of aligning models with human and social values, arguing for the necessity of diversifying alignment objectives to prevent the creation of models with restricted characteristics.
naturallanguageprocessing (NLP) is a domain that programs machines to interpret/ comprehend human language like human beings do. Application of NLP to summarization and machine translation has been accelerating from...
详细信息
We investigate the usefulness of evolutionary algorithms in three incarnations of the problem of feature relevance assignment in memory-based languageprocessing (MBLP): feature weighting, feature ordering and feature...
详细信息
Artificial intelligence is the field of computer science and naturallanguageprocessing is the sub-part method of artificial intelligence. Its major purpose is to remove the transmission gap between human being and c...
详细信息
ISBN:
(纸本)9781665400916
Artificial intelligence is the field of computer science and naturallanguageprocessing is the sub-part method of artificial intelligence. Its major purpose is to remove the transmission gap between human being and computer machine. Their usage in deep learning, machine learning with the help of different algorithms. In this paper, talk about different systems/methods of naturallanguage in the light of multiple six survey-based articles. Defined the basic structure of NLP, their different steps from input sentence to last outcome with the help of figure. Paper represents the NLP techniques, their structure, and their unavoidable importance in the filed of computer science as well as other fields.
Recent advances in vision-and-language modeling have seen the development of Transformer architectures that achieve remarkable performance on multimodal reasoning tasks. Yet, the exact capabilities of these black-box ...
详细信息
The problem of quantifying semantic relatedness level of two words is a fundamental sub-task for many naturallanguageprocessing systems. While there is a large body of research on measuring semantic relatedness in t...
详细信息
ISBN:
(纸本)9783319754772;9783319754765
The problem of quantifying semantic relatedness level of two words is a fundamental sub-task for many naturallanguageprocessing systems. While there is a large body of research on measuring semantic relatedness in the English language, the literature lacks detailed analysis for these methods in agglutinative languages. In this research, two new evaluation resources for the Turkish language are constructed. An extensive set of experiments involving multiple tasks: word association, semantic categorization, and automatic WordNet relationship discovery are performed to evaluate different semantic relatedness measures in the Turkish language. As Turkish is an agglutinative language, the morphological processing component is important for distributional similarity algorithms. For languages with rich morphological variations and productivity, methods ranging from simple stemming strategies to morphological disambiguation exists. In our experiments, different morphological processingmethods for the Turkish language are investigated.
暂无评论