检索结果-内蒙古大学图书馆

2024 conference on empirical methods in natural language processing, EMNLP 2024

作者： Kai, Jushi Hou, Shengyuan Huang, Yusheng Lin, Zhouhan Shanghai Jiao Tong University China

ISBN: (纸本)9798891761681

Grammar induction has made significant progress in recent years. However, it is not clear how the application of induced grammar could enhance practical performance in downstream tasks. In this work, we introduce an unsupervised grammar induction method for language understanding and generation. We construct a grammar parser to induce constituency structures and dependency relations, which is simultaneously trained on downstream tasks without additional syntax annotations. The induced grammar features are subsequently incorporated into Transformer as a syntactic mask to guide self-attention. We evaluate and apply our method to multiple machine translation tasks and natural language understanding tasks. Our method demonstrates superior performance compared to the original Transformer and other models enhanced with external parsers. Experimental results indicate that our method is effective in both from-scratch and pre-trained scenarios. Additionally, our research highlights the contribution of explicitly modeling the grammatical structure of texts to neural network models. © 2024 Association for Computational Linguistics.

关键词： Syntactics

来源：评论

学校读者我要写书评

暂无评论

Reader: Model-based language-instructed reinforcement learning

Reader: Model-based language-instructed reinforcement learni...

引用

conference on empirical methods in natural language processing (EMNLP)

作者： Dainese, Nicola Marttinen, Pekka Ilin, Alexander Aalto Univ Dept Comp Sci Espoo Finland

ISBN: (纸本)9798891760608

We explore how we can build accurate world models, which are partially specified by language, and how we can plan with them in the face of novelty and uncertainty. We propose the first model-based reinforcement learning approach to tackle the environment Read To Fight Monsters (Zhong et al., 2019), a grounded policy learning problem. In RTFM an agent has to reason over a set of rules and a goal, both described in a language manual, and the observations, while taking into account the uncertainty arising from the stochasticity of the environment, in order to generalize successfully its policy to test episodes. We demonstrate the superior performance and sample efficiency of our model-based approach to the existing model-free SOTA agents in eight variants of RTFM. Furthermore, we show how the agent's plans can be inspected, which represents progress towards more interpretable agents.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

InfiniPot: Infinite Context processing on Memory-Constrained LLMs

InfiniPot: Infinite Context Processing on Memory-Constrained...

引用

2024 conference on empirical methods in natural language processing, EMNLP 2024

作者： Kim, Minsoo Shim, Kyuhong Choi, Jungwook Chang, Simyung Hanyang University Korea Republic of Qualcomm AI Research Qualcomm Korea YH Korea Republic of

ISBN: (纸本)9798891761643

Handling long input contexts remains a significant challenge for Large language Models (LLMs), particularly in resource-constrained environments such as mobile devices. Our work aims to address this limitation by introducing InfiniPot, a novel KV cache control framework designed to enable pre-trained LLMs to manage extensive sequences within fixed memory constraints efficiently, without requiring additional training. InfiniPot leverages Continual Context Distillation (CCD), an iterative process that compresses and retains essential information through novel importance metrics, effectively maintaining critical data even without access to future context. Our comprehensive evaluations indicate that InfiniPot significantly outperforms models trained for long contexts in various NLP tasks, establishing its efficacy and versatility. This work represents a substantial advancement toward making LLMs applicable to a broader range of real-world scenarios. © 2024 Association for Computational Linguistics.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

Adapting LLMs for Structured natural language API Integration

Adapting LLMs for Structured Natural Language API Integratio...

引用

2024 conference on empirical methods in natural language processing, EMNLP 2024

作者： Chan, Robin Mirylenka, Katsiaryna Gschwind, Thomas Miksovic-Czasch, Christoph Scotton, Paolo Toniato, Enrico Labbi, Abdel IBM Research United States ETH Zürich Switzerland

ISBN: (纸本)9798891761667

API integration is crucial for enterprise systems, as it enables seamless interaction between applications within workflows. However, the diversity and complexity of the API landscape present significant challenges in combining API calls based on user intent. Existing methods rely on named entity recognition (NER) and knowledge graphs, but struggle to generate more complex control flow structures, such as conditionals and loops. We propose a novel framework that leverages the success of large language models (LLMs) in code generation to integrate APIs based on natural language input. Our approach involves fine-tuning an LLM using automatically generated API flows derived from OpenAPI specifications. We further evaluate the effectiveness of enforcing the syntax and schema adherence through constrained decoding. To enable systematic comparison, we introduce targeted test suites to assess the generalization capabilities of these approaches and their ability to retain structured knowledge. Our findings show that LLMs fine-tuned on OpenAPI specifications can (a) learn structural API constraints implicitly during training, and (b) achieve significant improvements in both in-distribution and out-of-distribution performance over NER and retrieval-augmented generation (RAG)-based approaches. © 2024 Association for Computational Linguistics.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

Hallucination Mitigation in natural language Generation from Large-Scale Open-Domain Knowledge Graphs

Hallucination Mitigation in Natural Language Generation from...

引用

conference on empirical methods in natural language processing (EMNLP)

作者： Shi, Xiao Zhu, Zhengyuan Zhang, Zeyu Li, Chengkai Univ Texas Arlington Dept Comp Sci & Engn Arlington TX 76019 USA

ISBN: (纸本)9798891760608

In generating natural language descriptions for knowledge graph triples, prior works used either small-scale, human-annotated datasets or datasets with limited variety of graph shapes, e.g., those having mostly star graphs. Graph-to-text models trained and evaluated on such datasets are largely not assessed for more realistic large-scale, open-domain settings. We introduce a new dataset, GraphNarrative, to fill this gap. Fine-tuning transformer-based pre-trained language models has achieved state-of-the-art performance among graph-to-text models. However, this method suffers from information hallucination-the generated text may contain fabricated facts not present in input graphs. We propose a novel approach that, given a graph-sentence pair in GraphNarrative, trims the sentence to eliminate portions that are not present in the corresponding graph, by utilizing the sentence's dependency parse tree. Our experiment results verify this approach using models trained on GraphNarrative and existing datasets. The dataset, source code, and trained models are released at https://***/idirlab/graphnarrator.

关键词： Trees (mathematics)

来源：评论

学校读者我要写书评

暂无评论

Leveraging encoder-only large language models for mobile app review feature extraction

引用

empirical SOFTWARE ENGINEERING 2025年第3期30卷 1-37页

作者： Motger, Quim Miaschi, Alessio Dell'Orletta, Felice Franch, Xavier Marco, Jordi Univ Politecn Cataluna Dept Serv & Informat Syst Engn Barcelona Spain Inst Computat Linguist A Zampolli ILC CNR ItaliaNLP Lab Pisa Italy Univ Politecn Cataluna Dept Comp Sci Barcelona Spain

Mobile app review analysis presents unique challenges due to the low quality, subjective bias, and noisy content of user-generated documents. Extracting features from these reviews is essential for tasks such as feature prioritization and sentiment analysis, but it remains a challenging task. Meanwhile, encoder-only models based on the Transformer architecture have shown promising results for classification and information extraction tasks for multiple software engineering processes. This study explores the hypothesis that encoder-only large language models can enhance feature extraction from mobile app reviews. By leveraging crowdsourced annotations from an industrial context, we redefine feature extraction as a supervised token classification task. Our approach includes extending the pre-training of these models with a large corpus of user reviews to improve contextual understanding and employing instance selection techniques to optimize model fine-tuning. empirical evaluations demonstrate that these methods improve the precision and recall of extracted features and enhance performance efficiency. Key contributions include a novel approach to feature extraction, annotated datasets, extended pre-trained models, and an instance selection mechanism for cost-effective fine-tuning. This research provides practical methods and empirical evidence in applying large language models to natural language processing tasks within mobile app reviews, offering improved performance in feature extraction.

关键词： Mobile app reviews Feature extraction Named-entity recognition Large language models Extended pre-training Instance selection

来源：评论

学校读者我要写书评

暂无评论

Reformulating NLP tasks to Capture Longitudinal Manifestation of language Disorders in People with Dementia.

Reformulating NLP tasks to Capture Longitudinal Manifestatio...

引用

conference on empirical methods in natural language processing (EMNLP)

作者： Gkoumas, Dimitris Purver, Matthew Liakata, Maria Queen Mary Univ London London England Alan Turing Inst London England Jozef Stefan Inst Ljubljana Slovenia

ISBN: (纸本)9798891760608

Dementia is associated with language disorders which impede communication. Here, we automatically learn linguistic disorder patterns by making use of a moderately-sized pre-trained language model and forcing it to focus on reformulated natural language processing (NLP) tasks and associated linguistic patterns. Our experiments show that NLP tasks that encapsulate contextual information and enhance the gradient signal with linguistic patterns benefit performance. We then use the probability estimates from the best model to construct digital linguistic markers measuring the overall quality in communication and the intensity of a variety of language disorders. We investigate how the digital markers characterize dementia speech from a longitudinal perspective. We find that our proposed communication marker is able to robustly and reliably characterize the language of people with dementia, outperforming existing linguistic approaches;and shows external validity via significant correlation with clinical markers of behaviour. Finally, our proposed linguistic disorder markers provide useful insights into gradual language impairment associated with disease progression.

关键词： Neurodegenerative diseases

来源：评论

学校读者我要写书评

暂无评论

Red Teaming language Models for processing Contradictory Dialogues

Red Teaming Language Models for Processing Contradictory Dia...

引用

2024 conference on empirical methods in natural language processing, EMNLP 2024

作者： Wen, Xiaofei Li, Bangzheng Huang, Tenghao Chen, Muhao University of California Davis United States University of Southern California United States

ISBN: (纸本)9798891761643

Most language models currently available are prone to self-contradiction during dialogues. To mitigate this issue, this study explores a novel contradictory dialogue processing task that aims to detect and modify contradictory statements in a conversation. This task is inspired by research on context faithfulness and dialogue comprehension, which have demonstrated that the detection and understanding of contradictions often necessitate detailed explanations. We develop a dataset comprising contradictory dialogues, in which one side of the conversation contradicts itself. Each dialogue is accompanied by an explanatory label that highlights the location and details of the contradiction. With this dataset, we present a Red Teaming framework for contradictory dialogue processing. The framework detects and attempts to explain the dialogue, then modifies the existing contradictory content using the explanation. Our experiments demonstrate that the framework improves the ability to detect contradictory dialogues and provides valid explanations. Additionally, it showcases distinct capabilities for modifying such dialogues. Our study highlights the importance of the logical inconsistency problem in conversational AI. © 2024 Association for Computational Linguistics.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

VivesDebate-Speech: A Corpus of Spoken Argumentation to Leverage Audio Features for Argument Mining

VivesDebate-Speech: A Corpus of Spoken Argumentation to Leve...

引用

conference on empirical methods in natural language processing (EMNLP)

作者： Ruiz-Dolz, Ramon Iranzo-Sanchez, Javier Univ Dundee Ctr Argument Technol Dundee DD1 4HN Scotland Univ Politecn Valencia VRAIN MLLP Valencia 46022 Spain

ISBN: (纸本)9798891760608

In this paper, we describe VivesDebate-Speech, a corpus of spoken argumentation created to leverage audio features for argument mining tasks. The creation of this corpus represents an important contribution to the intersection of speech processing and argument mining communities, and one of the most complete publicly available resources in this topic. Moreover, we have performed a set of first-of-their-kind experiments which show an improvement when integrating audio features into the argument mining pipeline. The provided results can be used as a baseline for future research.

关键词： Speech processing

来源：评论

学校读者我要写书评

暂无评论

Story Embeddings - Narrative-Focused Representations of Fictional Stories

Story Embeddings - Narrative-Focused Representations of Fict...

引用

2024 conference on empirical methods in natural language processing, EMNLP 2024

作者： Hatzel, Hans Ole Biemann, Chris Universität Hamburg Language Technology Group Germany

ISBN: (纸本)9798891761643

We present a novel approach to modeling fictional narratives. The proposed model creates embeddings that represent a story such that similar narratives, that is, reformulations of the same story, will result in similar embeddings. We showcase the prowess of our narrative-focused embeddings on various datasets, exhibiting state-of-the-art performance on multiple retrieval tasks. The embeddings also show promising results on a narrative understanding task. Additionally, we perform an annotation-based evaluation to validate that our introduced computational notion of narrative similarity aligns with human perception. The approach can help to explore vast datasets of stories, with potential applications in recommender systems and in the computational analysis of literature. © 2024 Association for Computational Linguistics.

关键词： Embeddings

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：