With the rising popularity of Transformer-based large language models (LLMs), reducing their high inference costs has become a significant research focus. One effective approach is to compress the long input contexts....
Recent developments show that Large Language Models (LLMs) produce state-of-the-art performance on natural language (NL) to code generation for resource-rich general-purpose languages like C++, Java, and ***, their pr...
ISBN (print): 9798891760608
Planning for goal-oriented dialogue often requires simulating future dialogue interactions and estimating task progress. Many approaches thus consider training neural networks to perform look-ahead search algorithms such as A* search and Monte Carlo Tree Search (MCTS). However, this training often requires abundant annotated data, which creates challenges when faced with noisy annotations or low-resource settings. We introduce GDP-Zero, an approach using Open-Loop MCTS to perform goal-oriented dialogue policy planning without any model training. GDP-Zero prompts a large language model to act as a policy prior, value function, user simulator, and system model during the tree search. We evaluate GDP-Zero on the goal-oriented task PersuasionForGood, and find that its responses are preferred over ChatGPT up to 59.32% of the time, and are rated more persuasive than ChatGPT during interactive evaluations.
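The search procedure the abstract describes can be sketched in a few dozen lines. This is a minimal, self-contained illustration of Open-Loop MCTS over dialogue actions, not the paper's implementation: the functions `llm_policy_prior`, `llm_value`, and `llm_simulate_turn` are hypothetical deterministic stand-ins for the prompts GDP-Zero would send to an LLM, and the action set is invented for the example.

```python
import math

# Invented dialogue-act set for illustration (not from the paper).
ACTIONS = ["emotion_appeal", "logical_appeal", "task_info", "donation_ask"]

def llm_policy_prior(history):
    # Stand-in for prompting the LLM for action probabilities: uniform prior.
    return {a: 1.0 / len(ACTIONS) for a in ACTIONS}

def llm_value(history):
    # Stand-in heuristic for the LLM value prompt: reward asking for a
    # donation only after at least one preceding appeal/info turn.
    if "donation_ask" in history and history.index("donation_ask") > 0:
        return 1.0
    return 0.0

def llm_simulate_turn(history, action):
    # Open-loop: the tree is keyed by the action sequence alone; the user
    # turn the LLM would simulate is folded into the value estimate here.
    return history + [action]

class Node:
    def __init__(self, prior):
        self.prior = prior
        self.visits = 0
        self.value_sum = 0.0
        self.children = {}  # action -> Node

    def value(self):
        return self.value_sum / self.visits if self.visits else 0.0

def ucb(parent, child, c=1.5):
    # PUCT-style score: exploit mean value, explore prior-weighted novelty.
    return child.value() + c * child.prior * math.sqrt(parent.visits) / (1 + child.visits)

def mcts(history, n_sim=200, depth=2):
    root = Node(prior=1.0)
    for _ in range(n_sim):
        node, path, h = root, [], list(history)
        for _ in range(depth):
            if not node.children:  # expand with the policy prior
                for a, p in llm_policy_prior(h).items():
                    node.children[a] = Node(prior=p)
            a, child = max(node.children.items(), key=lambda kv: ucb(node, kv[1]))
            h = llm_simulate_turn(h, a)
            path.append(child)
            node = child
        reward = llm_value(h)  # leaf evaluation by the (stand-in) value function
        root.visits += 1
        for n in path:  # backpropagate along the visited path
            n.visits += 1
            n.value_sum += reward
    # Return the most-visited root action as the planned system move.
    return max(root.children.items(), key=lambda kv: kv[1].visits)[0]
```

Under this toy value function the search learns to open with an appeal or information turn rather than an immediate donation request, which is the qualitative behavior the tree search is meant to produce.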
ISBN (print): 9798891760608
Many annotation tasks in natural language processing are highly subjective in that there can be different valid and justified perspectives on what is a proper label for a given example. This also applies to the judgment of argument quality, where the assignment of a single ground truth is often questionable. At the same time, there are generally accepted concepts behind argumentation that form a common ground. To best represent the interplay of individual and shared perspectives, we consider a continuum of approaches ranging from models that fully aggregate perspectives into a majority label to "share-nothing" architectures in which each annotator is considered in isolation from all other annotators. In between these extremes, inspired by models used in the field of recommender systems, we investigate the extent to which architectures that include layers to model the relations between different annotators are beneficial for predicting single-annotator labels. By means of two tasks of argument quality classification (argument concreteness and validity/novelty of conclusions), we show that recommender architectures increase the averaged annotator-individual F1-scores up to 43% over a majority-label model. Our findings indicate that approaches to subjectivity can benefit from relating individual perspectives.
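The middle of the continuum the abstract describes can be illustrated with a toy model: a shared text representation combined with a learned per-annotator embedding, so two annotators can receive different predictions for the same argument. This is a stdlib-only sketch under invented names (`AnnotatorModel`, a bag-of-words featurizer, plain SGD), not the paper's architecture.

```python
import math
import random

def featurize(text, vocab):
    # Bag-of-words counts over a fixed vocabulary (toy text encoder).
    toks = text.lower().split()
    return [toks.count(w) for w in vocab]

class AnnotatorModel:
    """Recommender-style sketch: prediction = sigmoid(annotator_emb . (W x))."""

    def __init__(self, vocab, annotators, dim=4, seed=0):
        rng = random.Random(seed)
        self.vocab = vocab
        # Shared projection from bag-of-words to a small latent space.
        self.W = [[rng.uniform(-0.1, 0.1) for _ in vocab] for _ in range(dim)]
        # One embedding per annotator: the "recommender" layer that lets
        # predictions differ across annotators on identical input.
        self.emb = {a: [rng.uniform(-0.1, 0.1) for _ in range(dim)] for a in annotators}

    def forward(self, text, annotator):
        x = featurize(text, self.vocab)
        z = [sum(wi * xi for wi, xi in zip(row, x)) for row in self.W]
        s = sum(e * zi for e, zi in zip(self.emb[annotator], z))
        return 1 / (1 + math.exp(-s))

    def train(self, data, lr=0.5, epochs=200):
        # data: (text, annotator, label in {0, 1}); plain SGD on log loss.
        for _ in range(epochs):
            for text, a, y in data:
                x = featurize(text, self.vocab)
                z = [sum(wi * xi for wi, xi in zip(row, x)) for row in self.W]
                p = 1 / (1 + math.exp(-sum(e * zi for e, zi in zip(self.emb[a], z))))
                g = p - y  # d(log-loss)/d(score)
                e = self.emb[a]
                for d in range(len(z)):
                    ge = g * z[d]      # gradient w.r.t. the annotator embedding
                    gz = g * e[d]      # gradient flowing into the shared layer
                    e[d] -= lr * ge
                    for j in range(len(x)):
                        self.W[d][j] -= lr * gz * x[j]
```

Trained on two annotators who assign opposite labels to the same argument, the model fits both: the shared layer carries the common representation while the embeddings encode each annotator's perspective, which is the interplay the abstract targets.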
Generative language models often struggle with specialized or less-discussed knowledge. A potential solution is found in Retrieval-Augmented Generation (RAG) models, which retrieve information before generat...
As one of the common rhetorical devices, puns play a vital role in linguistic study, including the comprehensive analysis of linguistic humor. Although large language models (LLMs) have been widely explored on various...
There is a growing interest in expanding the input capacity of language models (LMs) across various domains. However, simply increasing the context window does not guarantee robust performance across diverse long-inpu...
We introduce GenerativeDictionary, a novel dictionary system that generates word sense interpretations based on the given context. Our approach involves transforming context sentences to highlight the meaning of targe...
We study the presence of heteronormative biases and prejudice against interracial romantic relationships in large language models by performing controlled name-replacement experiments for the task of relationship pred...
Owing to the increased video content consumption in recent years, the need for advanced contextual advertising methods that leverage increasing user engagement and relevance on advertisement-based video-on-demand platforms has increased. Traditional behavior-based advertisement targeting is waning, particularly owing to the recent strict privacy policies that favor user consent and privacy. This study proposes an innovative approach for integrating advanced natural language processing with multimodal analysis for video contextual advertising. To this end, transformer-based architectures, specifically BERTopic, computer vision techniques, and large language models were used to extract sets of topics from visual and textual video data automatically and systematically. The proposed framework decodes the taxonomy of content efficiently through videos in different levels of noise and languages. Empirical analysis of the YouTube-8M dataset shows the potential for the approach to change the paradigm in video advertising. Built to be scalable and easily adaptable, this solution handles multifarious and complex user-generated content well, making it suited for a wide range of applications across various media platforms.
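The fusion step of such a pipeline (merging transcript text with visual concept labels before topic extraction) can be shown in miniature. This stdlib sketch substitutes simple keyword counting for the BERTopic and vision components the abstract names; the function names, stopword list, and data are invented for illustration only.

```python
from collections import Counter

# Tiny stopword list for the toy example (not a real NLP resource).
STOPWORDS = {"the", "a", "and", "to", "of", "in", "is", "for"}

def video_document(transcript, visual_labels):
    # Fuse modalities by concatenating transcript tokens with the labels
    # a vision model would emit for the video's frames.
    return transcript.lower().split() + [l.lower() for l in visual_labels]

def top_topics(videos, k=3):
    # videos: list of (transcript, visual_labels) pairs.
    # Frequency-based keyword extraction stands in for BERTopic's
    # embedding + clustering pipeline.
    counts = Counter()
    for transcript, labels in videos:
        counts.update(t for t in video_document(transcript, labels)
                      if t not in STOPWORDS)
    return [w for w, _ in counts.most_common(k)]
```

Even in this reduced form, the point survives: signals from both modalities land in one document per video, so topics that are weak in either channel alone can dominate once the channels are combined.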