Search Results - Inner Mongolia University Library

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

Authors: Ran, Yiting; Wang, Xintao; Xu, Rui; Yuan, Xinfeng; Liang, Jiaqing; Yang, Deqing; Xiao, Yanghua. Affiliations: School of Data Science, Fudan University, China; School of Computer Science, Fudan University, China

ISBN: (print) 9798891761681

Role-playing agents (RPA) have been a popular application area for large language models (LLMs), attracting significant interest from both industry and academia. While existing RPAs well portray the characters' knowledge and tones, they face challenges in capturing their minds, especially for small role-playing language models (RPLMs). In this paper, we propose to enhance RPLMs via personality-indicative data. Specifically, we leverage questions from psychological scales and distill advanced RPAs to generate dialogues that grasp the minds of characters. Experimental results validate that RPLMs trained with our dataset exhibit advanced role-playing capabilities for both general and personality-related evaluations. Code and data are available at https://***/alienet1109/RolePersonality. © 2024 Association for Computational Linguistics.

Keywords: Computational linguistics

Well Begun is Half Done: Generator-agnostic Knowledge Pre-Selection for Knowledge-Grounded Dialogue

Conference on Empirical Methods in Natural Language Processing (EMNLP)

Authors: Qin, Lang; Zhang, Yao; Liang, Hongru; Wang, Jun; Yang, Zhenglu. Affiliations: Nankai Univ TKLNDST CS Tianjin Peoples R China; Minist Educ Key Lab DISSec Beijing Peoples R China; Nankai Univ Sch Stat & Data Sci LPMC KLMDASR & LEBPS Tianjin Peoples R China; Sichuan Univ Coll Comp Sci Chengdu Peoples R China; Ludong Univ Shandong Key Lab Language Resource Dev & Applicat Coll Math & Stat Sci Yantai Shandong Peoples R China; Natl Press & Publicat Adm Educ Field Integrated Publishing Knowledge Min & Beijing Peoples R China

ISBN: (print) 9798891760608

Accurate knowledge selection is critical in knowledge-grounded dialogue systems. Towards a closer look at it, we offer a novel perspective to organize existing literature, i.e., knowledge selection coupled with, after, and before generation. We focus on the third underexplored category of study, which can not only select knowledge accurately in advance, but has the advantage to reduce the learning, adjustment, and interpretation burden of subsequent response generation models, especially LLMs. We propose GATE, a generator-agnostic knowledge selection method, to prepare knowledge for subsequent response generation models by selecting context-related knowledge among different knowledge structures and variable knowledge requirements. Experimental results demonstrate the superiority of GATE, and indicate that knowledge selection before generation is a lightweight yet effective way to facilitate LLMs (e.g., ChatGPT) to generate more informative responses.

Keywords: Speech processing

To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

Authors: Tian, Bozhong; Liang, Xiaozhuan; Cheng, Siyuan; Liu, Qingbin; Wang, Mengru; Sui, Dianbo; Chen, Xi; Chen, Huajun; Zhang, Ningyu. Affiliations: Zhejiang University, China; Platform and Content Group, Tencent, China; Harbin Institute of Technology, China

ISBN: (print) 9798891761681

Large Language Models (LLMs) trained on extensive corpora inevitably retain sensitive data, such as personal privacy information and copyrighted material. Recent advancements in knowledge unlearning involve updating LLM parameters to erase specific knowledge. However, current unlearning paradigms are mired in vague forgetting boundaries, often erasing knowledge indiscriminately. In this work, we introduce KnowUnDo, a benchmark containing copyrighted content and user privacy domains to evaluate if the unlearning process inadvertently erases essential knowledge. Our findings indicate that existing unlearning methods often suffer from excessive unlearning. To address this, we propose a simple yet effective method, MemFlex, which utilizes gradient information to precisely target and unlearn sensitive parameters. Experimental results show that MemFlex is superior to existing methods in both precise knowledge unlearning and general knowledge retaining of LLMs. © 2024 Association for Computational Linguistics.

Keywords: Differential privacy

Using Language Models to Disambiguate Lexical Choices in Translation

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

Authors: Barua, Josh; Subramanian, Sanjay; Yin, Kayo; Suhr, Alane. Affiliations: University of California, Berkeley, United States

ISBN: (print) 9798891761643

In translation, a concept represented by a single word in a source language can have multiple variations in a target language. The task of lexical selection requires using context to identify which variation is most appropriate for a source text. We work with native speakers of nine languages to create DTAiLS, a dataset of 1,377 sentence pairs that exhibit cross-lingual concept variation when translating from English. We evaluate recent LLMs and neural machine translation systems on DTAiLS, with the best-performing model, GPT-4, achieving from 67 to 85% accuracy across languages. Finally, we use language models to generate English rules describing target-language concept variations. Providing weaker models with high-quality lexical rules improves accuracy substantially, in some cases reaching or outperforming GPT-4. © 2024 Association for Computational Linguistics.

Keywords: Neural machine translation

Automated Dataset-Creation and Evaluation Pipeline for NER in Russian Literary Heritage

APPLIED SCIENCES-BASEL, 2025, Vol. 15, No. 4, pp. 2072-2072

Authors: Kassab, Kenan; Teslya, Nikolay; Vozhik, Ekaterina. Affiliations: Russian Acad Sci, SPC RAS, St Petersburg Fed Res Ctr, 14th Line 39, St Petersburg 199178, Russia; Russian Acad Sci, Inst Russian Literature Pushkinskij Dom, Makarova Emb 4, St Petersburg 199034, Russia

Developing robust and reliable models for Named Entity Recognition (NER) in the Russian language presents significant challenges due to the linguistic complexity of Russian and the limited availability of suitable training datasets. This study introduces a semi-automated methodology for building a customized Russian dataset for NER specifically designed for literary purposes. The paper provides a detailed description of the methodology employed for collecting and proofreading the dataset, outlining the pipeline used for processing and annotating its contents. A comprehensive analysis highlights the dataset's richness and diversity. Central to the proposed approach is the use of a voting system to facilitate the efficient elicitation of entities, enabling significant time and cost savings compared to traditional methods of constructing NER datasets. The voting system is described theoretically and mathematically to highlight its impact on enhancing the annotation process. The results of testing the voting system with various thresholds show its impact in increasing the overall precision by 28% compared to using only the state-of-the-art model for auto-annotating. The dataset is meticulously annotated and thoroughly proofread, ensuring its value as a high-quality resource for training and evaluating NER models. Empirical evaluations using multiple NER models underscore the dataset's importance and its potential to enhance the robustness and reliability of NER models in the Russian language.
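The abstract does not spell out how the voting system is defined, so the following is only a minimal sketch of one common form of threshold voting, assuming each auto-annotator returns a set of (start, end, label) spans. The function name `vote_entities`, the span format, and the three example models are hypothetical and are not taken from the paper.

```python
from collections import Counter

def vote_entities(predictions, threshold):
    """Keep entity spans proposed by at least `threshold` annotators.

    predictions: list of sets of (start, end, label) tuples, one set per model.
    """
    # Each model gets one vote per distinct span it proposes.
    counts = Counter(span for spans in predictions for span in set(spans))
    return {span for span, votes in counts.items() if votes >= threshold}

# Three hypothetical models annotating the same sentence.
model_a = {(0, 6, "PER"), (17, 23, "LOC")}
model_b = {(0, 6, "PER"), (17, 23, "ORG")}
model_c = {(0, 6, "PER"), (17, 23, "LOC"), (30, 34, "PER")}

accepted = vote_entities([model_a, model_b, model_c], threshold=2)
print(sorted(accepted))
```

Raising the threshold in this sketch trades recall for precision: spans that only one model proposes are dropped first.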

Keywords: natural language processing; named entity recognition; bidirectional encoder representations from transformers (BERT); multilingual models; text processing

Enhancing Legal Expertise in Large Language Models through Composite Model Integration: The Development and Evaluation of Law-Neo 6

6th Natural Legal Language Processing Workshop 2024, NLLP 2024, co-located with the 2024 Conference on Empirical Methods in Natural Language Processing

Authors: Liu, Zhihao; Zhu, Yanzhen; Lu, Mengyuan. Affiliations: Shandong University of Finance and Economics, China

ISBN: (print) 9798891761834

Although large language models (LLMs) like ChatGPT (OpenAI et al., 2024) have demonstrated considerable capabilities in general domains, they often lack proficiency in specialized fields. Enhancing a model's performance in a specific domain, such as law, while maintaining low costs, has been a significant challenge. Existing methods, such as fine-tuning or building mixture of experts (MoE) models, often struggle to balance model parameters, training costs, and domain-specific performance. Inspired by composition to augment language models (Bansal et al., 2024), we have developed Law-Neo, a novel model designed to enhance legal LLMs. This model significantly improves the model's legal domain expertise at minimal training costs, while retaining the logical capabilities of a large-scale anchor model. Our Law-Neo model outperformed other models in comprehensive experiments on multiple legal task benchmarks, demonstrating the effectiveness of this approach. ©2024 Association for Computational Linguistics.

Keywords: Costs

Leading Whitespaces of Language Models' Subword Vocabulary Pose a Confound for Calculating Word Probabilities

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

Authors: Oh, Byung-Doh; Schuler, William. Affiliations: Center for Data Science, New York University, United States; Department of Linguistics, The Ohio State University, United States

ISBN: (print) 9798891761643

Predictions of word-by-word conditional probabilities from Transformer-based language models are often evaluated to model the incremental processing difficulty of human readers. In this paper, we argue that there is a confound posed by the most common method of aggregating subword probabilities of such language models into word probabilities. This is due to the fact that tokens in the subword vocabulary of most language models have leading whitespaces and therefore do not naturally define stop probabilities of words. We first prove that this can result in distributions over word probabilities that sum to more than one, thereby violating the axiom that P(Ω) = 1. This property results in a misallocation of word-by-word surprisal, where the unacceptability of the end of the current word is incorrectly carried over to the next word. Additionally, this implicit prediction of word boundaries incorrectly models psycholinguistic experiments where human subjects directly observe upcoming word boundaries. We present a simple decoding technique to reaccount the probability of the trailing whitespace into that of the current word, which resolves this confound. Experiments show that this correction reveals lower estimates of garden-path effects in transitive/intransitive sentences and poorer fits to naturalistic reading times. © 2024 Association for Computational Linguistics.
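The confound described above can be illustrated with a toy example. This is a sketch under simplifying assumptions, not the authors' implementation: the three-entry next-token table, the `_` marker for a leading whitespace, and all function names are hypothetical, and the toy conditions on the previous word having already ended, so any renormalization over the leading whitespace is left out.

```python
# Toy next-token distributions; "_" marks a token with a leading whitespace.
# Keys are the tokens generated so far for the current word.
NEXT = {
    (): {"_a": 1.0},                  # start of a new word
    ("_a",): {"b": 0.5, "_a": 0.5},   # continue with "b", or start a new word
    ("_a", "b"): {"_a": 1.0},         # "ab" can only be followed by a new word
}
WORDS = {"a": ("_a",), "ab": ("_a", "b")}

def naive_prob(tokens):
    """Product of subword probabilities, ignoring where the word ends."""
    p = 1.0
    for i, tok in enumerate(tokens):
        p *= NEXT[tokens[:i]].get(tok, 0.0)
    return p

def boundary_prob(tokens):
    """Probability that the next token starts with whitespace (the word ends)."""
    return sum(p for tok, p in NEXT[tokens].items() if tok.startswith("_"))

def corrected_prob(tokens):
    """Fold the trailing-whitespace probability into the current word."""
    return naive_prob(tokens) * boundary_prob(tokens)

naive_total = sum(naive_prob(t) for t in WORDS.values())
corrected_total = sum(corrected_prob(t) for t in WORDS.values())
print(naive_total, corrected_total)
```

In this toy, the naive word probabilities for "a" and "ab" add up to 1.5, while the boundary-aware ones add up to 1.0.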

Keywords: Computational linguistics

Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP)

Authors: Ahia, Orevaoghene; Kumar, Sachin; Gonen, Hila; Kasai, Jungo; Mortensen, David R.; Smith, Noah A.; Tsvetkov, Yulia. Affiliations: Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195, USA; Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA, USA; Allen Inst Artificial Intelligence, Seattle, WA, USA

ISBN: (print) 9798891760608

Language models have graduated from being research prototypes to commercialized products offered as web APIs, and recent works have highlighted the multilingual capabilities of these products. The API vendors charge their users based on usage, more specifically on the number of "tokens" processed or generated by the underlying language models. What constitutes a token, however, is training data and model dependent with a large variance in the number of tokens required to convey the same information in different languages. In this work, we analyze the effect of this non-uniformity on the fairness of an API's pricing policy across languages. We conduct a systematic analysis of the cost and utility of OpenAI's language model API on multilingual benchmarks in 22 typologically diverse languages. We show evidence that speakers of a large number of the supported languages are overcharged while obtaining poorer results. These speakers tend to also come from regions where the APIs are less affordable to begin with. Through these analyses, we aim to increase transparency around language model APIs' pricing policies and encourage the vendors to make them more equitable.
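To make the per-token pricing argument concrete, here is a rough sketch of how the same short message can cost different amounts across languages. It does not use any vendor's tokenizer: counting one token per UTF-8 byte is a deliberately crude stand-in, and the price constant and the function names are hypothetical.

```python
PRICE_PER_1K_TOKENS = 0.002  # hypothetical price, not a real vendor rate

def proxy_token_count(text):
    """Crude stand-in for a tokenizer: one token per UTF-8 byte."""
    return len(text.encode("utf-8"))

def cost(text):
    """Usage-based cost under the hypothetical per-token price."""
    return proxy_token_count(text) / 1000 * PRICE_PER_1K_TOKENS

# The same short message in English, Russian, and Hindi.
sentences = {"en": "Thank you", "ru": "Спасибо", "hi": "धन्यवाद"}
baseline = proxy_token_count(sentences["en"])

for lang, text in sentences.items():
    n = proxy_token_count(text)
    # Ratio > 1 means this language pays more than English for the same message.
    print(lang, n, round(n / baseline, 2))
```

A real analysis would replace `proxy_token_count` with the tokenizer of the API being studied; the byte proxy only shows the direction of the effect for non-Latin scripts.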

Keywords: Cost benefit analysis

Unveiling the mystery of visual attributes of concrete and abstract concepts: Variability, nearest neighbors, and challenging categories

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

Authors: Tater, Tarun; Walde, Sabine Schulte im; Frassinelli, Diego. Affiliations: Institute for Natural Language Processing, University of Stuttgart, Germany; MaiNLP, Center for Information and Language Processing, LMU Munich, Germany

ISBN: (print) 9798891761643

The visual representation of a concept varies significantly depending on its meaning and the context where it occurs; this poses multiple challenges both for vision and multimodal models. Our study focuses on concreteness, a well-researched lexical-semantic variable, using it as a case study to examine the variability in visual representations. We rely on images associated with approximately 1,000 abstract and concrete concepts extracted from two different datasets: Bing and YFCC. Our goals are: (i) evaluate whether visual diversity in the depiction of concepts can reliably distinguish between concrete and abstract concepts; (ii) analyze the variability of visual features across multiple images of the same concept through a nearest neighbor analysis; and (iii) identify challenging factors contributing to this variability by categorizing and annotating images. Our findings indicate that for classifying images of abstract versus concrete concepts, a combination of basic visual features such as color and texture is more effective than features extracted by more complex models like Vision Transformer (ViT). However, ViTs show better performances in the nearest neighbor analysis, emphasizing the need for a careful selection of visual features when analyzing conceptual variables through modalities other than text. © 2024 Association for Computational Linguistics.

Keywords: Computational linguistics

DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

Authors: Jin, Jing; Wang, Houfeng; Zhang, Hao; Li, Xiaoguang; Guo, Zhijiang. Affiliations: National Key Laboratory of Multimedia Information Processing, School of Computer Science, Peking University, China; Huawei Noah's Ark Lab, Canada

ISBN: (print) 9798891761643

Large language models (LLMs) are widely used in question-answering (QA) systems but often generate information with hallucinations. Retrieval-augmented generation (RAG) offers a potential remedy, yet the uneven retrieval quality and irrelevant contents may distract LLMs. In this work, we address these issues at the generation phase by treating RAG as a multi-document QA task. We propose a novel decoding strategy, Dynamic Contrastive Decoding (DVD), which dynamically amplifies knowledge from selected documents during the generation phase. DVD involves constructing inputs batchwise, designing new selection criteria to identify documents worth amplifying, and applying contrastive decoding with a specialized weight calculation to adjust the final logits used for sampling answer tokens. Zero-shot experimental results on ALCE-ASQA, NQ, TQA and PopQA benchmarks show that our method outperforms other decoding strategies. Additionally, we conduct experiments to validate the effectiveness of our selection criteria, weight calculation, and general multi-document scenarios. Our method requires no training and can be integrated with other methods to improve the RAG performance. Our codes will be publicly available at https://***/JulieJin-km/Dynamic_Contrastive_Decoding. © 2024 Association for Computational Linguistics.
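The abstract does not give DVD's selection criteria or its weight calculation, so the sketch below shows only the generic contrastive-decoding step that such methods build on: logits computed with a document are pushed further away from logits computed without it. The fixed `weight`, the three-word vocabulary, and the logit values are assumptions made up for illustration.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def contrastive_logits(with_doc, without_doc, weight):
    """Amplify what the document adds: l_doc + weight * (l_doc - l_no_doc)."""
    return [d + weight * (d - n) for d, n in zip(with_doc, without_doc)]

# Hypothetical logits for one decoding step over a three-word vocabulary.
vocab = ["Paris", "Lyon", "Rome"]
without_doc = [1.0, 1.0, 1.0]   # model alone is undecided
with_doc = [2.0, 1.0, 0.5]      # the retrieved document favors "Paris"

adjusted = contrastive_logits(with_doc, without_doc, weight=1.0)
probs_before = softmax(with_doc)
probs_after = softmax(adjusted)
best = vocab[max(range(len(vocab)), key=lambda i: adjusted[i])]
print(best, adjusted)
```

With these numbers the adjustment sharpens the distribution toward the token the document supports; a dynamic method would compute the weight per step or per document instead of fixing it.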

Keywords: Question answering
