ISBN:
(Print) 9789819794393; 9789819794409
Low-resource language translation remains a significant challenge in natural language processing, particularly for the Mongolian-Chinese language pair under the "Belt and Road" initiative. Existing translation systems struggle with this pair due to the scarcity of high-quality data. This paper addresses these challenges by combining multilingual k-nearest-neighbor machine translation (kNN-MT) with Chinese-centric methods. We constructed a robust multilingual datastore and introduced an incomplete-trust loss function to effectively manage low-quality data. Additionally, we implemented re-ranking techniques to further enhance the robustness and accuracy of the translation model. The experimental results indicate that this combined approach significantly improves Mongolian-Chinese translation quality on the mBART model, with a BLEU score increase of 3.81 points and a TER decrease of 0.0531. Our findings demonstrate that integrating kNN-MT with Chinese-centric methods and employing advanced loss functions and re-ranking techniques can effectively address data scarcity and quality issues, leading to substantial improvements in translation performance for low-resource language pairs.
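For readers unfamiliar with the retrieval mechanism the abstract builds on, below is a minimal sketch of a single kNN-MT decoding step in Python with NumPy: the base model's token distribution is interpolated with a distribution induced by the nearest neighbours in a datastore of (decoder hidden state, target token) pairs. The function name, hyperparameter values, and fixed interpolation weight are illustrative assumptions, not the paper's exact setup.

```python
# Minimal kNN-MT decoding step: interpolate the base model's token
# distribution with a distribution induced by nearest neighbours in a
# datastore of (decoder hidden state -> target token) pairs.
import numpy as np

def knn_mt_step(hidden, model_probs, datastore_keys, datastore_tokens,
                k=8, temperature=10.0, lam=0.5):
    """hidden: (d,) decoder state; model_probs: (V,) base MT distribution;
    datastore_keys: (N, d); datastore_tokens: (N,) target-token ids."""
    # L2 distances from the query state to every stored key.
    dists = np.linalg.norm(datastore_keys - hidden, axis=1)
    nn = np.argsort(dists)[:k]                      # k nearest neighbours
    # Turn negative distances into neighbour weights.
    weights = np.exp(-dists[nn] / temperature)
    weights /= weights.sum()
    # Scatter neighbour weights onto the vocabulary.
    knn_probs = np.zeros_like(model_probs)
    for w, tok in zip(weights, datastore_tokens[nn]):
        knn_probs[tok] += w
    # Final distribution: interpolation of retrieved and parametric knowledge.
    return lam * knn_probs + (1.0 - lam) * model_probs
```

The interpolation weight is what lets the datastore compensate for contexts the base model has rarely seen, which is why this style of retrieval suits low-resource pairs such as Mongolian-Chinese.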
Pre-trained chemical language models (CLMs) excel in the field of molecular property prediction, utilizing string-based molecular descriptors such as SMILES for learning universal representations. However, such string...
Topic-Dependent Argument Mining (TDAM), that is, extracting and classifying argument components for a specific topic from large document sources, is an inherently difficult task for machine learning models and humans a...
ISBN:
(Print) 9798891760615
Large Language Models (LLMs) have demonstrated impressive performance on a range of natural language processing (NLP) tasks. Unfortunately, the immense amount of computation and memory access required for LLM training makes such training prohibitively expensive in terms of hardware cost, and thus challenging to deploy in use cases such as on-device learning. In this paper, motivated by the observation that LLM training is memory-bound, we propose a novel dynamic quantization strategy, termed Dynamic Stashing Quantization (DSQ), which puts a special focus on reducing memory operations while also enjoying the other benefits of low-precision training, such as reduced arithmetic cost. We conduct a thorough study on two translation tasks (trained from scratch) and three classification tasks (fine-tuning). DSQ reduces the amount of arithmetic operations by 20.95x and the number of DRAM operations by 2.55x on IWSLT17 compared to standard 16-bit fixed-point training.
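The underlying idea, quantizing activations before they are stashed to DRAM for the backward pass, can be sketched in a few lines. The following is a generic quantize-before-stash illustration in NumPy, not the paper's exact DSQ scheme (which varies precision dynamically over training); the function names and the symmetric per-tensor scheme are assumptions for illustration.

```python
# Generic "quantize before stashing" sketch: activations saved for the
# backward pass are stored in int8 with a per-tensor scale, cutting the
# DRAM traffic that dominates memory-bound LLM training.
import numpy as np

def stash(activation, num_bits=8):
    """Quantize an fp32 activation tensor for cheap storage."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = np.abs(activation).max() / qmax + 1e-12   # symmetric per-tensor scale
    q = np.clip(np.round(activation / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def unstash(q, scale):
    """Dequantize when the backward pass needs the activation again."""
    return q.astype(np.float32) * scale
```

Since every stashed tensor moves through DRAM twice (store on forward, load on backward), shrinking it from 16-bit to 8-bit directly halves that traffic, which is the lever the paper's reported DRAM-operation savings rest on.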
The escalating complexity of modern software systems has rendered the management of requirements increasingly arduous, often plagued by redundancy, inconsistency, and inefficiency. Traditional manual methods prove inadequate for addressing the intricacies of dynamic, large-scale datasets. In response, this research introduces SQUIRE (Semantic Quick Requirements Engineering), a cutting-edge automated framework leveraging advanced natural language processing (NLP) techniques, specifically Sentence-BERT (SBERT) embeddings and hierarchical clustering, to semantically organize requirements into coherent functional clusters. SQUIRE is meticulously designed to enhance modularity, mitigate redundancy, and strengthen traceability within requirements engineering processes. Its efficacy is rigorously validated using real-world datasets from diverse domains, including attendance management, e-commerce systems, and school operations. Empirical evaluations reveal that SQUIRE outperforms conventional clustering methods, demonstrating superior intra-cluster cohesion and inter-cluster separation, while significantly reducing manual intervention. This research establishes SQUIRE as a scalable and domain-agnostic solution, effectively addressing the evolving complexities of contemporary software development. By streamlining requirements management and enabling software teams to focus on strategic initiatives, SQUIRE advances the state of NLP-driven methodologies in Requirements Engineering, offering a robust foundation for future innovations.
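A minimal sketch of the embed-then-cluster pipeline the abstract describes, using the sentence-transformers and SciPy libraries: each requirement is encoded with an SBERT model, and agglomerative clustering over cosine distances groups semantically similar requirements. The checkpoint name, example requirements, and distance threshold are illustrative; the paper specifies SBERT embeddings and hierarchical clustering but not these particulars.

```python
# Sketch of the SBERT-embed-then-cluster pipeline: encode each requirement,
# then group semantically similar ones with agglomerative clustering.
from sentence_transformers import SentenceTransformer
from scipy.cluster.hierarchy import linkage, fcluster

requirements = [
    "The system shall record student attendance daily.",
    "Teachers can view attendance reports per class.",
    "Customers may add items to a shopping cart.",
    "The store shall process credit card payments.",
]

# Model choice is illustrative; the paper names SBERT but not a checkpoint.
model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode(requirements, normalize_embeddings=True)

# Average-linkage hierarchical clustering on cosine distance.
tree = linkage(embeddings, method="average", metric="cosine")
labels = fcluster(tree, t=0.5, criterion="distance")  # threshold is illustrative
for req, lab in zip(requirements, labels):
    print(lab, req)
```

Cutting the dendrogram with a distance threshold, rather than a fixed cluster count, is what keeps such a pipeline domain-agnostic: the number of functional clusters falls out of the data instead of being chosen per project.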
Prompt-based learning is susceptible to intrinsic bias present in pre-trained language models (LMs), leading to sub-optimal performance in prompt-based zero/few-shot settings. In this work, we propose a null-input pro...
Recent advances in Large Language Models (LLMs) have sparked wide interest in validating and comprehending the human-like cognitive-behavioral traits LLMs may capture and exhibit. These cognitive-behavioral traits include typica...
Open Information Extraction (OpenIE) represents a crucial NLP task aimed at deriving structured information from unstructured text, unrestricted by relation type or domain. This survey paper provides an overview of OpenIE tech...
ISBN:
(Print) 9798891760615
Legal practice is intrinsically rooted in the fabric of language, yet legal practitioners and scholars have been slow to adopt tools from natural language processing (NLP). At the same time, the legal system is experiencing an access-to-justice crisis, which could be partially alleviated with NLP. In this position paper, we argue that the slow uptake of NLP in legal practice is exacerbated by a disconnect between the needs of the legal community and the focus of NLP researchers. In a review of recent trends in the legal NLP literature, we find limited overlap between the legal NLP community and legal academia. Our interpretation is that some of the most popular legal NLP tasks fail to address the needs of legal practitioners. We discuss examples of legal NLP tasks that promise to bridge disciplinary disconnects and highlight interesting areas for legal NLP research that remain underexplored.
ISBN:
(Print) 9798891760615
The success of ChatGPT validates the potential of large language models (LLMs) in artificial general intelligence (AGI). Subsequently, the release of LLMs has sparked the open-source community's interest in instruction tuning, which is deemed to accelerate the replication of ChatGPT. However, research on instruction-tuning LLMs in Chinese, the world's most spoken language, is still in its early stages. Therefore, this paper presents an in-depth empirical study of instruction-tuning LLMs in Chinese, which can serve as a cookbook of valuable findings for effectively customizing LLMs to better respond to Chinese instructions. Specifically, we systematically explore the impact of LLM bases, parameter-efficient methods, and instruction data types, the three most important elements for instruction tuning. Besides, we also conduct experiments to study the impact of other factors, e.g., chain-of-thought data and human-value alignment. We hope that this empirical study can make a modest contribution to the open Chinese version of ChatGPT. This paper releases a powerful Chinese LLM that is comparable to ChatGLM. The code and data are available at https://github.com/PhoebusSi/Alpaca-CoT.
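As one illustration of the "parameter-efficient methods" dimension the study compares, the sketch below fine-tunes a small causal LM on a single Chinese instruction example with LoRA via the peft library. The base model, target modules, and hyperparameters are placeholders, not the paper's configuration.

```python
# Parameter-efficient instruction tuning with LoRA: only small adapter
# matrices train, while the frozen base model provides the capability.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "bigscience/bloom-560m"   # placeholder small base model
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

lora = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                  target_modules=["query_key_value"],  # BLOOM attention proj
                  task_type="CAUSAL_LM")
model = get_peft_model(model, lora)
model.print_trainable_parameters()   # only the LoRA adapters train

# A Chinese instruction example formatted for supervised fine-tuning.
prompt = "指令：把下面的句子翻译成英文。\n输入：今天天气很好。\n输出："
batch = tokenizer(prompt + "The weather is nice today.", return_tensors="pt")
loss = model(**batch, labels=batch["input_ids"]).loss
loss.backward()   # gradients flow only through the adapter weights
```

Because only the adapter weights receive gradients, swapping LLM bases or instruction data types, the other two elements the study varies, requires no change to this training loop.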