With the rapid advancement of machine translation research, evaluation toolkits have become essential for benchmarking system progress. Tools like COMET and SacreBLEU offer single quality score assessments that are ef...
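As an illustrative aside (not from the paper above), the single-quality-score workflow such toolkits expose looks roughly like the following, using SacreBLEU's corpus-level API; the example sentences are invented:

```python
# Minimal sketch of computing a single corpus-level quality score with
# SacreBLEU (requires `pip install sacrebleu`). Sentences are invented.
import sacrebleu

hypotheses = ["The cat sat on the mat."]
references = [["The cat is sitting on the mat."]]  # one reference stream

# corpus_bleu returns a BLEUScore object; .score is one float in [0, 100].
bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(f"BLEU = {bleu.score:.2f}")
```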
ISBN (print): 9798891760608
The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by fine-tuning open-access LLMs with task-specific data (e.g., ChatDoctor) or instruction data (e.g., Alpaca). Among the various fine-tuning methods, adapter-based parameter-efficient fine-tuning (PEFT) is undoubtedly one of the most attractive topics, as it requires fine-tuning only a few external parameters instead of the entire LLM while achieving comparable or even better performance. To enable further research on PEFT methods for LLMs, this paper presents LLM-Adapters, an easy-to-use framework that integrates various adapters into LLMs and can execute these adapter-based PEFT methods for different tasks. The framework includes state-of-the-art open-access LLMs such as LLaMA, BLOOM, and GPT-J, as well as widely used adapters such as Series adapters, Parallel adapters, Prompt-based learning, and Reparametrization-based methods. Moreover, we conduct extensive empirical studies on the impact of adapter types, placement locations, and hyper-parameters to determine the best design for each adapter-based method. We evaluate the effectiveness of the adapters on fourteen datasets from two different reasoning tasks, Arithmetic Reasoning and Commonsense Reasoning. The results demonstrate that using adapter-based PEFT in smaller-scale LLMs (7B) with few extra trainable parameters yields comparable, and in some cases superior, performance to powerful LLMs (175B) in zero-shot inference on both reasoning tasks. The code and datasets can be found at https://***/AGI-Edgerunners/LLM-Adapters.
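For readers unfamiliar with the mechanics, here is a minimal sketch (not the LLM-Adapters implementation; the bottleneck width, activation, and placement are illustrative assumptions) of a Series bottleneck adapter of the kind the framework inserts into frozen transformer layers:

```python
# Sketch of a Series (bottleneck) adapter inserted after a frozen
# transformer sub-layer. Dimensions here are illustrative assumptions.
import torch
import torch.nn as nn

class SeriesAdapter(nn.Module):
    def __init__(self, hidden_size: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)  # project down
        self.up = nn.Linear(bottleneck, hidden_size)    # project back up
        self.act = nn.ReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # The residual connection means a near-identity adapter leaves the
        # frozen base model's output essentially unchanged at initialization.
        return x + self.up(self.act(self.down(x)))
```

Because only the down/up projections are trained while the base LLM's weights stay frozen, the trainable-parameter count stays small, which is the property the paper exploits at the 7B scale.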
We present a comprehensive evaluation of large language models for multilingual readability assessment. Existing evaluation resources lack domain and language diversity, limiting the ability for cross-domain and cross...
To enhance a question-answering system for automotive drivers, we tackle the problem of automatic generation of icon image descriptions. The descriptions can match the driver’s query about the icon appearing on the d...
The common toxicity and societal bias in contents generated by large language models (LLMs) necessitate strategies to reduce harm. Present solutions often demand white-box access to the model or substantial training, ...
Large language models (LLMs) have shown significant achievements in solving a wide range of tasks. Recently, LLMs' capability to store, retrieve and infer with symbolic knowledge has drawn a great deal of attentio...
Many datasets have been developed to train and evaluate document-level relation extraction (RE) models. Most of these are constructed using real-world data. It has been shown that RE models trained on real-world data ...
This systematic review explores the integration and impact of Artificial Intelligence in English as a Foreign Language (EFL) teaching in schools, evaluating the effectiveness, challenges, and pedagogical implications of AI-driven tools. After screening 189 studies from seven databases, 22 relevant empirical studies focusing on experiential learning outcomes with AI use were selected, following PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. The findings highlight AI's transformative impact on school-based EFL education, offering tailored, interactive experiences. Students using AI tools showed significant improvements in reading, writing, listening, speaking, vocabulary, and overall language comprehension compared to traditional methods. Improvements in language proficiency align with all three domains of Bloom's Taxonomy. Tools based on natural language processing and Intelligent Tutoring Systems enhance instruction but struggle with language nuances and cultural contexts. Challenges such as the digital divide, literacy gaps, teacher readiness and role confusion, cognitive load, and context-specific adaptation persist. Addressing these requires robust infrastructure, teacher training, and institutional support. The review offers valuable insights for teachers, policymakers, and researchers dedicated to advancing school-based EFL education with innovative AI solutions.
Despite progress in multimodal large language models (MLLMs), the challenge of interpreting long-form videos in response to linguistic queries persists, largely due to the inefficiency in temporal grounding and limite...
Autoregressive (AR) encoder-decoder neural networks have proved successful in many NLP problems, including Semantic Parsing - a task that translates natural language to machine-readable parse trees. However, the seque...
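As context for the sequential bottleneck this snippet alludes to, here is a generic sketch of greedy autoregressive decoding (not the paper's architecture; the `model(src_ids, tgt_ids) -> logits` interface is an assumption), in which each output token must wait on all previous ones:

```python
# Generic greedy autoregressive decoding sketch (illustrative only).
# The model(src_ids, tgt_ids) -> logits interface is an assumption.
import torch

def greedy_decode(model, src_ids: torch.Tensor, bos_id: int, eos_id: int,
                  max_len: int = 64) -> list[int]:
    out = [bos_id]
    for _ in range(max_len):
        tgt = torch.tensor([out])              # tokens emitted so far
        logits = model(src_ids, tgt)           # shape: (1, len(out), vocab)
        next_id = int(logits[0, -1].argmax())  # most likely next token
        out.append(next_id)                    # step t depends on steps < t
        if next_id == eos_id:
            break
    return out
```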