Human label variation exists in many naturallanguageprocessing (NLP) tasks, including naturallanguage inference (NLI). To gain direct evidence of how NLI label variation arises, we build LIVENLI, an English dataset...
详细信息
ISBN:
(纸本)9798891760615
Human label variation exists in many naturallanguageprocessing (NLP) tasks, including naturallanguage inference (NLI). To gain direct evidence of how NLI label variation arises, we build LIVENLI, an English dataset of 1,415 ecologically valid explanations (annotators explain the NLI labels they chose) for 122 MNLI (Williams et al., 2018) items (at least 10 explanations per item). The LIVENLI explanations confirm that people can systematically vary on their interpretation and highlight within-label variation: annotators sometimes choose the same label for different reasons. This suggests that explanations are crucial for navigating label interpretations in general. We few-shot prompt language models (LMs) to generate explanations but the results are inconsistent: the models sometimes produce valid and informative explanations, but they also generate implausible ones that do not support the label, highlighting directions for improvement.
naturallanguage is a powerful complementary modality of communication for data visualizations, such as bar and line charts. To facilitate chart-based reasoning using naturallanguage, various downstream tasks have be...
详细信息
Query rewriting is a crucial technique for passage retrieval in open-domain conversational question answering (CQA). It decontexualizes conversational queries into self-contained questions suitable for off-the-shelf r...
详细信息
Large language Models (LLMs) exhibit impressive zero/few-shot inference and generation quality for high-resource languages (HRLs). A few of them have been trained on low-resource languages (LRLs) and give decent perfo...
详细信息
Modern large language models (LLMs) like ChatGPT have shown remarkable performance on general language tasks but still struggle on complex reasoning tasks, which drives the research on cognitive behaviors of LLMs to e...
详细信息
In contemporary society, the advent of the digital economy is swiftly emerging as a novel catalyst propelling global economic advancement. Against this backdrop, the adept management of vast economic datasets has beco...
详细信息
ISBN:
(纸本)9798400709760
In contemporary society, the advent of the digital economy is swiftly emerging as a novel catalyst propelling global economic advancement. Against this backdrop, the adept management of vast economic datasets has become a focal point of inquiry. Leveraging its robust data processing and learning capabilities, artificial intelligence algorithms offer fresh perspectives on the analysis and application of digital economic data. This discourse introduces sophisticated algorithms such as kernel principal component analysis, time series analysis, and support vector machines, culminating in the conception and realization of a data processing framework. empirical analysis of real-world data validates the efficacy of the system, juxtaposed with comparative analyses to delve into the merits and limitations of various algorithms in digital economic data management. Ultimately, this research endeavors to furnish the domain of digital economics with scientifically grounded data processing methodologies and pragmatic strategic solutions.
Authorship verification (AV) is a fundamental task in naturallanguageprocessing (NLP) and computational linguistics, with applications in forensic analysis, plagiarism detection, and identification of deceptive cont...
详细信息
ISBN:
(纸本)9798891760615
Authorship verification (AV) is a fundamental task in naturallanguageprocessing (NLP) and computational linguistics, with applications in forensic analysis, plagiarism detection, and identification of deceptive content. Existing AV techniques, including traditional stylometric and deep learning approaches, face limitations in terms of data requirements and lack of explainability. To address these limitations, this paper proposes PromptAV, a novel technique that leverages Large-language Models (LLMs) for AV by providing step-by-step stylometric explanation prompts. PromptAV outperforms state-of-the-art baselines, operates effectively with limited training data, and enhances interpretability through intuitive explanations, showcasing its potential as an effective and interpretable solution for the AV task.
Large Multimodal Models (LMMs) have achieved strong performance across a range of vision and language tasks. However, their spatial reasoning capabilities are under-investigated. In this paper, we construct a novel VQ...
详细信息
Recent work has explored the capability of large language models (LLMs) to identify and correct errors in LLM-generated responses. These refinement approaches frequently evaluate what sizes of models are able to do re...
详细信息
In many PDF documents, the reading order of text blocks is missing, which can hinder machine understanding of the document's content. Existing works try to extract one universal reading order for a PDF file. Howev...
详细信息
暂无评论