Recently, Large Language Models (LLMs) have shown impressive language capabilities, yet most of them exhibit very unbalanced performance across different languages. Multilingual alignment based on the translation paral...
ISBN: (Print) 9798891760608
Large-scale vision-language models such as CLIP have shown impressive performance on zero-shot image classification and image-to-text retrieval. However, such performance does not carry over to tasks that require a finer-grained correspondence between vision and language, such as Visual Question Answering (VQA). As a potential cause of the difficulty of applying these models to VQA and similar tasks, we report an interesting phenomenon of vision-language models, which we call the Concept Association Bias (CAB). We find that models with CAB tend to treat the input as a bag of concepts and attempt to fill in the other missing concept cross-modally, leading to an unexpected zero-shot prediction. We demonstrate CAB by showing that CLIP's zero-shot classification performance greatly suffers when there is a strong concept association between an object (e.g. eggplant) and an attribute (e.g. the color purple). We also show that the strength of CAB predicts the performance on VQA. We observe that CAB is prevalent in vision-language models trained with contrastive losses, even when autoregressive losses are jointly employed. However, a model that relies solely on an autoregressive loss seems to exhibit minimal or no signs of CAB.
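The CAB probe described in this abstract boils down to ordinary zero-shot classification with CLIP, scored on images where an object's actual attribute conflicts with its prototypical one. Below is a minimal sketch of such a probe using the Hugging Face CLIP interface; the image file, prompt wording, and the recolored-eggplant setup are illustrative assumptions rather than the paper's exact protocol. Under CAB, the model would be expected to lean toward "purple" (the color strongly associated with eggplants) even when the pictured eggplant is red.

```python
# Minimal zero-shot color probe in the spirit of the CAB setup.
# Assumption: a local image "eggplant_painted_red.jpg" showing an eggplant
# recolored red; the prompts below are illustrative only.
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("eggplant_painted_red.jpg")
prompts = ["a photo of a red object", "a photo of a purple object"]

inputs = processor(text=prompts, images=image, return_tensors="pt", padding=True)
outputs = model(**inputs)

# Image-text similarity scores turned into probabilities over the two prompts.
probs = outputs.logits_per_image.softmax(dim=-1)[0]
for prompt, p in zip(prompts, probs.tolist()):
    print(f"{prompt}: {p:.3f}")
```

A CAB-style evaluation would aggregate such probabilities over many object-attribute pairs and compare accuracy on images where the attribute matches the object's typical association against images where it conflicts.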
ISBN: (Print) 9798891760608
Inspired by the superior language abilities of large language models (LLMs), large vision-language models (LVLMs) have recently been proposed that integrate powerful LLMs to improve performance on complex multimodal tasks. Despite the promising progress on LVLMs, we find that they suffer from object hallucination, i.e., they tend to generate objects in their descriptions that are inconsistent with the target images. To investigate this issue, this work presents the first systematic study of object hallucination in LVLMs. We conduct evaluation experiments on several representative LVLMs and show that they mostly suffer from severe object hallucination issues. We further discuss how the visual instructions may influence hallucination and find that objects which frequently appear in the visual instructions or co-occur with the image objects are clearly more prone to being hallucinated by LVLMs. In addition, we design a polling-based query method called POPE for better evaluation of object hallucination. Experimental results show that POPE can evaluate object hallucination in a more stable and flexible way.
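The polling idea behind POPE can be pictured as a batch of yes/no existence questions posed to the LVLM and scored against ground-truth object annotations. The sketch below assumes a hypothetical `ask_lvlm` wrapper around whichever model is being evaluated; POPE's actual prompt templates and negative-object sampling strategies (random, popular, adversarial) are not reproduced here.

```python
# A minimal polling-style hallucination probe in the spirit of POPE.
# `ask_lvlm` is a hypothetical stand-in for a real LVLM inference call.

def ask_lvlm(image_path: str, question: str) -> str:
    """Return the model's free-form answer to a question about the image."""
    raise NotImplementedError  # replace with a real LVLM call

def poll_objects(image_path, present_objects, absent_objects):
    """Ask yes/no existence questions and score them against ground truth."""
    tp = fp = fn = tn = 0
    labeled = [(o, True) for o in present_objects] + [(o, False) for o in absent_objects]
    for obj, is_present in labeled:
        answer = ask_lvlm(image_path, f"Is there a {obj} in the image?")
        said_yes = answer.strip().lower().startswith("yes")
        if said_yes and is_present:
            tp += 1
        elif said_yes and not is_present:
            fp += 1  # a "yes" for an absent object counts as a hallucination
        elif not said_yes and is_present:
            fn += 1
        else:
            tn += 1
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}
```

Because the answers are constrained to yes/no, the resulting scores are far less sensitive to phrasing than caption-based hallucination metrics, which is what makes the polling formulation more stable.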
Knowledge claims are abundant in the literature on large language models (LLMs); but can we say that GPT-4 truly "knows" the Earth is round? To address this question, we review standard definitions of knowled...
Large vision-language models (LVLMs) suffer from hallucination, resulting in misalignment between the output textual response and the input visual content. Recent research indicates that the over-reliance on the Large...
The proliferation of open-source Large Language Models (LLMs) underscores the pressing need for evaluation methods. Existing works primarily rely on external evaluators, focusing on training and prompting strategies. ...
The disconnect between tokenizer creation and model training in language models allows for specific inputs, such as the infamous SolidGoldMagikarp token, to induce unwanted model behaviour. Although such 'glitch t...
Compilers are complex software containing millions of lines of code, taking years to develop. This paper investigates to what extent Large Language Models (LLMs) can replace hand-crafted compilers in translating high-...
ISBN: (Print) 9798891760608
Despite the remarkable ability of large language models (LMs) to comprehend and generate language, they have a tendency to hallucinate and create factually inaccurate output. Augmenting LMs by retrieving information from external knowledge resources is one promising solution. Most existing retrieval-augmented LMs employ a retrieve-and-generate setup that retrieves information only once, based on the input. This is limiting, however, in more general scenarios involving the generation of long texts, where continually gathering information throughout generation is essential. In this work, we provide a generalized view of active retrieval augmented generation: methods that actively decide when and what to retrieve over the course of generation. We propose Forward-Looking Active REtrieval augmented generation (FLARE), a generic method which iteratively uses a prediction of the upcoming sentence to anticipate future content; this prediction is then used as a query to retrieve relevant documents and regenerate the sentence if it contains low-confidence tokens. We test FLARE along with baselines comprehensively over 4 long-form knowledge-intensive generation tasks/datasets. FLARE achieves superior or competitive performance on all tasks, demonstrating the effectiveness of our method.
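The core loop of FLARE as described in this abstract can be sketched as follows. The `generate_sentence` and `retrieve` functions, the confidence threshold, and the stopping rule are hypothetical placeholders rather than the paper's actual components, and the sketch omits FLARE's query-refinement variants (e.g. masking or rephrasing low-confidence spans).

```python
# A minimal sketch of a forward-looking active retrieval loop in the spirit of FLARE.
# All functions and the threshold below are hypothetical stand-ins.

CONF_THRESHOLD = 0.6  # assumed value; the real threshold would be tuned per task

def generate_sentence(prompt: str):
    """Return (next_sentence, per_token_probabilities) from a language model."""
    raise NotImplementedError  # replace with a real LM call

def retrieve(query: str, k: int = 5):
    """Return the top-k documents for the query from a retriever."""
    raise NotImplementedError  # replace with a real retriever

def flare_generate(question: str, max_sentences: int = 10) -> str:
    answer = ""
    for _ in range(max_sentences):
        # Tentatively predict the upcoming sentence without extra evidence.
        draft, token_probs = generate_sentence(question + answer)
        if not draft:
            break
        if min(token_probs, default=1.0) < CONF_THRESHOLD:
            # Low confidence: treat the draft as a forward-looking query,
            # retrieve supporting documents, and regenerate the sentence.
            context = "\n".join(retrieve(draft))
            draft, _ = generate_sentence(context + "\n\n" + question + answer)
        answer += draft
    return answer
```

The key difference from a one-shot retrieve-and-generate pipeline is that retrieval is triggered repeatedly during generation, and the query is the model's own guess about the upcoming content rather than the original input.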
The training process of large language models (LLMs) often involves varying degrees of test data contamination (Yang et al., 2023b). Although current LLMs are achieving increasingly better performance on various benchm...