RL-based techniques can be employed to search for prompts that, when fed into a target language model, maximize a set of user-specified reward functions. However, in many target applications, the natural reward functions are in tens...
Leading models for the text-to-SQL task heavily rely on proprietary Large Language Models (LLMs), posing concerns over data privacy. Closing the performance gap between small open-source models and large proprietary models is cruc...
Gender-fair language, an evolving German linguistic variation, fosters inclusion by addressing all genders or using neutral forms. Nevertheless, there is a significant lack of resources to assess the impact of this li...
The advent of large language models (LLMs) like GPT-4 has catalyzed the exploration of multi-task learning (MTL), in which a single model demonstrates proficiency across diverse tasks. Task arithmetic has emerged as a...
Having been trained on massive pretraining data, large language models have shown excellent performance on many knowledge-intensive tasks. However, pretraining data tends to contain misleading and even conflicting inf...
Recent advances in machine learning have significantly impacted the field of information extraction, with Language Models (LMs) playing a pivotal role in extracting structured information from unstructured text. Prior...
Large language models (LLMs) have shown surprisingly good performance in multilingual neural machine translation (MNMT) even without being explicitly trained for translation. Yet, they still struggle with translating l...
ISBN (Print): 9798891760608
Large Language Models (LLMs) have demonstrated significant ability in various natural language processing tasks. However, their effectiveness is highly dependent on the phrasing of the task prompt, leading to research on automatic prompt optimization using labeled task data. We reveal that these prompt optimization techniques are vulnerable to distribution shifts such as subpopulation shifts, which are common for LLMs in real-world scenarios such as customer review analysis. In this light, we propose a new problem of robust prompt optimization for LLMs against distribution shifts, which requires that the prompt optimized over the labeled source group simultaneously generalize to an unlabeled target group. To solve this problem, we propose the Generalized Prompt Optimization framework, which incorporates unlabeled data from the target group into prompt optimization. Extensive experimental results demonstrate the effectiveness of the proposed framework, with significant performance improvement on the target group and comparable performance on the source group.
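Since the abstract does not specify how the unlabeled target-group data is incorporated, the following is only a minimal sketch of one plausible setup, not the paper's actual Generalized Prompt Optimization algorithm: candidate prompts are scored by labeled accuracy on the source group plus a label-free self-consistency proxy on the unlabeled target group. The name select_prompt, the llm callable, and the weight alpha are illustrative assumptions.

# Hedged sketch: one way unlabeled target data *could* enter prompt selection.
# Not the paper's method; llm, alpha, and all names below are assumptions.
from collections import Counter
from typing import Callable, Sequence, Tuple

def select_prompt(
    candidate_prompts: Sequence[str],
    source_data: Sequence[Tuple[str, str]],   # labeled (input, gold answer) pairs from the source group
    target_inputs: Sequence[str],             # unlabeled inputs from the target group
    llm: Callable[[str], str],                # black-box call to the target LLM (assumed to sample)
    alpha: float = 0.5,                       # weight on the target-group proxy term
) -> str:
    """Pick the candidate prompt that balances source-group accuracy with a
    self-consistency proxy computed on the unlabeled target group."""

    def source_accuracy(prompt: str) -> float:
        # Exact-match accuracy on the labeled source group.
        hits = sum(llm(f"{prompt}\n{x}") == y for x, y in source_data)
        return hits / max(len(source_data), 1)

    def target_consistency(prompt: str) -> float:
        # Label-free proxy: how often repeated (sampled) generations agree.
        agree = 0.0
        for x in target_inputs:
            answers = [llm(f"{prompt}\n{x}") for _ in range(3)]
            agree += Counter(answers).most_common(1)[0][1] / len(answers)
        return agree / max(len(target_inputs), 1)

    return max(
        candidate_prompts,
        key=lambda p: (1 - alpha) * source_accuracy(p) + alpha * target_consistency(p),
    )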
Large Language Models (LLMs) have succeeded considerably in In-Context Learning (ICL)-based summarization. However, saliency is subject to the users' specific preference histories. Hence, we need reliable In-Conte...
The high power consumption and latency-sensitive deployments of large language models (LLMs) have motivated efficiency techniques like quantization and sparsity. Contextual sparsity, where the sparsity pattern is input-dependent, is c...