ISBN (print): 9798891760608
With the rapid development of NLP, large language models (LLMs) now excel at a wide range of tasks across many domains. However, existing benchmarks may not adequately measure these models' capabilities, especially when faced with new knowledge. In this paper, we address the lack of benchmarks for evaluating LLMs' ability to handle new knowledge, an important and challenging aspect of the rapidly evolving world. We propose an approach called KnowGen that generates new knowledge by altering existing entity attributes and relationships, resulting in artificial entities that are distinct from real-world entities. With KnowGen, we introduce a benchmark named ALCUNA to assess LLMs' abilities in knowledge understanding, differentiation, and association. We benchmark several LLMs and find that their performance in the face of new knowledge is not satisfactory, particularly in reasoning between new and internal knowledge. We also explore the impact of entity similarity on the model's understanding of entity knowledge and the influence of contextual entities. We appeal for caution when using LLMs in new scenarios or with new knowledge, and hope that our benchmark can help drive the development of LLMs in the face of new knowledge.
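The KnowGen implementation is not reproduced in this listing; as a rough, hypothetical sketch of the idea (deriving an artificial entity by perturbing a real entity's attributes and relations), assuming a simple dict-based entity schema and an attribute_pool of plausible substitute values:

    import copy
    import random

    # Hypothetical illustration of the KnowGen idea: derive an artificial
    # entity by perturbing the attributes and relations of a real one.
    # The entity schema, attribute pools, and sampling strategy below are
    # assumptions, not the authors' implementation.

    def make_artificial_entity(entity: dict, attribute_pool: dict, rng=random):
        """Return a new entity that differs from `entity` in some attributes
        and relations, so it cannot appear in the model's training data."""
        new_entity = copy.deepcopy(entity)
        new_entity["name"] = entity["name"] + "-X"  # invented name
        # Swap one attribute value for another plausible value of the same
        # attribute type, drawn from sibling entities.
        attr = rng.choice(list(entity["attributes"]))
        candidates = [v for v in attribute_pool[attr]
                      if v != entity["attributes"][attr]]
        if candidates:
            new_entity["attributes"][attr] = rng.choice(candidates)
        # Rewire one relation to point at a different target entity.
        if entity["relations"]:
            rel, _ = rng.choice(list(entity["relations"].items()))
            new_entity["relations"][rel] = rng.choice(attribute_pool["entities"])
        return new_entity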
Large Language Models (LLMs) suffer from a huge number of parameters, which restricts their deployment on edge devices. Weight sharing is one promising solution that encourages weight reuse, effectively reducing memory ...
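The excerpt does not show this paper's actual sharing scheme; as a minimal illustrative sketch of one common form of weight sharing (ALBERT-style cross-layer tying in PyTorch):

    import torch.nn as nn

    # One physical transformer layer reused `depth` times: the parameters
    # are stored once, so memory scales with a single layer rather than
    # with the full depth. Dimensions here are arbitrary placeholders.

    class SharedLayerEncoder(nn.Module):
        def __init__(self, d_model: int = 256, n_heads: int = 4, depth: int = 12):
            super().__init__()
            self.layer = nn.TransformerEncoderLayer(d_model, n_heads,
                                                    batch_first=True)
            self.depth = depth

        def forward(self, x):
            for _ in range(self.depth):
                x = self.layer(x)  # same weights applied at every depth
            return x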
While learning to align Large Language Models (LLMs) with human preferences has shown remarkable success, aligning these models to meet diverse user preferences presents further challenges in preserving previous k...
ISBN (print): 9798891760608
The practice of transferring knowledge from a sophisticated, proprietary large language model (LLM) to a compact, open-source LLM has garnered considerable attention. Previous works have focused on unidirectional knowledge distillation, aligning the responses of the student model with those of the teacher model on a set of instructions. Nevertheless, they overlooked the possibility of incorporating "feedback" (identifying challenging instructions where the student model's performance falls short) to boost the student model's proficiency iteratively. To this end, we propose a novel adversarial distillation framework for more efficient knowledge transfer. Leveraging the versatile role adaptability of LLMs, we prompt the teacher model to identify "hard" instructions and generate new "hard" instructions for the student model, creating a three-stage adversarial loop of imitation, discrimination, and generation. By applying this adversarial framework, we successfully transfer knowledge from ChatGPT to a student model (named Lion) using a mere 70k training examples. Our results show that Lion-13B not only achieves open-ended generation capabilities comparable to ChatGPT but also surpasses conventional state-of-the-art (SOTA) instruction-tuned models such as Vicuna-13B by 55.4% on challenging zero-shot reasoning benchmarks such as BIG-Bench Hard (BBH) and by 16.7% on AGIEval.
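A schematic of the three-stage adversarial loop described above, written as Python pseudocode. The teacher/student objects and every method name (generate, finetune, score, generate_similar) are hypothetical placeholders, not the authors' released interface:

    # Sketch of the imitation / discrimination / generation loop, under
    # the assumption that `teacher` and `student` expose the placeholder
    # methods used below.

    def adversarial_distillation(teacher, student, instructions, rounds: int = 3):
        pool = list(instructions)
        for _ in range(rounds):
            # 1. Imitation: align student responses with teacher responses.
            pairs = [(ins, teacher.generate(ins)) for ins in pool]
            student.finetune(pairs)
            # 2. Discrimination: the teacher scores the student's answers
            #    and flags "hard" instructions where the student falls short.
            hard = [ins for ins, ref in pairs
                    if teacher.score(ins, student.generate(ins), ref) < 0.5]
            # 3. Generation: the teacher writes new hard instructions in the
            #    style of the flagged ones, growing the training pool.
            pool = hard + [teacher.generate_similar(ins) for ins in hard]
        return student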
Large Language Models (LLMs) have demonstrated impressive performance across various tasks. However, current training approaches combine standard cross-entropy loss with extensive data, human feedback, or ad hoc metho...
ISBN (print): 9798891760608
Automated Essay Scoring (AES) aims to automatically assess the quality of essays. Automation enables large-scale assessment and improvements in consistency, reliability, and standardization. These characteristics are of particular relevance in the context of language certification exams. However, a major bottleneck in the development of AES systems is the availability of corpora, which, unfortunately, are scarce, especially for languages other than English. In this paper, we aim to foster the development of AES for French by providing the TCFLE-8 corpus, a corpus of 6.5k essays collected in the context of the Test de Connaissance du Français (TCF, French Knowledge Test) certification exam. We report the strict quality procedure that led to the scoring of each essay by at least two raters according to the levels of the Common European Framework of Reference for Languages (CEFR) and to the creation of a balanced corpus. In addition, we describe how the linguistic properties of the essays relate to the learners' proficiency in TCFLE-8. We also advance the state-of-the-art performance for the AES task in French by experimenting with two strong baselines (RoBERTa and feature-based). Finally, we discuss the challenges of AES using TCFLE-8.
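As a toy illustration of the kind of feature-based baseline mentioned above (hand-crafted linguistic features fed to a linear model), assuming scikit-learn is available; the actual TCFLE-8 feature set is far richer and the data below is invented:

    import numpy as np
    from sklearn.linear_model import Ridge

    def features(essay: str) -> np.ndarray:
        """A few classic proficiency proxies computed from raw text."""
        tokens = essay.split()
        types = set(t.lower() for t in tokens)
        n_sents = max(essay.count(".") + essay.count("!") + essay.count("?"), 1)
        return np.array([
            len(tokens),                                        # essay length
            len(types) / max(len(tokens), 1),                   # type-token ratio
            len(tokens) / n_sents,                              # mean sentence length
            sum(len(t) for t in tokens) / max(len(tokens), 1),  # mean word length
        ])

    # Illustrative training pair: essays with CEFR levels mapped to numbers.
    essays = ["Je suis etudiant .", "La linguistique est une discipline vaste ."]
    scores = [1.0, 4.0]
    model = Ridge().fit(np.stack([features(e) for e in essays]), scores)
    print(model.predict(features("Un texte court .").reshape(1, -1)))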
The capability to reason from text is crucial for real-world NLP applications. Real-world scenarios often involve incomplete or evolving data. In response, individuals update their beliefs and understandings according...
Location-based services play a critical role in improving the quality of our daily lives. Despite the proliferation of numerous specialized AI models within the spatio-temporal context of location-based services, these mo...
ISBN (print): 9798891760608
Evaluating the factual consistency of automatically generated summaries is essential for the progress and adoption of reliable summarization systems. Despite recent advances, existing factuality evaluation models are not robust, being especially prone to entity and relation errors in new domains. We propose FACTKB, a simple new approach to factuality evaluation that is generalizable across domains, in particular with respect to entities and relations. FACTKB is based on language models pretrained on facts extracted from external knowledge bases. We introduce three types of complementary factuality pretraining objectives based on entity-specific facts, facts extracted from auxiliary knowledge about entities, and facts constructed compositionally through knowledge base walks. The resulting factuality evaluation model achieves state-of-the-art performance on two in-domain news summarization benchmarks as well as on three out-of-domain scientific literature datasets. Further analysis shows that FACTKB is better at detecting erroneous entities and relations in summaries and is robust and easily generalizable across domains. Code and data are available at https://***/BunsenFeng/FactKB.
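An illustrative, hypothetical construction of the three kinds of factuality pretraining text described above, built from a toy knowledge base of (head, relation, tail) triples; the templates and sampling are assumptions, not the FACTKB implementation:

    import random

    kb = [
        ("Marie Curie", "field", "physics"),
        ("Marie Curie", "award", "Nobel Prize"),
        ("Nobel Prize", "awarded_in", "Stockholm"),
    ]

    def verbalize(h, r, t):
        return f"{h} {r.replace('_', ' ')} {t}."

    def entity_facts(entity):
        # 1. Entity-specific facts: verbalize the entity's own triples.
        return [verbalize(h, r, t) for h, r, t in kb if h == entity]

    def auxiliary_facts(entity):
        # 2. Auxiliary knowledge: facts about entities this entity links to.
        tails = [t for h, _, t in kb if h == entity]
        return [verbalize(h, r, t) for h, r, t in kb if h in tails]

    def kb_walk(entity, length=2):
        # 3. Compositional facts via a random walk through the KB.
        path, cur = [], entity
        for _ in range(length):
            nexts = [(r, t) for h, r, t in kb if h == cur]
            if not nexts:
                break
            r, t = random.choice(nexts)
            path.append(verbalize(cur, r, t))
            cur = t
        return " ".join(path)

    print(entity_facts("Marie Curie"))
    print(auxiliary_facts("Marie Curie"))
    print(kb_walk("Marie Curie"))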
ISBN (print): 9798891760615
Negation is a fundamental component of natural language that reverses the semantic meaning of a sentence. It plays an extremely important role across a wide range of applications, yet it is under-represented in pre-trained language models (LMs), often resulting in wrong inferences. In this work, we try to improve the understanding of negation in pre-trained LMs. To augment negation understanding, we propose a language model objective with a weighted cross-entropy loss and elastic weight consolidation regularization. With negation augmentation, we reduce the mean top-1 error rate on the negated LAMA dataset to 1.1% for BERT-base, 0.78% for BERT-large, 3.74% for RoBERTa-base, and 0.01% for RoBERTa-large, outperforming existing negation models. This improves on the original BERT and RoBERTa models by margins of 8% and 6%, respectively. We also provide empirical evidence that negation-augmented models outperform the classical models on both the original and negation benchmarks for natural language inference tasks.
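A minimal sketch of the stated objective in PyTorch: a token-weighted cross-entropy term plus an elastic weight consolidation (EWC) penalty that keeps parameters near their pretrained values in proportion to an (approximate) Fisher importance. The weighting scheme and ewc_lambda are assumptions, not the paper's exact settings:

    import torch
    import torch.nn.functional as F

    def negation_aware_loss(logits, targets, token_weights,
                            params, ref_params, fisher, ewc_lambda=0.1):
        # Weighted cross-entropy: e.g., up-weight tokens in negation scope.
        ce = F.cross_entropy(logits, targets, reduction="none")
        loss = (token_weights * ce).mean()
        # EWC regularizer: quadratic penalty on drift from the pretrained
        # parameters, scaled per-parameter by Fisher importance.
        ewc = sum((f * (p - p0).pow(2)).sum()
                  for p, p0, f in zip(params, ref_params, fisher))
        return loss + ewc_lambda * ewc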