ISBN (print): 9798891760608
Methods for adapting language models (LMs) to new tasks and domains have traditionally assumed white-box access to the model and work by modifying its parameters. However, this is incompatible with a recent trend in the field, where the highest quality models are only available as black boxes through inference APIs. Even when the model weights are available, the computational cost of fine-tuning large LMs can be prohibitive for most practitioners. In this work, we present a lightweight method for adapting large LMs to new domains and tasks, assuming no access to their weights or intermediate activations. Our approach fine-tunes a small white-box LM and combines it with the large black-box LM at the probability level through a small network, learned on a small validation set. We validate our approach by adapting a large LM (OPT-30B) to several domains and a downstream task (machine translation), observing improved performance in all cases, of up to 9%, while using a domain expert 23x smaller.
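As a rough illustration of the probability-level combination described above, the sketch below mixes the next-token distributions of a small domain expert and a large black-box LM through a tiny learned network. The `Combiner` module and its entropy features are illustrative assumptions, not the paper's actual architecture; such a combiner would be trained by minimizing negative log-likelihood on the small validation set.

```python
# Hypothetical sketch: mix two next-token distributions via a small
# learned gate. Feature choice and architecture are assumptions.
import torch
import torch.nn as nn

class Combiner(nn.Module):
    def __init__(self, n_features: int = 2):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(n_features, 16), nn.ReLU(),
            nn.Linear(16, 1), nn.Sigmoid(),
        )

    def forward(self, p_small: torch.Tensor, p_large: torch.Tensor) -> torch.Tensor:
        # Use each distribution's entropy as a cheap confidence feature.
        ent_s = -(p_small * p_small.clamp_min(1e-9).log()).sum(-1, keepdim=True)
        ent_l = -(p_large * p_large.clamp_min(1e-9).log()).sum(-1, keepdim=True)
        lam = self.net(torch.cat([ent_s, ent_l], dim=-1))  # gate in (0, 1)
        # A convex combination stays a valid probability distribution.
        return lam * p_small + (1.0 - lam) * p_large
```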
ISBN (print): 9798891760608
Large Language Models (LLMs) are proficient in natural language processing tasks, but their deployment is often restricted by extensive parameter sizes and computational demands. This paper focuses on post-training quantization (PTQ) in LLMs, specifically 4-bit weight and 8-bit activation (W4A8) quantization, to enhance computational efficiency, a topic less explored than weight-only quantization. We present two innovative techniques: activation-quantization-aware scaling (AQAS) and sequence-length-aware calibration (SLAC), which enhance PTQ by considering the combined effects on weights and activations and by aligning calibration sequence lengths to target tasks. Moreover, we introduce dINT, a hybrid data format combining integer and denormal representations, to address the underflow issue in W4A8 quantization, where small values are rounded to zero. Through rigorous evaluations of LLMs, including OPT and LLaMA, we demonstrate that our techniques significantly boost task accuracies to levels comparable with full-precision models. By developing arithmetic units compatible with dINT, we further confirm that our methods yield a 2x hardware efficiency improvement over an 8-bit integer MAC unit.
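To make the underflow issue concrete, the toy sketch below applies plain symmetric INT4 rounding to a weight vector: values much smaller than the quantization scale collapse to zero, which is precisely the failure mode the dINT format is designed to mitigate. This is generic uniform quantization for illustration, not AQAS, SLAC, or dINT itself.

```python
# Plain symmetric 4-bit quantization (illustrative, not the paper's method).
import numpy as np

def quantize_w4(w: np.ndarray):
    """Per-tensor symmetric quantization to integers in [-8, 7]."""
    scale = np.abs(w).max() / 7.0              # largest magnitude maps to 7
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale

w = np.array([0.50, 0.01, -0.02, 0.30])        # two near-zero weights
q, scale = quantize_w4(w)
print(q)          # -> [7 0 0 4]: the small weights underflow to zero
print(q * scale)  # dequantized values; 0.01 and -0.02 are lost entirely
```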
ISBN (print): 9798891760608
A trustworthy real-world prediction system should produce well-calibrated confidence scores; that is, its confidence in an answer should be indicative of the likelihood that the answer is correct, enabling deferral to an expert in cases of low-confidence predictions. Recent studies have shown that unsupervised pre-training produces large language models (LMs) whose conditional probabilities are remarkably well-calibrated. However, the most widely used LMs are fine-tuned with reinforcement learning from human feedback (RLHF-LMs), and some studies have suggested that RLHF-LMs produce conditional probabilities that are very poorly calibrated. In light of this perceived weakness, we conduct a broad evaluation of methods for extracting confidence scores from RLHF-LMs. For RLHF-LMs such as ChatGPT, GPT-4, and Claude, we find that verbalized confidences emitted as output tokens are typically better calibrated than the model's conditional probabilities on the TriviaQA, SciQ, and TruthfulQA benchmarks, often reducing the expected calibration error by a relative 50%.
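For reference, expected calibration error, the metric the abstract reports, can be computed with a standard equal-width binning scheme as sketched below; the bin count and binning strategy here are common defaults, not necessarily those used in the paper.

```python
import numpy as np

def expected_calibration_error(conf, correct, n_bins=10):
    """conf: confidences in [0, 1]; correct: 0/1 correctness indicators."""
    conf, correct = np.asarray(conf, float), np.asarray(correct, float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for i in range(n_bins):
        lo, hi = edges[i], edges[i + 1]
        mask = (conf >= lo) & (conf <= hi) if i == n_bins - 1 else (conf >= lo) & (conf < hi)
        if mask.any():
            # Gap between average confidence and accuracy, weighted by bin size.
            ece += mask.mean() * abs(conf[mask].mean() - correct[mask].mean())
    return ece

# e.g. verbalized confidences ("I'm 90% sure") parsed from model outputs:
print(expected_calibration_error([0.9, 0.6, 0.8], [1, 0, 1]))
```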
ISBN (print): 9798891760608
Visual Word Sense Disambiguation (VWSD) is a novel and challenging task whose goal is to retrieve, from a set of candidates, the image that best represents the meaning of an ambiguous word within a given context. In this paper, we make a substantial step towards unveiling this interesting task by applying a varied set of approaches. Since VWSD is primarily a text-image retrieval task, we explore the latest transformer-based methods for multimodal retrieval. Additionally, we utilize Large Language Models (LLMs) as knowledge bases to enrich the given phrases and resolve ambiguity related to the target word. We also study VWSD as a unimodal problem by converting it to text-to-text and image-to-image retrieval, as well as question answering (QA), to fully explore the capabilities of the relevant models. To tap into the implicit knowledge of LLMs, we experiment with Chain-of-Thought (CoT) prompting to guide explainable answer generation. Finally, we train a learning-to-rank (LTR) model to combine our different modules, achieving competitive ranking results. Extensive experiments on VWSD yield valuable insights that can effectively drive future directions.
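Since VWSD is framed above as text-image retrieval, the minimal sketch below shows the core scoring step: rank candidate images by cosine similarity between their embeddings and the embedding of the (LLM-enriched) phrase. The embedding model (e.g., a CLIP-style encoder) is assumed and not shown.

```python
import numpy as np

def rank_candidates(text_emb: np.ndarray, image_embs: np.ndarray) -> np.ndarray:
    """Return candidate indices ordered best-to-worst by cosine similarity."""
    t = text_emb / np.linalg.norm(text_emb)
    im = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    return np.argsort(-(im @ t))               # higher similarity first

text_emb = np.random.rand(512)                 # enriched-phrase embedding
image_embs = np.random.rand(10, 512)           # 10 candidate image embeddings
print(rank_candidates(text_emb, image_embs)[:3])  # top-3 candidate indices
```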
Parameter-efficient fine-tuning (PEFT) methods are increasingly used with pre-trained language models (PLMs) for continual learning (CL). These methods typically involve training a PEFT module for each new task and em...
ISBN (print): 9798891760608
We propose task-adaptive tokenization as a way to adapt the generation pipeline to the specifics of a downstream task and enhance long-form generation in mental health. Inspired by insights from cognitive science, our task-adaptive tokenizer samples variable segmentations from multiple outcomes, with sampling probabilities optimized based on task-specific data. We introduce a strategy for building a specialized vocabulary, along with a vocabulary-merging protocol that allows for the integration of task-specific tokens into the pre-trained model's tokenization step. Through extensive experiments on psychological question-answering tasks in both Chinese and English, we find that our task-adaptive tokenization approach brings a significant improvement in generation performance while using up to 60% fewer tokens. Preliminary experiments point to promising results when using our tokenization approach with very large language models.
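A heavily simplified stand-in for the vocabulary-merging step, using the Hugging Face `transformers` API: new task-specific tokens are appended to a pre-trained tokenizer and the model's embedding table is resized to match. The paper's actual protocol (sampled segmentations and merge rules) is more involved, and the example tokens here are made up.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Hypothetical task-specific vocabulary for a mental-health QA task.
task_tokens = ["cognitive_reframing", "rumination"]
num_added = tokenizer.add_tokens(task_tokens)    # extend the vocabulary
model.resize_token_embeddings(len(tokenizer))    # grow the embedding matrix
print(f"added {num_added} task-specific tokens")
```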
Classical Chinese is a gateway to the rich heritage and wisdom of ancient China, yet its complexities pose formidable comprehension barriers for most modern people without specialized knowledge. While Large language M...
While transformer models exhibit strong capabilities on linguistic tasks, their complex architectures make them difficult to interpret. Recent work has aimed to reverse engineer transformer models into human-readable ...
ISBN (print): 9798891760608
Generative approaches powered by large language models (LLMs) have demonstrated emergent abilities in tasks that require complex reasoning. Yet their generative nature means the generated content can suffer from hallucinations, making them unsuitable for entity-centric tasks like entity linking (EL), which requires precise entity predictions over a large knowledge base. We present the Instructed Generative Entity Linker (INSGENEL), the first approach that enables causal language models to perform entity linking over knowledge bases. We propose several methods to equip language models with EL capability, including (i) a sequence-to-sequence training EL objective with instruction-tuning, and (ii) a novel generative EL framework based on a lightweight potential-mention retriever that frees the model from heavy and non-parallelizable decoding, achieving a 4x speedup without compromising linking metrics. INSGENEL outperforms previous generative alternatives by +6.8 F1 points on average, with a large advantage in training data efficiency and training compute consumption. In addition, even our carefully engineered in-context learning (ICL) framework for EL still lags significantly behind INSGENEL, reaffirming that the EL task remains a persistent hurdle for general LLMs.
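As a toy illustration of the retrieve-then-generate idea: a lightweight retriever first proposes candidate mention spans, so the language model only has to select or generate an entity label per span rather than decode freely over the whole document. The dictionary lookup below is a placeholder for INSGENEL's actual retriever.

```python
# Toy mention-retrieval stage (placeholder components, not the paper's code).
def retrieve_mentions(text: str, mention_dict: dict):
    """Return (span, surface form, candidate entities) for known surface forms."""
    hits = []
    for surface, candidates in mention_dict.items():
        start = text.lower().find(surface.lower())
        if start != -1:
            hits.append(((start, start + len(surface)), surface, candidates))
    return hits

mention_dict = {"Paris": ["Paris (city)", "Paris (mythology)"]}
for span, surface, cands in retrieve_mentions("Paris hosted the games.", mention_dict):
    # A causal LM would then score or generate the correct candidate.
    print(span, surface, "->", cands)
```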
Language Models (LMs) have shown promising performance in natural language generation. However, as LMs often generate incorrect or hallucinated responses, it is crucial to correctly quantify their uncertainty in respo...