The most effective techniques to detect LLM-generated text rely on inserting a detectable signature, or watermark, during the model's decoding process. Most existing watermarking methods require access to the underl...
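As a rough illustration only: a common family of decoding-time watermarks biases sampling toward a pseudo-random "green" subset of the vocabulary derived from the previous token, then detects the watermark by counting how often generated tokens fall in that subset. The sketch below follows that generic green-list idea; the constants GAMMA and DELTA, the seeding scheme, and all function names are assumptions for illustration, not the specific method of this paper.

```python
import numpy as np

VOCAB_SIZE = 50_000
GAMMA = 0.5   # fraction of the vocabulary marked "green" at each step (assumed)
DELTA = 2.0   # logit bonus added to green tokens (assumed)

def green_ids(prev_token: int) -> np.ndarray:
    """Derive a pseudo-random green subset of the vocabulary from the previous token."""
    rng = np.random.default_rng(prev_token)
    return rng.permutation(VOCAB_SIZE)[: int(GAMMA * VOCAB_SIZE)]

def watermarked_sample(logits: np.ndarray, prev_token: int) -> int:
    """Sample the next token after boosting the logits of the green subset."""
    biased = logits.astype(float).copy()
    biased[green_ids(prev_token)] += DELTA
    probs = np.exp(biased - biased.max())
    probs /= probs.sum()
    return int(np.random.default_rng().choice(VOCAB_SIZE, p=probs))

def detection_z_score(tokens: list[int]) -> float:
    """Count how often each token falls in its predecessor's green set;
    a large z-score suggests the text carries the watermark."""
    hits = sum(tok in set(green_ids(prev).tolist())
               for prev, tok in zip(tokens, tokens[1:]))
    n = len(tokens) - 1
    expected, var = GAMMA * n, GAMMA * (1 - GAMMA) * n
    return (hits - expected) / var ** 0.5
```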
ISBN: (Print) 9798891760608
The information stored in large language models (LLMs) falls out of date quickly, and retraining from scratch is often not an option. This has recently given rise to a range of techniques for injecting new facts through updating model weights. Current evaluation paradigms are extremely limited, mainly validating the recall of edited facts, but changing one fact should cause rippling changes to the model's related beliefs. If we edit the UK Prime Minister to now be Rishi Sunak, then we should get a different answer to "Who is married to the British Prime Minister?" In this work, we present a benchmark, MQUAKE (Multi-hop Question Answering for Knowledge Editing), comprising multi-hop questions that assess whether edited models correctly answer questions where the answer should change as an entailed consequence of edited facts. While we find that current knowledge-editing approaches can recall edited facts accurately, they fail catastrophically on the constructed multi-hop questions. We thus propose a simple memory-based approach, MeLLo, which stores all edited facts externally while prompting the language model iteratively to generate answers that are consistent with the edited facts. While MQUAKE remains challenging, we show that MeLLo scales well with LLMs (up to 175B) and outperforms previous model editors by a large margin.
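A minimal sketch of how a memory-based editor in this spirit can be wired together, assuming a generic llm() text-completion callable and a retrieve() similarity search over the stored edit memory; the prompt wording, the FINAL convention, the hop limit, and the helper names are illustrative assumptions rather than MeLLo's actual prompts or code.

```python
from typing import Callable

def answer_with_edits(question: str,
                      edited_facts: list[str],
                      llm: Callable[[str], str],
                      retrieve: Callable[[str, list[str]], str],
                      max_hops: int = 4) -> str:
    """Iteratively decompose a multi-hop question, checking each hop against an
    external memory of edited facts and preferring the edit when it conflicts."""
    answer_so_far = ""
    for _ in range(max_hops):
        # 1. Ask the model for the next subquestion, or a final answer.
        step = llm(f"Question: {question}\nKnown so far: {answer_so_far}\n"
                   "Give the next subquestion, or 'FINAL: <answer>' if done:")
        if step.startswith("FINAL:"):
            return step.removeprefix("FINAL:").strip()
        # 2. Tentatively answer the subquestion from the model's own knowledge.
        tentative = llm(f"Answer briefly: {step}")
        # 3. Retrieve the most relevant edited fact from the external memory.
        fact = retrieve(step, edited_facts)
        # 4. If the retrieved edit contradicts the tentative answer, use the edit.
        verdict = llm(f"Fact: {fact}\nAnswer: {tentative}\n"
                      "Does the fact contradict the answer? yes/no:")
        answer_so_far += " " + (fact if verdict.strip().lower().startswith("yes") else tentative)
    return answer_so_far.strip()
```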
ISBN: (Print) 9798891760608
We tackle the problem of zero-shot cross-lingual transfer in NLP tasks via the use of language adapters (LAs). Most earlier works have explored training with the adapter of a single source language (often English) and testing either with the target LA or with the LA of another related language. Training a target LA requires unlabeled data, which may not be readily available for low-resource unseen languages: those that are neither seen by the underlying multilingual language model (e.g., mBERT) nor have any (labeled or unlabeled) data available. We posit that for more effective cross-lingual transfer, instead of just one source LA, we need to leverage the LAs of multiple (linguistically or geographically related) source languages, both at train and test time, which we investigate via our novel neural architecture, ZGUL. Extensive experimentation across four language groups, covering 15 unseen target languages, demonstrates improvements of up to 3.2 average F1 points over standard fine-tuning and other strong baselines on POS tagging and NER tasks. We also extend ZGUL to settings where either (1) some unlabeled data or (2) few-shot training examples are available for the target language. We find that ZGUL continues to outperform baselines in these settings too.
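One way to combine several source-language adapters at train and test time is a learned softmax-weighted mixture of standard bottleneck adapters, sketched below in PyTorch; the module names, the simple weighted sum, and the single shared mixing weight per source are assumptions for illustration and not ZGUL's exact fusion mechanism.

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """A standard bottleneck adapter: down-project, nonlinearity, up-project, residual."""
    def __init__(self, hidden: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden, bottleneck)
        self.up = nn.Linear(bottleneck, hidden)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        return h + self.up(torch.relu(self.down(h)))

class MultiSourceAdapterMix(nn.Module):
    """Run several source-language adapters and mix their outputs with softmax weights."""
    def __init__(self, source_adapters: list[Adapter]):
        super().__init__()
        self.adapters = nn.ModuleList(source_adapters)
        # One mixing logit per source language (a simplifying assumption).
        self.mix_logits = nn.Parameter(torch.zeros(len(source_adapters)))

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        weights = torch.softmax(self.mix_logits, dim=0)            # (num_sources,)
        outs = torch.stack([a(h) for a in self.adapters], dim=0)   # (num_sources, *h.shape)
        return (weights.view(-1, *([1] * h.dim())) * outs).sum(dim=0)

# Example: mix three source adapters over hidden states h of shape (batch, seq, 768):
#   mixed = MultiSourceAdapterMix([Adapter(768) for _ in range(3)])(h)
```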
Large language models (LLMs) are known to be trained on vast amounts of data, which may unintentionally or intentionally include data from commonly used benchmarks. This inclusion can lead to misleadingly high scores on...
ISBN: (Print) 9798350366235; 9798350366242
Natural language processing (NLP) has witnessed a paradigm shift with Large Language Models (LLMs), yet the static knowledge from pre-training can lead to knowledge obsolescence. This study focuses on the dynamic relationship between LLMs and evolving knowledge, using GPT-2 as a case study. Leveraging an existing framework, we update models with monthly Wikipedia dumps and Wikidata probes, addressing the stability-plasticity trade-off. We introduce a novel synthetic data generation method for experimental control and present SMARTREVIEW, a state-of-the-art continual learning method. This work advances understanding and methodologies in tackling knowledge obsolescence in evolving language models.
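The overall training regime can be pictured as a monthly fine-tuning loop with a rehearsal buffer, as in the generic experience-replay sketch below; the train_lm and probe_fn helpers, the replay ratio, and the data layout are hypothetical placeholders, and this is not SMARTREVIEW itself, whose selection strategy the abstract does not spell out.

```python
import random

def continual_update(model, tokenizer, monthly_dumps, probes, probe_fn, replay_ratio=0.1):
    """Sequentially fine-tune on each month's dump, mixing in a rehearsal buffer
    of earlier text to limit forgetting, then probe factual recall.

    monthly_dumps: list of (month, texts); probes: dict month -> probe set.
    model.train_lm(...) and probe_fn(...) are hypothetical placeholders.
    """
    replay_buffer = []
    for month, corpus in monthly_dumps:
        k = int(replay_ratio * len(corpus))
        batch = corpus + random.sample(replay_buffer, min(k, len(replay_buffer)))
        random.shuffle(batch)
        model.train_lm(batch, tokenizer)           # one causal-LM fine-tuning pass (placeholder)
        replay_buffer.extend(random.sample(corpus, min(k, len(corpus))))
        accuracy = probe_fn(model, probes[month])  # Wikidata-style cloze probes (placeholder)
        print(f"{month}: probe accuracy {accuracy:.3f}")
```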
Large Language Models (LLMs) have recently revolutionized the NLP field, while they still fall short in some specific downstream tasks. In this work, we focus on utilizing LLMs to perform machine translation, where we...
In this study, we aim to explore efficient inference for a Multitask Speech Language Model (SpeechLM) via token reduction. Unlike other modalities such as vision or text, speech has unique temporal dependencies, making prev...
ISBN: (Print) 9798891760608
Cross-lingual transfer learning heavily relies on well-aligned cross-lingual representations. Syntactic structure is recognized as beneficial for cross-lingual transfer, but limited research has utilized it for aligning representations in multilingual pre-trained language models (PLMs). Additionally, existing methods require syntactic labels that are difficult to obtain and of poor quality for low-resource languages. To address this gap, we propose Struct-XLM, a novel multilingual language model that leverages reinforcement learning (RL) to autonomously discover universal syntactic structures for improving the cross-lingual representation alignment of PLMs. Struct-XLM integrates a policy network (PNet) and a translation ranking task. The PNet is designed to discover structural information and integrate it into the last layer of the PLM through a structural multi-head attention module to obtain structural representations. The translation ranking task provides a delayed reward based on the structural representation to optimize the PNet while improving the alignment of cross-lingual representations. Experiments show the effectiveness of the proposed approach for enhancing the cross-lingual transfer of multilingual PLMs on the XTREME benchmark.
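A compressed sketch of the RL component, under the assumption that the policy network emits one binary structure decision per token and is updated with REINFORCE from the delayed translation-ranking reward; the network size, the sampling scheme, and the reward_fn placeholder are illustrative simplifications of what the abstract describes.

```python
import torch
import torch.nn as nn

class PolicyNet(nn.Module):
    """Scores each token and samples a binary structure decision (e.g. keep/split)."""
    def __init__(self, hidden: int):
        super().__init__()
        self.scorer = nn.Linear(hidden, 1)

    def forward(self, states: torch.Tensor):
        # states: (batch, seq, hidden) token representations from the PLM.
        probs = torch.sigmoid(self.scorer(states)).squeeze(-1)       # (batch, seq)
        actions = torch.bernoulli(probs)                              # sampled decisions
        log_prob = (actions * probs.clamp_min(1e-8).log()
                    + (1 - actions) * (1 - probs).clamp_min(1e-8).log()).sum(-1)
        return actions, log_prob                                      # log_prob: (batch,)

def reinforce_step(policy: PolicyNet, optimizer, states, reward_fn):
    """One delayed-reward update: reward_fn builds the structural attention mask from
    the sampled actions, runs the translation-ranking task, and returns a reward tensor."""
    actions, log_prob = policy(states)
    reward = reward_fn(actions)                   # e.g. per-example ranking accuracy, (batch,)
    loss = -(reward.detach() * log_prob).mean()   # REINFORCE objective
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```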
Large language models (LLMs) can handle multilingual and cross-lingual text within a single input; however, previous works leveraging multilingualism in LLMs primarily focus on using English as the pivot language to en...
ISBN: (Print) 9798891760608
Temporal reasoning represents a vital component of human communication and understanding, yet remains an underexplored area within the context of Large Language Models (LLMs). Despite LLMs demonstrating significant proficiency in a range of tasks, a comprehensive, large-scale analysis of their temporal reasoning capabilities is missing. Our paper addresses this gap, presenting the first extensive benchmarking of LLMs on temporal reasoning tasks. We critically evaluate 8 different LLMs across 6 datasets using 3 distinct prompting strategies. Additionally, we broaden the scope of our evaluation by including in our analysis 2 Code Generation LMs. Beyond broad benchmarking of models and prompts, we also conduct a fine-grained investigation of performance across different categories of temporal tasks. We further analyze the LLMs on varying temporal aspects, offering insights into their proficiency in understanding and predicting the continuity, sequence, and progression of events over time. Our findings reveal a nuanced depiction of the capabilities and limitations of the models within temporal reasoning, offering a comprehensive reference for future research in this pivotal domain.
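The evaluation protocol implied by the abstract (every model run on every dataset under each prompting strategy) can be organized as a simple grid, sketched below; the data layout, the score() callable, and the strategy interface are placeholders rather than the paper's actual harness.

```python
def run_benchmark(models, datasets, strategies, score):
    """models: name -> callable(prompt) -> text; datasets: name -> list of
    {'question': ..., 'answer': ...}; strategies: name -> callable(question) -> prompt
    (e.g. zero-shot, few-shot, chain-of-thought); score: (preds, golds) -> float."""
    results = {}
    for model_name, model in models.items():
        for ds_name, examples in datasets.items():
            for strat_name, make_prompt in strategies.items():
                preds = [model(make_prompt(ex["question"])) for ex in examples]
                golds = [ex["answer"] for ex in examples]
                results[(model_name, ds_name, strat_name)] = score(preds, golds)
    return results
```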