We present Sailor, a family of open language models ranging from 0.5B to 14B parameters, tailored for South-East Asian (SEA) languages. From Qwen1.5, Sailor models accept 200B to 400B tokens during continual pre-train...
We propose a constraint learning schema for fine-tuning Large Language Models (LLMs) with attribute control. Given a training corpus and control criteria formulated as a sequence-level constraint on model outputs, our...
We propose VE-KD, a novel method that balances knowledge distillation and vocabulary expansion with the aim of training efficient domain-specific language models. Compared with traditional pre-training approaches, VE-...
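For reference, below is a minimal sketch of the standard soft-target knowledge distillation objective that the distillation half of this setup typically builds on. The function name, temperature, and toy tensor shapes are illustrative assumptions; the snippet does not describe VE-KD's actual loss or how it handles an expanded vocabulary.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Generic soft-target distillation: match the student's softened
    distribution to the teacher's via KL divergence (illustrative only)."""
    student_log_probs = F.log_softmax(student_logits / temperature, dim=-1)
    teacher_probs = F.softmax(teacher_logits / temperature, dim=-1)
    # Scale by T^2 so gradient magnitude stays comparable across temperatures.
    return F.kl_div(student_log_probs, teacher_probs, reduction="batchmean") * temperature ** 2

# Toy usage: 4 positions over a shared 100-token vocabulary.
student = torch.randn(4, 100)
teacher = torch.randn(4, 100)
print(distillation_loss(student, teacher).item())
```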
The objective of our research is to remedy the low quality of language models for low-resource languages. We introduce the Token Embedding Mapping Algorithm (TEMA), which maps th...
Research on continuous sign language recognition (CSLR) is essential to bridge the communication gap between deaf and hearing individuals. Numerous previous studies have trained their models using the connectionist te...
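For context, connectionist temporal classification (CTC) training can be set up as in the minimal PyTorch sketch below; the encoder outputs, gloss vocabulary size, and sequence lengths are toy assumptions, not the cited models' actual pipeline.

```python
import torch
import torch.nn as nn

# T = number of video frames, N = batch size, C = gloss vocabulary size incl. blank (index 0)
T, N, C = 50, 2, 20
ctc = nn.CTCLoss(blank=0, zero_infinity=True)

# Frame-wise log-probabilities that would come from a visual encoder (random here).
log_probs = torch.randn(T, N, C).log_softmax(dim=-1)
targets = torch.randint(1, C, (N, 10), dtype=torch.long)    # gloss label sequences
input_lengths = torch.full((N,), T, dtype=torch.long)
target_lengths = torch.full((N,), 10, dtype=torch.long)

loss = ctc(log_probs, targets, input_lengths, target_lengths)
print(loss.item())
```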
ISBN (print): 9798891760608
In this paper, we focus on editing Multimodal Large Language Models (MLLMs). Compared to editing single-modal LLMs, multimodal model editing is more challenging and demands a higher level of scrutiny and careful consideration in the editing process. To facilitate research in this area, we construct a new benchmark, dubbed MMEdit, for editing multimodal LLMs and establish a suite of innovative metrics for evaluation. We conduct comprehensive experiments involving various model editing baselines and analyze the impact of editing different components of multimodal LLMs. Empirically, we find that previous baselines can edit multimodal LLMs to some extent, but the effect is still barely satisfactory, indicating the potential difficulty of this task. We hope that our work can provide the NLP community with insights.
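The abstract does not spell out its metric suite; the sketch below assumes the reliability / generality / locality criteria commonly used in model-editing evaluation, with a hypothetical evaluate_edit helper and a stubbed model, not MMEdit's actual API.

```python
def exact_match(model, prompts, expected):
    """Fraction of prompts whose generation equals the expected answer."""
    hits = sum(model(p).strip() == e for p, e in zip(prompts, expected))
    return hits / max(len(prompts), 1)

def evaluate_edit(model, edit_probe, paraphrase_probe, unrelated_probe):
    return {
        # Reliability: the edited fact itself is now produced.
        "reliability": exact_match(model, *edit_probe),
        # Generality: rephrasings of the edited query also change.
        "generality": exact_match(model, *paraphrase_probe),
        # Locality: unrelated queries keep their pre-edit answers.
        "locality": exact_match(model, *unrelated_probe),
    }

# Toy usage with a stubbed "model" that echoes a lookup table.
table = {"Who painted the Mona Lisa?": "Leonardo da Vinci"}
model = lambda prompt: table.get(prompt, "unknown")
print(evaluate_edit(
    model,
    (["Who painted the Mona Lisa?"], ["Leonardo da Vinci"]),
    (["Mona Lisa was painted by whom?"], ["Leonardo da Vinci"]),
    (["What is the capital of France?"], ["Paris"]),
))
```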
The advent of large language models (LLMs) has dramatically advanced the state of the art in numerous natural language generation tasks. For LLMs to be applied reliably, it is essential to have an accurate measure of ...
Entity Linking (EL) is the process of associating ambiguous textual mentions to specific entities in a knowledge base. Traditional EL methods heavily rely on large datasets to enhance their performance, a dependency t...
Ambiguity is an inherent feature of language, whose management is crucial for effective communication and collaboration. This is particularly true for Chinese, a language with extensive lexical-morphemic ambiguity. De...
The prevalent use of large language models (LLMs) in various domains has drawn attention to the issue of "hallucination", which refers to instances where LLMs generate factually inaccurate or ungrounded info...