In recent years, Large Language Models (LLMs) have made significant strides towards Artificial General Intelligence. However, training these models from scratch requires substantial computational resources and vast am...
Though Large Language Models (LLMs) have shown remarkable abilities in mathematical reasoning, they still struggle to perform numeric operations accurately, such as addition and multiplication. Numbers can b...
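As a toy illustration of why tokenization matters here (my own sketch, not the paper's method; the vocabulary below is invented), a BPE-style tokenizer can split the same digits into irregular chunks, whereas digit-level tokenization keeps place value consistent across numbers:

# Toy illustration: subword merges break digits into irregular pieces.
def bpe_like_tokenize(number, vocab=("12", "34", "567", "89")):
    """Greedy longest-match split, mimicking how a learned BPE vocabulary
    can break the same digit string differently depending on context."""
    tokens, i = [], 0
    while i < len(number):
        for piece in sorted(vocab, key=len, reverse=True):
            if number.startswith(piece, i):
                tokens.append(piece)
                i += len(piece)
                break
        else:
            tokens.append(number[i])
            i += 1
    return tokens

def digit_tokenize(number):
    """One token per digit: consistent and place-value friendly."""
    return list(number)

print(bpe_like_tokenize("1234567"))  # ['12', '34', '567'] -- irregular chunks
print(bpe_like_tokenize("234567"))   # ['2', '34', '567']  -- same suffix, different split
print(digit_tokenize("1234567"))     # ['1', '2', '3', '4', '5', '6', '7']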
The increasing scale of Transformer models has driven a corresponding growth in their pre-training computational requirements. While quantization has proven to be effective after pre-training and during fine-tuning, applying qua...
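For context, a minimal sketch of one common way to apply quantization while training: "fake-quantize" the weights in the forward pass and use a straight-through estimator so the full-precision master weights still receive gradients. This is a generic PyTorch illustration under my own assumptions, not the specific method proposed in the paper:

import torch
import torch.nn as nn

def fake_quantize(w, bits=8):
    qmax = 2 ** (bits - 1) - 1
    scale = w.abs().max().clamp(min=1e-8) / qmax          # per-tensor scale
    q = torch.clamp(torch.round(w / scale), -qmax, qmax)  # quantize
    w_q = q * scale                                        # dequantize
    return w + (w_q - w).detach()                          # straight-through estimator

class QuantLinear(nn.Linear):
    """Linear layer whose weights are quantized on the fly in the forward pass."""
    def forward(self, x):
        return nn.functional.linear(x, fake_quantize(self.weight), self.bias)

layer = QuantLinear(16, 16)
loss = layer(torch.randn(4, 16)).pow(2).mean()
loss.backward()  # gradients flow back to the full-precision master weights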
Recent advancements in large language models (LLMs) are propelling us toward artificial general intelligence with their remarkable emergent abilities and reasoning capabilities. However, the substantial computational ...
ISBN: (Print) 9798891760615
Theory of Mind (ToM) is a critical component of intelligence, but its assessment remains the subject of heated debates. Prior research applied human ToM assessments to natural language processing models using either human-created standardized tests or rule-based templates. However, these methods primarily focus on simplistic reasoning and require further validation. Here, we leverage dynamic epistemic logic to isolate a particular component of ToM and to generate controlled problems. We also introduce new verbalization techniques to express these problems in English natural language. Our findings indicate that some language model scaling (from 70M to 6B and from 350M to 174B parameters) does not consistently yield results better than random chance. While GPT-4 demonstrates superior epistemic reasoning capabilities, there is still room for improvement. Our code and datasets are publicly available.(1)
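A toy sketch of programmatically generating a controlled knowledge-reasoning problem and verbalizing it with a template, in the spirit described above; the scenario, agents, and wording are my own invention, not the authors' generator:

import random

FACT = "the key is in the drawer"

def generate_problem(seed):
    rng = random.Random(seed)
    observer_present = rng.choice([True, False])  # did Bob witness the event?
    story = (
        "Alice puts the key in the drawer. "
        + ("Bob is in the room and watches her." if observer_present
           else "Bob is outside and sees nothing.")
    )
    question = f"Does Bob know that {FACT}?"
    answer = "yes" if observer_present else "no"
    return story, question, answer

for seed in range(3):
    story, question, answer = generate_problem(seed)
    print(story, question, f"(gold: {answer})")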
Projecting visual features into word embedding space has become a significant fusion strategy adopted by Multimodal Large Language Models (MLLMs). However, its internal mechanisms have yet to be explored. Inspired by ...
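A minimal sketch of the fusion strategy mentioned above: a learned linear projector maps vision-encoder features into the LLM's word embedding space, and the projected "visual tokens" are concatenated with the embedded text tokens. The dimensions below are assumptions, not taken from any particular MLLM:

import torch
import torch.nn as nn

vision_dim, embed_dim, vocab_size = 1024, 4096, 32000

projector = nn.Linear(vision_dim, embed_dim)      # learned vision-to-word-embedding map
word_embed = nn.Embedding(vocab_size, embed_dim)  # the LLM's token embedding table

image_feats = torch.randn(1, 256, vision_dim)     # 256 patch features from a vision encoder
text_ids = torch.randint(0, vocab_size, (1, 16))  # 16 prompt tokens

visual_tokens = projector(image_feats)            # now they live in word embedding space
inputs_embeds = torch.cat([visual_tokens, word_embed(text_ids)], dim=1)
print(inputs_embeds.shape)  # torch.Size([1, 272, 4096]) -- fed to the LLM as soft tokens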
Large vision-language models (LVLMs) have been rapidly developed and widely used in various fields, but the (potential) stereotypical bias in the model is largely unexplored. In this study, we present a pioneering mea...
Large Language Models (LLMs) demonstrate robust capabilities across various fields, leading to a paradigm shift in LLM-enhanced Recommender Systems (RS). Research to date focuses on point-wise and pair-wise recommendat...
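For context, point-wise, pair-wise, and list-wise prompting differ in how many candidates the LLM judges at once. The sketch below is a purely illustrative prompt builder (item names and wording are hypothetical), not the paper's prompts:

def pointwise_prompt(history, item):
    # Point-wise: score one candidate at a time.
    return (f"The user recently watched: {', '.join(history)}. "
            f"On a scale of 1-5, how likely are they to enjoy '{item}'? Answer with a number.")

def listwise_prompt(history, candidates):
    # List-wise: rank the whole candidate list in a single call.
    numbered = "\n".join(f"{i + 1}. {c}" for i, c in enumerate(candidates))
    return (f"The user recently watched: {', '.join(history)}.\n"
            f"Rank ALL candidates from most to least relevant, "
            f"answering only with the reordered numbers:\n{numbered}")

print(pointwise_prompt(["Inception", "Interstellar"], "Tenet"))
print(listwise_prompt(["Inception", "Interstellar"], ["Tenet", "Titanic", "Dunkirk"]))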
ISBN: (Print) 9798891760615
Language model prompt optimization research has shown that semantically and grammatically well-formed, manually crafted prompts are routinely outperformed by automatically generated token sequences with no apparent meaning or syntactic structure, including sequences of vectors from a model's embedding space. We use machine-generated prompts to probe how models respond to input that is not composed of natural language expressions. We study the behavior of models of different sizes in multiple semantic tasks in response to both continuous and discrete machine-generated prompts, and compare it to the behavior in response to human-generated natural-language prompts. Even when producing a similar output, machine-generated and human prompts trigger different response patterns through the network processing pathways, including different perplexities, different attention and output entropy distributions, and different unit activation profiles. We provide preliminary insight into the nature of the units activated by different prompt types, suggesting that only natural language prompts recruit a genuinely linguistic circuit.
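A rough sketch of what a "continuous" machine-generated prompt looks like in practice: a small matrix of trainable vectors in the model's embedding space, prepended to the embedded input instead of any natural-language tokens. Shapes and initialization below are assumptions for illustration only:

import torch
import torch.nn as nn

embed_dim, vocab_size, prompt_len = 768, 50257, 10

soft_prompt = nn.Parameter(torch.randn(prompt_len, embed_dim) * 0.02)  # optimized, never written by a human
word_embed = nn.Embedding(vocab_size, embed_dim)

text_ids = torch.randint(0, vocab_size, (1, 12))
inputs_embeds = torch.cat(
    [soft_prompt.unsqueeze(0), word_embed(text_ids)], dim=1
)  # shape (1, 22, 768); the model never sees a readable prompt string
print(inputs_embeds.shape)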