Authors:
Qu, Anlin; Niu, Jianwei; Mo, Shasha
Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
Beihang Univ, Sch Comp Sci & Engn, Beijing Adv Innovat Ctr Big Data & Brain Comp, Beijing 100191, Peoples R China
Beihang Univ, Hangzhou Innovat Res Inst, Hangzhou 310051, Peoples R China
Zhengzhou Univ, Res Inst Ind Technol, Zhengzhou 450001, Peoples R China
Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
ISBN:
(Print) 9781728171227
Language modeling is an important problem in Natural Language Processing (NLP), and the multi-layer Transformer network is currently the most advanced and effective model for this task. However, there exist two inherent defects in its multi-head self-attention structure: (1) attention information loss: lower-level attention weights cannot be explicitly passed to upper layers, which may cause the network to lose pivotal attention information captured by lower-level layers; (2) multi-head bottleneck: the dimension of each head in the vanilla Transformer is relatively small and each head is processed independently, which introduces an expressive bottleneck and makes subspace learning fundamentally inadequate. To overcome these two weaknesses, a novel neural architecture named Guide-Transformer is proposed in this paper. Guide-Transformer utilizes horizontal and vertical attention information to guide the original multi-head self-attention sublayer without introducing excessive complexity. Experimental results on three authoritative language modeling benchmarks demonstrate the effectiveness of Guide-Transformer. On the popular perplexity (ppl) and bits-per-character (bpc) evaluation metrics, Guide-Transformer achieves moderate improvements over a powerful baseline model.
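The abstract does not detail the guiding mechanism, so the following PyTorch sketch is only an illustration of the general idea under stated assumptions: a self-attention sublayer that returns its attention map and blends in the previous layer's map through a learned gate, so that lower-level attention weights can be explicitly passed upward. The gating scheme, class name, and dimensions are assumptions, not the paper's design.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class GuidedSelfAttention(nn.Module):
        """Hypothetical sketch: mixes the previous layer's attention map into this one."""
        def __init__(self, d_model, n_heads):
            super().__init__()
            self.n_heads, self.d_head = n_heads, d_model // n_heads
            self.qkv = nn.Linear(d_model, 3 * d_model)
            self.out = nn.Linear(d_model, d_model)
            self.gate = nn.Parameter(torch.zeros(1))  # learned mixing weight (assumption)

        def forward(self, x, prev_attn=None):
            B, T, _ = x.shape
            q, k, v = self.qkv(x).chunk(3, dim=-1)
            split = lambda t: t.view(B, T, self.n_heads, self.d_head).transpose(1, 2)
            q, k, v = split(q), split(k), split(v)
            attn = F.softmax(q @ k.transpose(-2, -1) / self.d_head ** 0.5, dim=-1)
            if prev_attn is not None:
                g = torch.sigmoid(self.gate)
                attn = (1 - g) * attn + g * prev_attn  # reuse lower-level attention
            y = (attn @ v).transpose(1, 2).reshape(B, T, -1)
            return self.out(y), attn  # returned attn can guide the next layer

Stacking such layers and feeding each returned attention map into the next layer's prev_attn argument is one way to realize the "vertical" guidance mentioned above.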
ISBN:
(Print) 9781450391153
Our goal is to generate coherent text that is accurate in both its semantic information and its syntactic structure. Embedding methods and neural language models are indispensable for generating coherent text, as they learn semantic information and syntactic structure, respectively. We focus here on parts of speech (POS) (e.g., noun, verb, preposition) to enhance these models and to generate truly coherent text more efficiently than is possible by using either of them in isolation. This leads us to derive Words and Topics and POS 2 Vec (WTP2Vec) as an embedding method, and the Structure Aware Unified Language Model (SAUL) as a neural language model. Experiments show that our approach enhances previous models and generates coherent, semantically valid text with natural syntactic structure.
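As a rough illustration of how POS information can be folded into an embedding layer (the actual WTP2Vec training objective and SAUL architecture are not described in the abstract and are not reproduced here), the sketch below concatenates word and POS-tag embeddings for each token; the class name, dimensions, and use of NLTK for tagging are assumptions.

    import torch
    import torch.nn as nn
    import nltk  # requires nltk.download("punkt") and nltk.download("averaged_perceptron_tagger")

    class WordPosEmbedding(nn.Module):
        """Illustrative only: concatenates lexical and syntactic views of each token."""
        def __init__(self, word_vocab, pos_vocab, d_word=100, d_pos=20):
            super().__init__()
            self.word_vocab, self.pos_vocab = word_vocab, pos_vocab  # dicts: symbol -> index
            self.word_emb = nn.Embedding(len(word_vocab), d_word)
            self.pos_emb = nn.Embedding(len(pos_vocab), d_pos)

        def forward(self, sentence):
            tagged = nltk.pos_tag(nltk.word_tokenize(sentence))
            w = torch.tensor([self.word_vocab.get(tok.lower(), 0) for tok, _ in tagged])
            p = torch.tensor([self.pos_vocab.get(tag, 0) for _, tag in tagged])
            # each token is represented by its word embedding joined with its POS embedding
            return torch.cat([self.word_emb(w), self.pos_emb(p)], dim=-1)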
ISBN:
(Print) 9781450393034
Compiler fuzzing tools such as Csmith have uncovered many bugs in compilers by randomly sampling programs from a generative model. The success of these tools is often attributed to their ability to generate unexpected corner-case inputs that developers tend to overlook during manual testing. At the same time, their chaotic nature makes fuzzer-generated test cases notoriously hard to interpret, which has led to the creation of input simplification tools such as C-Reduce (for C compiler bugs). In previously unrelated work, researchers have also shown that human-written software tends to be rather repetitive and predictable to language models. Studies show that developers deliberately write more predictable code, whereas code with bugs is relatively unpredictable. In this study, we ask the natural question of whether this high predictability property of code also, perhaps counter-intuitively, applies to fuzzer-generated code. That is, we investigate whether fuzzer-generated compiler inputs are deemed unpredictable by a language model built on human-written code, and surprisingly conclude that they are not. To the contrary, Csmith-generated programs are more predictable on a per-token basis than human-written C programs. Furthermore, bug-triggering inputs tended to be even more predictable than random inputs, and the C-Reduce minimization tool did not substantially increase this predictability. Rather, we find that bug-triggering inputs are unpredictable relative to Csmith's own generative model. This is encouraging; our results suggest promising research directions on incorporating predictability metrics into fuzzing and reduction tools themselves.
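The measurement described here, per-token predictability under a language model trained on human-written code, can be sketched with a much simpler stand-in than the study's actual model. The snippet below trains an add-alpha-smoothed token bigram model on one C file and reports bits per token on another; the tokenizer, smoothing, and file paths are illustrative assumptions.

    import math
    import re
    from collections import Counter

    def tokenize(src):
        # crude C tokenizer: identifiers, numbers, or single non-space characters
        return re.findall(r"[A-Za-z_]\w*|\d+|\S", src)

    def train_bigram(tokens, alpha=1.0):
        unigrams = Counter(tokens)
        bigrams = Counter(zip(tokens, tokens[1:]))
        vocab = len(unigrams) + 1
        def prob(prev, tok):  # add-alpha smoothed P(tok | prev)
            return (bigrams[(prev, tok)] + alpha) / (unigrams[prev] + alpha * vocab)
        return prob

    def bits_per_token(prob, tokens):
        pairs = list(zip(tokens, tokens[1:]))
        return -sum(math.log2(prob(p, t)) for p, t in pairs) / max(len(pairs), 1)

    # Usage (paths are placeholders):
    # model = train_bigram(tokenize(open("human_written.c").read()))
    # print(bits_per_token(model, tokenize(open("csmith_generated.c").read())))  # lower = more predictable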
Background: The prognosis, diagnosis, and treatment of many genetic disorders and familial diseases significantly improve if the family history (FH) of a patient is known. Such information is often written in the free text of clinical notes. Objective: The aim of this study is to develop automated methods that enable access to FH data through natural language processing. Methods: We performed information extraction by using transformers to extract disease mentions from notes. We also experimented with rule-based methods for extracting family member (FM) information from text and coreference resolution techniques. We evaluated different transfer learning strategies to improve the annotation of diseases. We provided a thorough error analysis of the contributing factors that affect such information extraction systems. Results: Our experiments showed that the combination of domain-adaptive pretraining and intermediate-task pretraining achieved an F1 score of 81.63% for the extraction of diseases and FMs from notes when it was tested on a public shared task data set from the National Natural Language Processing Clinical Challenges (N2C2), providing a statistically significant improvement over the baseline (P<.001). In comparison, in the 2019 N2C2/Open Health Natural Language Processing Shared Task, the median F1 score of all 17 participating teams was 76.59%. Conclusions: Our approach, which leverages a state-of-the-art named entity recognition model for disease mention detection coupled with a hybrid method for FM mention detection, achieved an effectiveness that was close to that of the top 3 systems participating in the 2019 N2C2 FH extraction challenge, with only the top system convincingly outperforming our approach in terms of precision.
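For readers unfamiliar with the extraction step, the following is a minimal, hypothetical sketch of transformer-based mention detection using the Hugging Face token-classification pipeline; the model path is a placeholder for a fine-tuned clinical NER checkpoint and is not the authors' system.

    from transformers import pipeline

    ner = pipeline(
        "token-classification",
        model="path/to/clinical-ner-model",   # placeholder for a fine-tuned checkpoint
        aggregation_strategy="simple",        # merge word-piece predictions into entity spans
    )

    note = "Patient's mother was diagnosed with breast cancer at age 52."
    for entity in ner(note):
        print(entity["entity_group"], entity["word"], round(entity["score"], 3))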
ISBN:
(Print) 9781538658895
Language modeling (LM) is a subtask of Natural Language Processing (NLP), and the goal of LM is to build a statistical language model that can learn and estimate a probability distribution of natural language over sentences of terms. Recently, many recurrent neural network based LMs, a type of deep neural network for dealing with sequential data, have been proposed and have achieved remarkable results. However, they rely solely on analysis of the words occurring in sentences, even though every sentence contains useful morphological information, such as Part-of-Speech (POS) tags, which is necessary for constituting a sentence and can be used as a feature for analysis. Although morphological information can be useful for LM, using that information as input to a neural network based LM is not straightforward, because adding features between words as a one-dimensional array increases the number of time steps of the recurrent neural network and can cause the vanishing gradient problem. To solve this problem, we propose in this paper a CNN-LSTM based language model that treats textual data as multi-dimensional input to the network. To train this multi-dimensional input with a Long Short-Term Memory (LSTM) network, we use a convolutional neural network (CNN) with a 1x1 filter for dimensionality reduction of the input data, avoiding the vanishing gradient problem by decreasing the number of time steps between input words. In addition, our approach, which uses multi-dimensional data reduced by the CNN, can be used as a plugin with many customized LSTM based LMs. On the Penn Treebank corpus, our model improves perplexity not only with vanilla LSTM but also with customized LSTM models.
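A minimal PyTorch sketch of the described idea, assuming each time step carries a stacked word-plus-feature vector: a Conv1d with kernel size 1 compresses the feature dimension before the LSTM, and a linear decoder produces next-word logits. The class name, dimensions, and hyperparameters are illustrative, not the paper's settings.

    import torch
    import torch.nn as nn

    class Conv1x1LstmLM(nn.Module):
        def __init__(self, in_dim=400, reduced_dim=200, hidden=650, vocab=10000):
            super().__init__()
            self.reduce = nn.Conv1d(in_dim, reduced_dim, kernel_size=1)  # 1x1 filter over features
            self.lstm = nn.LSTM(reduced_dim, hidden, batch_first=True)
            self.decoder = nn.Linear(hidden, vocab)

        def forward(self, x):                  # x: (batch, time, in_dim) word+feature vectors
            z = self.reduce(x.transpose(1, 2)).transpose(1, 2)  # compress the feature dimension
            out, _ = self.lstm(z)
            return self.decoder(out)           # next-word logits at every time step

    # logits = Conv1x1LstmLM()(torch.randn(8, 35, 400))  # -> shape (8, 35, 10000)

Because the reduction happens before the recurrent layer, the same module could in principle be placed in front of other LSTM-based language models, which is the "plugin" usage the abstract refers to.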