The extensive application of smart contract technology in the blockchain domain has positioned it as a key component of the digital economy. However, as the application scope of smart contracts expands, security issue...
详细信息
作者:
Su, LishaYan, RongCollege of Computer Science
Inner Mongolia University Inner Mongolia Key Laboratory of Multilingual Artificial Intelligence Technology National & Local Joint Engineering Research Center of Intelligent Information Processing Technology for Mongolian Hohhot010021 China
Joint extraction of entities and relations is an essential task in information extraction. Recently, tagging-based models have gained attention but with poor performance on overlapping triplets, which confronted the i...
详细信息
Multiple Sclerosis (MS), an autoimmune disorder impacting the central nervous system, is increasingly prevalent. In this context, automated tools for MS lesions segmentation could aid healthcare professionals accelera...
详细信息
In recent years, the extensive utilization of electronic medical records has led to the preservation of a large amount of historical patient information, culminating in the formation of Electronic Health Record (EHR) ...
详细信息
In order to provide adaptive and user-friendly solutions to robotic manipulation, it is important that the agent can learn to accomplish tasks even if they are only provided with very sparse instruction signals. To ad...
详细信息
Aiming at the problem that traditional air quality prediction research ignores spatial information, this paper proposes an air quality index prediction model (SGCN-GRU) that integrates spatiotemporal information based...
详细信息
In order to improve the accuracy of PM2.5 concentration prediction, a CNN-GRU deep learning model based on fusion of Luong Attention is proposed. Firstly, the correlation between various air pollutants and meteorologi...
详细信息
Current state-of-the-art image captioning models generate captions in a single language, requiring a combination of multiple language specific models to build a multilingual image captioning system. However, as the nu...
详细信息
Recent advances in natural language processing have relied heavily on using Transformer-based language models. However, Transformers often require large parameter sizes and model depth. Existing Transformer-free appro...
详细信息
Recent advances in natural language processing have relied heavily on using Transformer-based language models. However, Transformers often require large parameter sizes and model depth. Existing Transformer-free approaches using state-space models demonstrate superiority over Transformers, yet they still lack a neuro-biologically connection to the human brain. This paper proposes LasF, representing Language tokens as Functionals of semantic fields, to simulate the neuronal behaviors for better language modeling. The LasF module is equivalent to a nonlinear approximator tailored for sequential data. By replacing the final layers of pre-trained language models with the LasF module, we obtain LasF-based models. Experiments conducted for standard reading comprehension and question-answering tasks demonstrate that the LasF-based models consistently improve accuracy with fewer parameters. Besides, we use CommonsenseQA's blind test set to evaluate a full-parameter tuned LasF-based model, which outperforms the prior best ensemble and single models by 0.4% and 3.1%, respectively. Furthermore, our LasF-only language model trained from scratch outperforms existing parameter-efficient language models on standard datasets such as WikiText103 and PennTreebank. Copyright 2024 by the author(s)
Language-image pre-training is an effective technique for learning powerful representations in general domains. However, when directly turning to person representation learning, these general pre-training methods suff...
暂无评论