Search results - Inner Mongolia University Library

Bidirectional Transformer with absolute-position aware relative position encoding for encoding sentences

Frontiers of Computer Science, 2023, Vol. 17, No. 1, pp. 63-71

Authors: Le QI, Yu ZHANG, Ting LIU. Affiliation: School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China

Transformers have been widely studied in many natural language processing (NLP) tasks. Thanks to multi-head attention and the position-wise feed-forward network, they can capture dependencies across the whole sentence with high parallelizability. However, these two components are position-independent, which makes transformers weak at modeling sentence structure. Existing studies commonly use positional encoding or mask strategies to capture the structural information of sentences. In this paper, we aim to strengthen the ability of transformers to model the linear structure of sentences in three respects: the absolute position of tokens, the relative distance between tokens, and the direction between tokens. We propose a novel bidirectional Transformer with absolute-position aware relative position encoding (BiAR-Transformer) that combines positional encoding with the mask strategy. We model the relative distance between tokens, along with their absolute positions, through a novel absolute-position aware relative position encoding. Meanwhile, we apply a bidirectional mask strategy to model the direction between tokens. Experimental results on natural language inference, paraphrase identification, sentiment classification, and machine translation tasks show that BiAR-Transformer outperforms other strong baselines.

Keywords: Transformer; relative position encoding; bidirectional mask strategy; sentence encoder
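
The bidirectional mask strategy named in this record can be pictured with a small sketch. This is only an illustration and not the BiAR-Transformer implementation, whose details are not given here. It assumes that a "forward" mask lets each token attend to itself and earlier tokens, and a "backward" mask lets it attend to itself and later tokens. The function names `forward_mask`, `backward_mask`, and `masked_softmax` are hypothetical.

```python
import math


def forward_mask(n):
    """mask[i][j] is True when token i may attend to token j, with j <= i.

    Assumption: the token itself is visible in both directions.
    """
    return [[j <= i for j in range(n)] for i in range(n)]


def backward_mask(n):
    """mask[i][j] is True when token i may attend to token j, with j >= i."""
    return [[j >= i for j in range(n)] for i in range(n)]


def masked_softmax(scores, mask_row):
    """Softmax over the allowed positions; masked positions get weight 0."""
    allowed = [s for s, m in zip(scores, mask_row) if m]
    top = max(allowed)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) if m else 0.0 for s, m in zip(scores, mask_row)]
    total = sum(exps)
    return [e / total for e in exps]


# Token 1 of a 3-token sentence, forward direction: position 2 is hidden,
# so the large score at position 2 has no effect on the weights.
weights = masked_softmax([0.0, 0.0, 5.0], forward_mask(3)[1])
```

Running the same attention scores through both masks gives two direction-specific views of the sentence, which a model could then combine.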

Combating with extremely noisy samples in weakly supervised slot filling for automatic diagnosis

Frontiers of Computer Science, 2023, Vol. 17, No. 5, pp. 67-73

Authors: Xiaoming SHI, Wanxiang CHE. Affiliation: School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China

Slot filling, to extract entities for specific types of information (slot), is a vitally important module of dialogue systems for automatic *** responses can be regarded as the weak supervision of patient *** this way, a large amount of weakly labeled data can be obtained from unlabeled diagnosis dialogue, alleviating the problem of costly and time-consuming data ***, weakly labeled data suffers from extremely noisy *** alleviate the problem, we propose a simple and effective Co-WeakTeaching *** method trains two slot filling models *** two models learn from two different weakly labeled data, ensuring learning from two ***, one model utilizes selected weakly labeled data generated by the other, *** model, obtained by the Co-WeakTeaching on weakly labeled data, can be directly tested on testing data or sequentially fine-tuned on a small amount of human-annotated *** results on these two settings illustrate the effectiveness of the method with an increase of 8.03% and 14.74% in micro and macro F1 scores, respectively.

Keywords: dialogue system; slot filling; co-teaching

DeepGAN: Utilizing generative adversarial networks for improved deep learning

International Journal of Knowledge-Based and Intelligent Engineering Systems, 2024, Vol. 28, No. 4, pp. 732-748

Authors: V, Edward Naveen; A, Jenefa; T.M, Thiyagu; A, Lincy; Taurshia, Antony. Affiliations: Department of Computer Science and Engineering, Sri Shakthi Institute of Engineering and Technology, India; Department of Computer Science and Engineering, Karunya Institute of Technology and Sciences, India; Division of Computer Science and Engineering, Karunya Institute of Technology and Sciences, India; Department of Computer Science and Engineering, National Engineering College, India

In the realm of deep learning, Generative Adversarial Networks (GANs) have emerged as a topic of significant interest for their potential to enhance model performance and enable effective data augmentation. This paper addresses the existing challenges in synthesizing high-quality data and harnessing the capabilities of GANs for improved deep learning outcomes. Unlike traditional approaches that heavily rely on manually engineered data augmentation techniques, our work introduces a novel framework that leverages DeepGANs to autonomously generate diverse and high-fidelity data. Our experiments encompass a diverse spectrum of datasets, including images, text, and time series data. In the context of image classification tasks, we conduct experiments on the widely recognized CIFAR-10 dataset, which consists of 50,000 image samples. Our results demonstrate the remarkable efficacy of DeepGANs in enhancing model performance across various data domains. Notably, in image classification using the CIFAR-10 dataset, our innovative approach achieves an impressive accuracy of 97.2%. This represents a substantial advancement beyond conventional CNN models, underscoring the profound impact of DeepGANs in the realm of deep learning. In summary, this research sheds light on DeepGANs as a fundamental component in the pursuit of enhanced deep learning performance. Our framework not only overcomes existing limitations but also heralds a new era of data augmentation, with generative adversarial networks leading the way. The attainment of an accuracy rate of 97.2% on CIFAR-10 serves as a compelling testament to the transformative potential of DeepGANs, solidifying their pivotal role in the future of deep learning. This promises the development of more robust, adaptive, and accurate models across a myriad of applications, marking a significant contribution to the field. © 2024 – IOS Press. All rights reserved.

Keywords: Generative adversarial networks

An Encoding-Decoding Framework Based on CNN for circRNA-RBP Binding Sites Prediction

Chinese Journal of Electronics, 2024, Vol. 33, No. 1, pp. 256-263

Authors: Yajing GUO, Xiujuan LEI, Yi PAN. Affiliations: School of Computer Science, Shaanxi Normal University; Faculty of Computer Science and Control Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences; Department of Computer Science, Georgia State University

Predicting RNA binding protein (RBP) binding sites on circular RNAs (circRNAs) is a fundamental step toward understanding their interaction mechanism. Numerous computational methods have been developed to solve this problem, but they cannot fully learn the features. Therefore, we propose circ-CNNED, a convolutional neural network (CNN)-based encoding and decoding framework. We first adopt two encoding methods to obtain two original matrices, and preprocess them with a CNN before fusion. To capture feature dependencies, we use a temporal convolutional network (TCN) and a CNN to construct the encoding and decoding blocks, respectively. We then introduce global expectation pooling to learn latent information and enhance the robustness of circ-CNNED. We evaluate circ-CNNED on 37 datasets. The comparison and ablation experiments demonstrate that our method is superior. In addition, motif enrichment analysis on four datasets helps explain the performance improvement of circ-CNNED.

Keywords: circular RNAs (circRNAs); RNA binding proteins; convolutional neural network; temporal convolutional network; encoder-decoder network

Existence and uniqueness of mean field equilibrium in continuous bandit game

Science China (Information Sciences), 2025, Vol. 68, No. 3, pp. 395-396

Authors: Xiong WANG, Yuqing LI, Riheng JIA. Affiliations: School of Computer Science and Technology, Huazhong University of Science and Technology; School of Cyber Science and Engineering, Wuhan University; Wuhan University Shenzhen Research Institute; School of Computer Science and Technology, Zhejiang Normal University

Multiarmed bandit (MAB) models are widely used for sequential decision-making in uncertain environments, such as resource allocation in computer communication systems. A critical challenge in interactive multiagent systems with bandit feedback is to explore and understand the equilibrium state to ensure stable and tractable system performance.

An efficient algorithm for approximate Voronoi diagram construction on triangulated surfaces

Computational Visual Media, 2023, Vol. 9, No. 3, pp. 443-459

Authors: Wenlong Meng, Pengbo Bo, Xiaodong Zhang, Jixiang Hong, Shiqing Xin, Changhe Tu. Affiliations: School of Computer Science and Technology, Harbin Institute of Technology, Weihai 264209, China; School of Computer Science and Technology, Shandong University, Qingdao 266237, China

Voronoi diagrams on triangulated surfaces based on the geodesic metric play a key role in many applications of computer *** methods of constructing such Voronoi diagrams generally depended on having an exact geodesic ***, exact geodesic computation is time-consuming and has high memory usage, limiting wider application of geodesic Voronoi diagrams (GVDs). In order to overcome this issue, instead of using exact methods, we reformulate a graph method based on Steiner point insertion, as an effective way to obtain geodesic ***, since a bisector comprises hyperbolic and line segments, we utilize Apollonius diagrams to encode complicated structures, enabling Voronoi diagrams to encode a medial-axis surface for a dense set of boundary *** on these strategies, we present an approximation algorithm for efficient Voronoi diagram construction on triangulated *** also suggest a measure for evaluating similarity of our results to the exact *** our GVD results are constructed using approximate geodesic distances, we can get GVD results similar to exact results by inserting Steiner points on triangle *** results on many 3D models indicate the improved speed and memory requirements compared to previous leading methods.

Keywords: geodesic Voronoi diagrams (GVDs); triangular surfaces; mesh surfaces; approximate geodesics; Apollonius diagrams
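
The graph-based idea in this record (approximating geodesic distance by shortest paths over mesh vertices and inserted Steiner points) can be sketched with a multi-source Dijkstra search. This is a generic illustration, not the authors' algorithm: it labels each graph node with its nearest site and does not handle bisectors or Apollonius diagrams. The function name `graph_voronoi` and the adjacency-list input format are assumptions made for the example.

```python
import heapq


def graph_voronoi(adjacency, sites):
    """Label every node with its nearest site by shortest-path distance.

    adjacency: {node: [(neighbor, edge_length), ...]} for an undirected graph.
    sites: iterable of nodes acting as Voronoi generators.
    Returns (dist, owner), two dicts keyed by node.
    """
    dist = {node: float("inf") for node in adjacency}
    owner = {node: None for node in adjacency}
    heap = []
    for site in sites:
        dist[site] = 0.0
        owner[site] = site
        heapq.heappush(heap, (0.0, site, site))
    while heap:
        d, node, site = heapq.heappop(heap)
        if d > dist[node]:
            continue  # stale queue entry, a shorter path was already found
        for neighbor, length in adjacency[node]:
            candidate = d + length
            if candidate < dist[neighbor]:
                dist[neighbor] = candidate
                owner[neighbor] = site
                heapq.heappush(heap, (candidate, neighbor, site))
    return dist, owner


# A path a-b-c-d-e with edge lengths 1, 1, 2, 1 and sites at both ends.
path = {
    "a": [("b", 1.0)],
    "b": [("a", 1.0), ("c", 1.0)],
    "c": [("b", 1.0), ("d", 2.0)],
    "d": [("c", 2.0), ("e", 1.0)],
    "e": [("d", 1.0)],
}
dist, owner = graph_voronoi(path, ["a", "e"])
```

On a real mesh, adding Steiner points along triangle edges makes the graph denser, so the shortest-path distances come closer to true geodesic distances at the cost of more nodes.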

A verifiable multi-secret image sharing scheme based on DNA encryption

Multimedia Tools and Applications, 2025, Vol. 84, No. 4, pp. 1967-1983

Authors: Chattopadhyay, Arup Kumar; Saha, Sanchita; Nag, Amitava; Singh, Jyoti Prakash. Affiliations: Department of Computer Science and Engineering, Central Institute of Technology Kokrajhar, Kokrajhar, India; Department of Computer Science and Engineering, National Institute of Technology Patna, Bihar, Patna, India

A multi-secret image sharing (MSIS) scheme facilitates the secure distribution of multiple images among a group of participants. Several MSIS schemes have been proposed with an (n, n) structure that encodes secret input images into n share images. This encoding ensures that if only a proper subset of the participants collude, no information about the secret images is revealed. The secret images can only be recovered if all participants cooperate. Most of these schemes are built using basic Boolean operations, primarily XOR operations. MSIS schemes that rely on Boolean logic offer various benefits compared to other methods, such as the ability to recover data without any loss, no increase in pixel size, and efficient computation. This article presents a verifiable MSIS scheme that relies on a secure two-variable one-way function and a secure hash function. The scheme allows for independent verification of both the dealer and the participants to detect any cheating attempts. The proposed scheme utilizes the deoxyribonucleic acid (DNA) encryption technique to leverage the inherent benefits of DNA computing, such as enhanced security, fast computation, minimal storage needs, and low power usage. The share images produced by the proposed scheme exhibit an average entropy of 7.99, indicating that the generated 8-bit share images are completely random. Additionally, the correlation coefficient is approximately 0.005, suggesting that neighboring pixels in the images are not related to each other. The proposed scheme can detect any instance of cheating, whether it is done by the dealer or by a participant. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

Keywords: DNA
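
The (n, n) XOR construction that this record says most Boolean MSIS schemes build on can be sketched in a few lines. This is a minimal single-secret illustration over raw bytes, not the proposed scheme: it omits the multi-secret handling, the verification step, and the DNA encoding. The function names `make_shares`, `recover`, and `byte_entropy` are hypothetical. The entropy helper shows how a figure such as the reported 7.99 bits would be measured for 8-bit data.

```python
import math
import secrets
from collections import Counter


def xor_bytes(left, right):
    """Bytewise XOR of two equal-length byte strings."""
    return bytes(a ^ b for a, b in zip(left, right))


def make_shares(secret, n):
    """Split `secret` into n shares; all n are needed to recover it."""
    if n < 2:
        raise ValueError("need at least two shares")
    # n - 1 shares are uniformly random; the last one is the secret
    # XORed with all of them.
    shares = [secrets.token_bytes(len(secret)) for _ in range(n - 1)]
    last = secret
    for share in shares:
        last = xor_bytes(last, share)
    shares.append(last)
    return shares


def recover(shares):
    """XOR all shares together to obtain the secret."""
    result = bytes(len(shares[0]))
    for share in shares:
        result = xor_bytes(result, share)
    return result


def byte_entropy(data):
    """Shannon entropy in bits per byte; 8.0 is the maximum for 8-bit data."""
    counts = Counter(data)
    total = len(data)
    return -sum(c / total * math.log2(c / total) for c in counts.values())


pixels = b"example pixel data"
shares = make_shares(pixels, 4)
restored = recover(shares)
```

Recovery is lossless and each share has the same size as the secret, which matches the two benefits the abstract attributes to Boolean schemes.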

An efficient schedulability analysis based on worst-case interference time for real-time systems

Science China (Information Sciences), 2024, Vol. 67, No. 9, pp. 88-104

Authors: Hongbiao LIU, Mengfei YANG, Lei QIAO, Xi CHEN, Jian GONG. Affiliations: School of Computer Science and Technology, Xidian University; China Academy of Space Technology; Beijing Institute of Control Engineering

Real-time systems are widely implemented in the Internet of Things (IoT) and safety-critical systems, both of which have generated enormous social value. Aiming at the classic schedulability analysis problem in real-time systems, we proposed an exact Boolean analysis based on interference (EBAI) for schedulability analysis in real-time systems. EBAI is based on worst-case interference time (WCIT), which considers both the release jitter and blocking time of the task. We improved the efficiency of the three existing tests and provided a comprehensive summary of related research results in the field. Abundant experiments were conducted to compare EBAI with other related results. Our evaluation showed that in certain cases, the runtime gain achieved using our analysis method may exceed 73% compared to the state-of-the-art schedulability test. Furthermore, the benefits obtained from our tests grew with the number of tasks, reaching a level suitable for practical application. EBAI is oriented to the five-tuple real-time task model with stronger expression ability and possesses a low runtime overhead. These characteristics make it applicable in various real-time systems such as spacecraft, autonomous vehicles, industrial robots, and traffic command systems.

Keywords: five-tuple real-time task model; real-time system; spacecraft; Internet of Things; exact schedulability analysis; worst-case interference time
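
For context on what a schedulability test with release jitter and blocking time computes, here is a sketch of the classic fixed-priority response-time analysis, which iterates w = C + B + sum over higher-priority tasks of ceil((w + J_j) / T_j) * C_j to a fixed point. This is the textbook recurrence, not EBAI, whose details are not given in this record. The `Task` fields are an assumption about what a five-parameter task description might contain (execution time, period, deadline, jitter, blocking), and `response_time` is a hypothetical name.

```python
from dataclasses import dataclass


@dataclass
class Task:
    wcet: int          # worst-case execution time C
    period: int        # minimum inter-arrival time T
    deadline: int      # relative deadline D
    jitter: int = 0    # release jitter J
    blocking: int = 0  # blocking time B from lower-priority tasks


def response_time(tasks, index):
    """Worst-case response time of tasks[index], or None if it misses D.

    `tasks` must be ordered by priority, highest first.
    """
    task = tasks[index]
    window = task.wcet + task.blocking
    while True:
        # Interference from each higher-priority task, using integer ceiling.
        interference = sum(
            -(-(window + hp.jitter) // hp.period) * hp.wcet
            for hp in tasks[:index]
        )
        new_window = task.wcet + task.blocking + interference
        if new_window + task.jitter > task.deadline:
            return None  # deadline exceeded, task is unschedulable
        if new_window == window:
            return new_window + task.jitter  # fixed point reached
        window = new_window


task_set = [Task(1, 4, 4), Task(2, 6, 6), Task(3, 12, 12)]
results = [response_time(task_set, i) for i in range(len(task_set))]
```

The iteration either converges or overshoots the deadline, so it always terminates. Its cost grows with the number of tasks, which is the kind of runtime that faster exact tests aim to reduce.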

Two-Stage Approach for Targeted Knowledge Transfer in Self-Knowledge Distillation

IEEE/CAA Journal of Automatica Sinica, 2024, Vol. 11, No. 11, pp. 2270-2283

Authors: Zimo Yin, Jian Pu, Yijie Zhou, Xiangyang Xue. Affiliations: School of Computer Science, Fudan University; Institute of Science and Technology for Brain Inspired Intelligence, Fudan University; IEEE

Knowledge distillation (KD) enhances student network generalization by transferring dark knowledge from a complex teacher network. To optimize computational expenditure and memory utilization, self-knowledge distillation (SKD) extracts dark knowledge from the model itself rather than an external teacher network. However, previous SKD methods performed distillation indiscriminately on full datasets, overlooking the analysis of representative samples. In this work, we present a novel two-stage approach to providing targeted knowledge on specific samples, named two-stage approach self-knowledge distillation (TOAST). We first soften the hard targets using class medoids generated based on logit vectors per class. Then, we iteratively distill the under-trained data with past predictions of half the batch size. The two-stage knowledge is linearly combined, efficiently enhancing model performance. Extensive experiments conducted on five backbone architectures show our method is model-agnostic and achieves the best generalization ***, TOAST is strongly compatible with existing augmentation-based regularization methods. Our method also obtains a speedup of up to 2.95x compared with a recent state-of-the-art method.

Keywords: cluster-based regularization; iterative prediction refinement; model-agnostic framework; self-knowledge distillation (SKD); two-stage knowledge transfer
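
The "dark knowledge" transfer this record refers to is usually expressed as a divergence between temperature-softened class distributions. The sketch below shows that standard distillation loss only. It is not TOAST: the class-medoid softening and the iterative stage are not reproduced. In a self-distillation setting the "teacher" logits would come from the model's own earlier predictions. The function names and the default temperature are assumptions chosen for the example.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature gives softer targets."""
    scaled = [z / temperature for z in logits]
    top = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(z - top) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=4.0):
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    teacher = softmax(teacher_logits, temperature)
    student = softmax(student_logits, temperature)
    kl = sum(
        p * math.log(p / q) for p, q in zip(teacher, student) if p > 0.0
    )
    return temperature ** 2 * kl


past_prediction = [2.0, 0.5, -1.0]     # stand-in for earlier model output
current_prediction = [1.0, 1.0, -0.5]  # stand-in for current model output
loss = distillation_loss(current_prediction, past_prediction)
```

The loss is zero when both sets of logits agree and grows as the softened distributions diverge. In training, it would typically be combined linearly with the ordinary cross-entropy term.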

MLRT-UNet: An Efficient Multi-Level Relation Transformer Based U-Net for Thyroid Nodule Segmentation

Computer Modeling in Engineering & Sciences, 2025, Vol. 143, No. 4, pp. 413-448

Authors: Kaku Haribabu, Prasath R, Praveen Joe IR. Affiliations: Department of Computer Science and Engineering, RMK College of Engineering and Technology, Tiruvallur 601206, India; School of Computer Science and Engineering, Vellore Institute of Technology, Chennai 600127, India

Thyroid nodules, a common disorder in the endocrine system, require accurate segmentation in ultrasound images for effective diagnosis and ***, achieving precise segmentation remains a challenge due to various factors, including scattering noise, low contrast, and limited resolution in ultrasound *** existing segmentation models have made progress, they still suffer from several limitations, such as high error rates, low generalizability, overfitting, limited feature learning capability, *** address these challenges, this paper proposes a Multi-level Relation Transformer-based U-Net (MLRT-UNet) to improve thyroid nodule *** MLRT-UNet leverages a novel Relation Transformer, which processes images at multiple scales, overcoming the limitations of traditional encoding *** transformer integrates both local and global features effectively through self-attention and cross-attention units, capturing intricate relationships within the *** approach also introduces a Co-operative Transformer Fusion (CTF) module to combine multi-scale features from different encoding layers, enhancing the model's ability to capture complex patterns in the ***, the Relation Transformer block enhances long-distance dependencies during the decoding process, improving segmentation *** results show that the MLRT-UNet achieves high segmentation accuracy, reaching 98.2% on the Digital Database Thyroid Image (DDT) dataset, 97.8% on the Thyroid Nodule 3493 (TG3K) dataset, and 98.2% on the Thyroid Nodule3K (TN3K) *** findings demonstrate that the proposed method significantly enhances the accuracy of thyroid nodule segmentation, addressing the limitations of existing models.

Keywords: thyroid nodules; endocrine system; multi-level relation transformer; U-Net; self-attention; external attention; co-operative transformer fusion; thyroid nodules segmentation
