Synthetic Aperture Radar (SAR) image speckle is a type of image noise caused by the coherence of terrain, which can degrade the resolution and detection capability of SAR images. To address this challenge, this paper ...
Federated learning (FL) is widely applied in edge computing scenarios to protect data privacy, but the assumption of independent and identically distributed (IID) data across clients often does not hold in practice, l...
Underwater target detection is an important method for marine life detection. However, the accuracy of target detection and recognition is affected by the problems of image occlusion, blurred water quality and complex...
Code generation aims to generate code in a general-purpose programming language, such as C++, based on natural language intents. Existing efforts primarily focus on relatively simple programming problems and fail to e...
In this opinion piece, we question the efficacy of students conducting systematic reviews (SRs) at the very start of their PhDs, especially now that we are riding, or drowning in, the Generative AI wave. How would the...
Electroencephalography (EEG) data is invaluable for investigating EEG physiology, neuroscience and clinical medicine. However, EEG signals are characterized by temporal nature, high dimensionality, and noise, which ma...
In recent years, the increasing interest in ontologies has resulted in the development and publication of many ontologies in the same or different domains. When users try to reuse the existing ontologies in their applicatio...
ISBN: 9798350330663 (print)
Code summarization aims to generate natural language descriptions of source code, helping programmers understand and maintain it rapidly. While previous code summarization efforts have predominantly focused on the method level, this paper studies file-level code summarization, which can assist programmers in understanding and maintaining large source code projects. Unlike method-level code summarization, file-level code summarization typically involves long source code within a single file. This makes it challenging for Transformer-based models to understand the code semantics: because computational complexity scales quadratically with the input sequence length, the maximum input length of these models is difficult to set large enough to handle long code input well. To address this challenge, we propose SparseCoder, an identifier-aware sparse transformer for effectively handling long code sequences. Specifically, SparseCoder employs a sliding window mechanism for self-attention to model short-term dependencies and leverages the structural information of code to capture long-term dependencies among source code identifiers by introducing two types of sparse attention patterns named global and identifier attention. To evaluate the performance of SparseCoder, we construct a new dataset, FILE-CS, for file-level code summarization in Python. Experimental results show that our SparseCoder model achieves state-of-the-art performance compared with other pre-trained models, including full self-attention and sparse models. Additionally, our model has low memory overhead and achieves performance comparable to models using a full self-attention mechanism. Furthermore, we verify the generality of SparseCoder on other code understanding tasks, i.e., code clone detection and code search, and results show that our model outperforms baseline models in both tasks, demonstrating that our model can generate better code representations for various downstream tasks. Our
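As a rough illustration of the three attention patterns named in this abstract (sliding window, global, and identifier attention), the following is a minimal sketch of how such a sparse mask might be built. It is not the SparseCoder implementation: the function names, the window size, the choice of global token positions, and the rule that identifier tokens attend to every other identifier token are all assumptions made for this example.

```python
def sparse_attention_mask(n, window, global_idx, identifier_idx):
    """Return an n x n boolean mask; mask[i][j] is True if token i may attend to token j.

    All parameters are hypothetical and chosen for illustration only:
      window          half-width of the sliding window (local attention)
      global_idx      positions that attend to, and are attended by, every token
      identifier_idx  positions of identifier tokens, which attend to each other
    """
    global_set = set(global_idx)
    identifier_set = set(identifier_idx)
    mask = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            is_local = abs(i - j) <= window                  # short-term dependencies
            is_global = i in global_set or j in global_set   # global attention
            is_identifier = i in identifier_set and j in identifier_set  # long-range identifier links
            mask[i][j] = is_local or is_global or is_identifier
    return mask


def count_pairs(mask):
    """Number of (query, key) pairs that are actually computed under the mask."""
    return sum(sum(row) for row in mask)


# Toy sequence of 12 tokens: position 0 is global, positions 3 and 9 are identifiers.
mask = sparse_attention_mask(n=12, window=1, global_idx=[0], identifier_idx=[3, 9])
full_pairs = 12 * 12
sparse_pairs = count_pairs(mask)
```

In this toy setting the mask keeps far fewer than the `n * n` pairs that full self-attention would compute, while still letting the two distant identifier tokens attend to each other. With a fixed window and a small number of global tokens, the local and global terms grow roughly linearly in `n`, which is the intuition behind avoiding the quadratic cost described above.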
Medical image segmentation is pivotal in computer-aided diagnosis systems, demanding high precision and contextual understanding. Vision Transformer-based approaches have gained much attention recently due to their ex...
In the era of rapid development of artificial intelligence technologies, traditional teaching models are unable to meet the employment needs of enterprises, and talent cultivation in universities faces more challenges...