检索结果-内蒙古大学图书馆

HGNNLink: recovering requirements-code traceability links with text and dependency-aware heterogeneous graph neural networks

引用

AUTOMATED SOFTWARE ENGINEERING 2025年第2期32卷

作者： Wang, Bangchao Zou, Zhiyuan Liang, Xuanxuan Jin, Huan Liang, Peng Wuhan Text Univ Sch Comp Sci & Artificial Intelligence Wuhan 430200 Hubei Peoples R China Wuhan Text Univ Engn Res Ctr Hubei Prov Clothing Informat Wuhan 430200 Hubei Peoples R China Wuhan Univ Sch Comp Sci Wuhan 430072 Hubei Peoples R China Hubei Luojia Lab Wuhan 430072 Hubei Peoples R China

Manually recovering traceability links between requirements and code artifacts often consumes substantial human resources. To address this, researchers have proposed automated methods based on textual similarity between requirements and code artifacts, such as information retrieval (IR) and pre-trained models, to determine whether traceability links exist between requirements and code artifacts. However, in the same system, developers often follow similar naming conventions and repeatedly use the same frameworks and template code, resulting in high textual similarity between code artifacts that are functionally unrelated. This makes it difficult to accurately identify the corresponding code artifacts for requirements artifacts solely based on textual similarity. Therefore, it is necessary to leverage the dependency relationships between code artifacts to assist in the requirements-code traceability link recovery process. Existing methods often treat dependency relationships as a post-processing step to refine textual similarity, overlooking the importance of textual similarity and dependency relationships in generating requirements-code traceability links. To address these limitations, we proposed Heterogeneous Graph Neural Network Link (HGNNLink), a requirements traceability approach that uses vectors generated by pre-trained models as node features and considers IR similarity and dependency relationships as edge features. By employing a heterogeneous graph neural network, HGNNLink aggregates and dynamically evaluates the impact of textual similarity and code dependencies on link generation. The experimental results show that HGNNLink improves the average F1 score by 13.36% compared to the current state-of-the-art (SOTA) method GA-XWcode in a dataset collected from ten open source software (OSS) projects. HGNNLink can extend IR methods by using high similarity candidate links as edges, and the extended HGNNLink achieves a 2.48% improvement in F1 compared to the ori

关键词： Traceability link recovery Heterogeneous graph neural network Pre-trained model code dependency Information retrieval

来源：评论

学校读者我要写书评

暂无评论

Cross-Language Dependencies: An Empirical Study of Kotlin-Java 24

Cross-Language Dependencies: An Empirical Study of Kotlin-Ja...

引用

18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, ESEM 2024

作者： Feng, Qiong Ji, Huan Ma, Xiaotian Liang, Peng School of Computer Science Nanjing University of Science and Technology Nanjing China School of Computer Science Wuhan University Wuhan China

ISBN: (纸本)9798400710476

Background: Since Google introduced Kotlin as an official programming language for developing Android apps in 2017, Kotlin has gained widespread adoption in Android development. The interoperability of Java and Kotlin's design nature allows them to coexist and interact with each other smoothly within a project. Aims: However, there is limited research on how Java and Kotlin interact with each other in real-world projects and what challenges are faced during these interactions. The answers to these questions are key to understanding these kinds of cross-language software systems. Methods: In this paper, we implemented a tool named DependExtractor, which can extract 11 kinds of Kotlin-Java dependencies, and conducted an empirical study of 23 Kotlin-Java real-world projects with 3,227 Java and 8,630 Kotlin source files. Results: Our findings revealed that Java and Kotlin frequently interact with each other in these cross-language projects, with access and call dependency types being the most dominant. Compared to files interacting with other files in the same language, Java/Kotlin source files, which participate in the cross-language interactions, undergo more commits. Additionally, among all Kotlin-Java problematic interactions, we identified seven common mistakes, along with their fixing strategies. Conclusions: The findings of this study can help developers understand and address the challenges in Kotlin-Java projects. © 2024 ACM.

关键词： code dependency

来源：评论

学校读者我要写书评

暂无评论

Enhancing requirements-to-code traceability with GA-XWcode: Integrating XGBoost, Node2Vec, and genetic algorithms for improving model performance and stability

引用

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES 2024年第8期36卷

作者： Zou, Zhiyuan Wang, Bangchao Hu, Xinrong Deng, Yang Wan, Hongyan Jin, Huan Wuhan Text Univ Sch Comp Sci & Artificial Intelligence Wuhan Peoples R China Wuhan Text Univ Engn Res Ctr Hubei Prov Clothing Informat Wuhan Peoples R China

This study addresses the challenge of requirements-to-code traceability by proposing a novel model, Genetic Algorithm-XGBoost With code dependency (GA-XWcode), which integrates eXtreme Gradient Boosting (XGBoost) with a Node2Vec model-weighted code dependency strategy and genetic algorithms for parameter optimisation. XGBoost mitigates overfitting and enhances model stability, while Node2Vec improves prediction accuracy for low-confidence links. Genetic algorithms are employed to optimise model parameters efficiently, reducing the resource intensity of traditional methods. Experimental results show that GA-XWcode outperforms the state-of-the-art method TRAceability lInk cLassifier (TRAIL) by 17.44% and Deep Forest for Requirement traceability (DF4RT) by 33.36% in terms of average F1 performance across four datasets. It is significantly superior to all baseline methods at a confidence level of <0.01 and demonstrates exceptional performance and stability across various training data scales.

关键词： Requirement traceability recovery code dependency Automated parameter configuration XGBoost Genetic algorithm Node2Vec

来源：评论

学校读者我要写书评

暂无评论

A novel approach for automatic remodularization of software systems using extended ant colony optimization algorithm

引用

INFORMATION AND SOFTWARE TECHNOLOGY 2019年第Oct.期114卷 107-120页

作者： Varghese, Bright Gee R. Raimond, Kumudha Lovesum, Jeno Karunya Inst Technol & Sci Coimbatore Tamil Nadu India

Context Software modularization is extremely important to streamline the inner structure of the program modules without influencing its core functionality. As the framework advances during the upkeep stage, the pristine design of the software package gets disintegrated and hence it is arduous to understand and maintain. There are many existing approaches being carried out to automatically remodularize using optimization techniques to ease the maintenance and improve the quality of the system. The outcomes are rather insufficiently optimal and depend on problem-specific operators, which in turn expands the time multifaceted nature to land at an answer. Apart from these limitations, the issues, such as time complexity, scalability and performance need to be addressed. Objective: In this paper, an efficient automatic software remodularization using extended Ant Colony Optimization (ACO) has been proposed to remodularize the software systems. Method: The proposed approach mainly includes two phases: optimised traversal of software system using ACO for finding the order of software files to be processed and remodularization of software system using the proposed approach of extended ACO. Results: We experimented our proposed approach on seven software systems. The performance is evaluated by using Turbo modularization quality (MQ) which supports Module dependency graph (MDG) that have edge weights. The time complexity of remodularized software system is evaluated based on number of Turbo MQ. Conclusion It can be concluded that when the performance has been compared with the subsisting methodologies, for example, Genetic algorithm (GA), Hill climbing (HC) and Interactive genetic algorithms (I-GM), the proposed approach has higher Turbo MQ value with lesser time complexity in the evaluated software systems.

关键词： Remodularization Ant colony optimization Turbo modularization quality Software system code dependency

来源：评论

学校读者我要写书评

暂无评论

Bilateral dependency Neural Networks for Cross-Language Algorithm Classification 26

Bilateral Dependency Neural Networks for Cross-Language Algo...

引用

26th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)

作者： Bui, Nghi D. Q. Yu, Yijun Jiang, Lingxiao Singapore Management Univ Sch Informat Syst Singapore Singapore Open Univ Ctr Res Comp Milton Keynes Bucks England

ISBN: (纸本)9781728105918

Algorithm classification is to automatically identify the classes of a program based on the algorithm(s) and/or data structure(s) implemented in the program. It can be useful for various tasks, such as code reuse, code theft detection, and malware detection. code similarity metrics, on the basis of features extracted from syntax and semantics, have been used to classify programs. Such features, however, often need manual selection effort and are specific to individual programming languages, limiting the classifiers to programs in the same language. To recognize the similarities and differences among algorithms implemented in different languages, this paper describes a framework of Bilateral Neural Networks (Bi-NN) that builds a neural network on top of two underlying sub-networks, each of which encodes syntax and semantics of code in one language. A whole Bi-NN can be trained with bilateral programs that implement the same algorithms and/or data structures in different languages and then be applied to recognize algorithm classes across languages. We have instantiated the framework with several kinds of token-, tree-and graph-based neural networks that encode and learn various kinds of information in code. We have applied the instances of the framework to a code corpus collected from GitHub containing thousands of Java and C++ programs implementing 50 different algorithms and data structures. Our evaluation results show that the use of Bi-NN indeed produces promising algorithm classification results both within one language and across languages, and the encoding of dependencies from code into the underlying neural networks helps improve algorithm classification accuracy further. In particular, our custom-built dependency trees with tree-based convolutional neural networks achieve the highest classification accuracy among the different instances of the framework that we have evaluated. Our study points to a possible future research direction to tailor bilateral and mu

关键词： cross-language mapping program classification algorithm classification code embedding code dependency neural network bilateral neural network

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：