检索结果-内蒙古大学图书馆

Advancing Agile Software Cost Estimation Through data Synthesis: A Comparative Analysis of Five Generation Techniques

引用

IEEE ACCESS 2025年 13卷 63219-63236页

作者： Zhao, Xiaoyan Mansor, Zulkefli Razali, Rozilawati Nazri, Mohd Zakree Ahmad Xiong, Xin Univ Kebangsaan Malaysia Fac Informat Sci & Technol Bangi 43600 Selangor Malaysia Univ Kebangsaan Malaysia Ctr Software Technol & Management Bangi 43600 Selangor Malaysia Univ Kebangsaan Malaysia Ctr Artificial Intelligence Technol Bangi 43600 Selangor Malaysia

Agile has been used in software development for over 20 years and is the preferred development method for more than 85% of software companies. However, cost estimation in agile development remains a significant challenge. This is reflected in the fact that the accuracy of estimation still needs improvement, and most cost estimation techniques still rely on the team's experience and knowledge. While machine learning algorithms have performed better in this area, the lack of sufficient agile cost data hinders large-scale training and in-depth research. To address this issue, this study selected five data generation techniques-Variational Autoencoder (VAE), Wasserstein Generative Adversarial Network (WGAN), Synthetic Minority Over-sampling Technique for Nominal and Continuous Features (SMOTE-NC), data augmentation for tabular data (augmentation), and tabular data Diffusion Probabilistic Models (TabDDPM)-based on the characteristics of agile cost data. Using cost data from 75 agile projects, these techniques were employed to generate three sets of data with sizes of 200, 500, and 1000. A performance evaluation model was created based on consistency, authenticity, diversity, and effectiveness to verify the performance of these generated data. The experimental results show that WGAN consistently scored 16 out of 20 points across all three data sets, excelling in data consistency and authenticity. SMOTE-NC and augmentation Were followed. SMOTE-NC scored 15 out of 20 points for all data sizes and performed best in terms of effectiveness, with an MMRE of 88.16% and a PRED (0.2) of 84.5%. augmentation performed the best when generating 1000 data points. These findings highlight the potential of data generation technologies, particularly WGAN, in enhancing agile cost estimation and providing guidance on selecting the appropriate amount of data. This lays a foundation for further development of machine learning algorithms in this field and offers valuable insights for other res

关键词： Costs Estimation Generative adversarial networks Training Agile software development data models Hidden Markov models data collection data augmentation Autoencoders data generation technique software cost estimation agile software development variational autoencoder Wasserstein generative adversarial network synthetic minority over-sampling technique for nominal and continuous features data augmentation for tabular data tabular data diffusion probabilistic model

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：