Search Results - Inner Mongolia University Library

4th IEEE/ACM International Conference on Automation of Software Test (AST)

Authors: Wu, Xiaoxue; Shan, Wenjing; Zheng, Wei; Chen, Zhiguo; Ren, Tao; Sun, Xiaobing. Affiliations: Yangzhou Univ, Sch Informat Engn, Yangzhou, Jiangsu, Peoples R China; Northwestern Polytech Univ, Sch Software, Xian, Shaanxi, Peoples R China

ISBN: (print) 9798350324020

Bug reports, the bug description data generated during the software maintenance cycle, are often written hastily by different users, which produces many redundant and duplicate bug reports (DBRs). When DBRs are repeatedly assigned to developers, human resources are wasted, especially in large-scale open-source projects. Many researchers have studied DBR detection and proposed a series of detection methods, but the performance of DBR prediction still leaves much room for improvement. This paper therefore proposes a new DBR detection method based on technical term extraction, called CTEDB (Combination of Term Extraction and DeBERTaV3). The method first extracts technical terms from the text of bug reports using the Word2Vec and TextRank algorithms. It then calculates the semantic similarity of technical terms between different bug reports by combining the Word2Vec and SBERT models. Finally, it completes the DBR detection task using the DeBERTaV3 model. The experimental results show that CTEDB performs well in detecting DBRs and improves accuracy, F1-score, recall, and precision over the baseline approaches.

Keywords: duplicate bug reports detection; automatic term extraction; DeBERTaV3
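
The term-extraction step described in the abstract above can be illustrated with a minimal TextRank sketch in Python. It is not the authors' CTEDB implementation: the stopword list, window size, damping factor, sample report, and the function names `tokenize` and `textrank_terms` are all assumptions made for illustration, and the Word2Vec, SBERT, and DeBERTaV3 stages are omitted.

```python
import re
from collections import defaultdict

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "and", "when", "with", "after"}


def tokenize(text):
    """Lowercase, split on non-alphanumerics, drop stopwords and tokens under 3 chars."""
    return [w for w in re.findall(r"[a-z0-9]+", text.lower())
            if w not in STOPWORDS and len(w) > 2]


def textrank_terms(text, window=3, damping=0.85, iterations=50, top_k=5):
    """Rank words by PageRank over an undirected word co-occurrence graph."""
    words = tokenize(text)
    neighbors = defaultdict(set)
    # Link each word to the words that follow it inside the sliding window.
    for i, w in enumerate(words):
        for other in words[i + 1:i + window]:
            if other != w:
                neighbors[w].add(other)
                neighbors[other].add(w)
    scores = {w: 1.0 for w in neighbors}
    # Fixed number of PageRank update rounds (no convergence check, for brevity).
    for _ in range(iterations):
        scores = {
            w: (1 - damping) + damping * sum(
                scores[n] / len(neighbors[n]) for n in neighbors[w])
            for w in neighbors
        }
    # Sort by score, breaking ties alphabetically so the output is deterministic.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]


report = ("Editor crashes with NullPointerException when saving file. "
          "The crash happens in the editor after the file save dialog closes; "
          "saving any file triggers the NullPointerException.")
print(textrank_terms(report))
```

Words that co-occur with many different neighbors, such as "file" in the sample report, receive higher scores. A real pipeline would likely filter candidates further, for example by part of speech or by embedding similarity, before comparing terms across reports.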

Automated Duplicate Bug Report Detection Using Multi-Factor Analysis

IEICE Transactions on Information and Systems, 2016, Vol. E99D, No. 7, pp. 1762-1775

Authors: Zou, Jie; Xu, Ling; Yang, Mengning; Zhang, Xiaohong; Zeng, Jun; Hirokawa, Sachio. Affiliations: Chongqing Univ, Sch Software Engn, Chongqing 401331, Peoples R China; Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 400044, Peoples R China; Kyushu Univ, Res Inst Informat Technol, Fukuoka 8128581, Japan

Bug reports written in natural language are often voluminous, ambiguous, and poorly written, which makes duplicate bug report detection challenging. Current automatic duplicate bug report detection techniques focus mainly on textual information and ignore other useful factors. To improve detection accuracy, this paper proposes a new approach called the LNG (LDA and N-gram) model, which takes advantage of both the topic model LDA and the word-based N-gram model. LNG considers multiple factors that potentially affect detection accuracy, including textual information, semantic correlation, word order, contextual connections, and categorial information. In addition, the N-gram model adopted in LNG is improved by modifying its similarity algorithm. The experiment is conducted on more than 230,000 real bug reports from the Eclipse project. For the evaluation, we propose a new metric, the exact-accuracy (EA) rate, which helps in understanding the performance of duplicate detection. The evaluation results show that the recall rate, precision rate, and EA rate of the proposed method are all higher than when the two models are applied separately. The recall rate also improves by 2.96%-10.53% compared with the state-of-the-art approach DBTM.

Keywords: duplicate bug reports detection; topic model; LDA; N-gram; LNG
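
As a rough illustration of how a word-based N-gram model captures word order, the Python sketch below scores two reports by the Dice coefficient over their word bigrams. This is a generic baseline under stated assumptions, not the modified similarity algorithm of LNG, which the abstract does not specify. The function names `word_ngrams` and `ngram_dice` and the sample reports are hypothetical.

```python
import re


def word_ngrams(text, n=2):
    """Return the set of word n-grams (as tuples) in a lowercased text."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def ngram_dice(a, b, n=2):
    """Dice coefficient over word n-gram sets: 2|A & B| / (|A| + |B|)."""
    grams_a, grams_b = word_ngrams(a, n), word_ngrams(b, n)
    if not grams_a or not grams_b:
        return 0.0
    return 2 * len(grams_a & grams_b) / (len(grams_a) + len(grams_b))


# Hypothetical reports, invented for this sketch.
r1 = "editor crashes when saving file"
r2 = "editor crashes when opening file"   # same word order, one word differs
r3 = "file saving when crashes editor"    # same words as r1, reversed order

print(ngram_dice(r1, r2))  # 0.5: two of the four bigrams on each side match
print(ngram_dice(r1, r3))  # 0.0: identical vocabulary, but no bigram in common
```

A bag-of-words comparison would treat `r1` and `r3` as identical, whereas the bigram score separates them. Sensitivity to word order is one of the factors the abstract lists.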

Duplication Detection for Software Bug Reports Based on Topic Model

9th International Conference on Service Science, ICSS 2016

Authors: Zou, Jie; Xu, Ling; Yang, Mengning; Yan, Meng; Yang, Dan; Zhang, Xiaohong. Affiliations: School of Software Engineering, Chongqing University, Chongqing 401331, China; Key Laboratory of Dependable Service Computing in Cyber Physical Society, Ministry of Education, Chongqing 400044, China

ISBN: (print) 9781509027286

Traditional duplicate bug report detection approaches are usually based on the vector space model. However, the experimental results are rarely satisfactory, because this method cannot capture the semantic correlation among bug reports written in natural language. A topic model, as a method for modeling the underlying topics of texts, can address this weakness of the document similarity calculations used in information retrieval. It can discover semantic topics in texts from massive training data and obtain the semantic relatedness among documents. This paper therefore proposes a novel duplicate detection method based on a topic model. By selecting bug reports with execution information and combining them with the classification information of bugs, the new method not only overcomes the problems of high dimensionality, sparse data, and heavy noise, but also avoids the problems of synonymy and ambiguity in natural language. Compared with the traditional SVM method, the recall and precision rates of the proposed approach increase noticeably, which indicates the effectiveness of the new method. © 2016 IEEE.

Keywords: duplicate bug reports detection; execution information; topic model; vector space model
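
The limitation of the vector space model mentioned in the abstract above can be seen in a small TF-IDF example. The Python sketch below is an illustration under stated assumptions, not the paper's baseline: it uses a smoothed IDF variant, the reports are invented, and the helper names `tfidf_vectors` and `cosine` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-alphanumerics."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Build sparse TF-IDF vectors (dicts) using a smoothed IDF (an assumption)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}
    return [{t: tf * idf[t] for t, tf in Counter(tokens).items()}
            for tokens in tokenized]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Hypothetical reports, invented for this sketch.
reports = [
    "application crashes on startup",
    "program terminates unexpectedly at launch",  # same problem, different words
    "application crashes on shutdown",            # different problem, shared words
]
v0, v1, v2 = tfidf_vectors(reports)
print(cosine(v0, v1))  # 0.0: no shared terms, so the paraphrase is missed
print(cosine(v0, v2))  # higher score, although the reports describe different bugs
```

Because the model compares surface terms only, the paraphrased report scores zero while an unrelated report with overlapping vocabulary scores higher. This is the synonymy problem that topic models are meant to reduce.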
