ClassSum: a deep learning model for class-level code summarization

Authors: Li, Mingchen; Yu, Huiqun; Fan, Guisheng; Zhou, Ziyi; Huang, Jiawen

Affiliations: East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai, Peoples R China; Shanghai Engn Res Ctr Smart Energy, Shanghai, Peoples R China; Shanghai Comp Software Tech Dev Ctr, Shanghai Key Lab Comp Software Evaluating & Testi, Shanghai, Peoples R China

Publication: NEURAL COMPUTING & APPLICATIONS

Year, volume, issue: 2023, Vol. 35, No. 4

Pages: 3373-3393

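Subject classification: 08 [Engineering]; 0812 [Engineering: Computer Science and Technology (engineering or science degrees may be conferred)]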

Funding: National Natural Science Foundation of China; Shanghai Natural Science Foundation [21ZR1416300]; Capacity building project of local universities Science and Technology Commission of Shanghai Municipality

Keywords: Program comprehension; Code summarization; Class documentation; Deep learning

Abstract: Code summaries are clear and concise natural language descriptions of program entities. Meaningful code summaries help developers understand code better. Code summarization is the task of generating a natural language summary from a code snippet. Most research on code summarization focuses on automatically generating summaries for methods or functions. However, in an object-oriented language such as Java, the class, rather than the method, is the basic programming unit. To fill this gap, we investigate how to generate summaries for Java classes using deep learning-based approaches. We propose a novel encoder-decoder model called ClassSum to generate functionality descriptions for Java classes, and we build a dataset containing 172,639 pairs from 3185 repositories hosted on GitHub. Since the code of a class is much longer and more complicated than that of a method, encoding a whole class via a neural network is more challenging than encoding a method. On the other hand, the content within a class may be incomplete. To overcome this difficulty, we reduce the code of a class by keeping only its key elements, namely class signatures, method signatures and attribute names. To utilize both lexical and structural information of the code, our model takes the token sequence and the abstract syntax tree of the reduced class content as inputs. ClassSum and five baselines (designed for method-level code summarization) are evaluated on our dataset. Experimental results show that summaries generated by ClassSum are more accurate and readable than those generated by the baselines. Our dataset is available at https://***/classsum/ClassSum.
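
The class-reduction step described in the abstract (keeping only class signatures, method signatures and attribute names) can be illustrated with a rough sketch. This is not the authors' implementation: the function name `reduce_class`, the output dictionary layout, and the brace-counting heuristic are all assumptions made for illustration. The sketch assumes a single top-level class with no comments, annotations with arguments, array initializers, nested classes, or string literals containing braces; a real pipeline would more likely use a proper Java parser.

```python
def reduce_class(source: str) -> dict:
    """Hypothetical reducer: keep the class signature, method signatures
    and attribute names of one Java class, discarding method bodies."""
    reduced = {"class_signature": None, "attributes": [], "method_signatures": []}
    depth = 0   # current brace nesting: 0 = outside class, 1 = class body
    buf = []    # characters of the declaration currently being read

    for ch in source:
        if ch == "{":
            decl = " ".join("".join(buf).split())  # normalize whitespace
            if depth == 0:
                reduced["class_signature"] = decl
            elif depth == 1 and "(" in decl:
                # Text before a body-opening brace in the class body.
                reduced["method_signatures"].append(decl)
            depth += 1
            buf = []
        elif ch == "}":
            depth -= 1
            buf = []
        elif ch == ";":
            if depth == 1:
                # Drop any initializer, then decide: field or bodiless method.
                head = " ".join("".join(buf).split()).split("=")[0].strip()
                if "(" in head:
                    reduced["method_signatures"].append(head)
                elif head:
                    reduced["attributes"].append(head.split()[-1])
            buf = []
        elif depth <= 1:
            # Only buffer text outside method bodies.
            buf.append(ch)
    return reduced


# Illustrative input only; this class is not taken from the paper's dataset.
EXAMPLE_SOURCE = """
public class Stack<T> extends AbstractList<T> {
    private int size = 0;
    private Object[] items;

    public void push(T item) {
        if (size == items.length) { grow(); }
        items[size++] = item;
    }

    public T pop() throws EmptyStackException {
        return (T) items[--size];
    }
}
"""

if __name__ == "__main__":
    for key, value in reduce_class(EXAMPLE_SOURCE).items():
        print(key, "->", value)
```

For the example class, the reduced form keeps the signature `public class Stack<T> extends AbstractList<T>`, the attribute names `size` and `items`, and the two method signatures, while every statement inside the method bodies is dropped. According to the abstract, the model takes the token sequence and the abstract syntax tree of reduced content like this as its inputs.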
