As large language models (LLMs) have become more advanced, generating code to solve exercises in programming courses has become significantly easier. However, this convenience raises the concern of over-reliance on th...
As Large Language Models (LLMs) continue to advance, their capabilities in code clone detection have garnered significant attention. While much research has assessed LLM performance on human-generated code, the proliferation of LLM-generated code raises critical questions about their ability to detect clones across both human- and LLM-created codebases, a capability that remains largely unexplored. This paper addresses this gap by evaluating two versions of LLaMA3 on these distinct types of datasets. Additionally, we perform a deeper analysis beyond simple prompting, examining the nuanced relationship between code cloning and the code similarity that LLMs infer. We further explore how fine-tuning affects LLM performance in clone detection, offering new insights into the interplay between code clones and similarity in human- versus AI-generated code. Our findings reveal that the LLaMA models excel at detecting syntactic clones but struggle with semantic clones. Notably, the models perform better on LLM-generated datasets for semantic clones, suggesting a potential bias. Fine-tuning enhances the models' ability to comprehend code semantics, improving their performance in both code clone detection and code similarity assessment. Our results offer valuable insights into the effectiveness and characteristics of LLMs in clone detection and code similarity assessment, providing a foundation for future applications and guiding further research in this area.
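To make the syntactic/semantic distinction concrete, the following is a minimal, hypothetical Java sketch, not drawn from the paper's datasets: accumulate is a syntactic (roughly Type-2) clone of sum with only identifiers renamed, while sumViaStream is a semantic (roughly Type-4) clone that computes the same result with different syntax. The abstract reports that models of this kind handle the first case well and the second poorly.

// Hypothetical illustration of the clone types the paper distinguishes.
public class CloneExample {
    // Original method: sums the elements of an array.
    static int sum(int[] values) {
        int total = 0;
        for (int v : values) {
            total += v;
        }
        return total;
    }

    // Syntactic clone: identical structure, only identifiers renamed.
    static int accumulate(int[] numbers) {
        int result = 0;
        for (int n : numbers) {
            result += n;
        }
        return result;
    }

    // Semantic clone: same behavior, different syntax (stream vs. explicit loop).
    static int sumViaStream(int[] values) {
        return java.util.Arrays.stream(values).sum();
    }
}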
ISBN (print): 9798400704826
The increasing trend of using Large Language Models (LLMs) for code generation raises the question of whether they can generate trustworthy code. While many researchers are exploring the utility of code generation for uncovering software vulnerabilities, one crucial but often overlooked aspect is the use of security Application Programming Interfaces (APIs). Security APIs play an integral role in upholding software security, yet integrating them effectively presents substantial challenges. This leads to inadvertent misuse by developers, thereby exposing software to vulnerabilities. To overcome these challenges, developers may seek assistance from LLMs. In this paper, we systematically assess ChatGPT's trustworthiness in code generation for security API use cases in Java. To conduct a thorough evaluation, we compile an extensive collection of 48 programming tasks covering 5 widely used security APIs. We employ both automated and manual approaches to detect security API misuse in the code generated by ChatGPT for these tasks. Our findings are concerning: around 70% of the code instances across 30 attempts per task contain security API misuse, with 20 distinct misuse types identified. Moreover, for roughly half of the tasks, this rate reaches 100%, indicating that there is a long way to go before developers can rely on ChatGPT to securely implement security API code.
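For illustration, here is one well-known Java security API misuse pattern of the kind such studies flag, alongside a commonly recommended alternative. The specific misuse shown (AES in ECB mode via the JCA Cipher API) is a standard textbook example chosen as an assumption here, not necessarily one of the paper's 20 identified misuse types.

import javax.crypto.Cipher;
import javax.crypto.KeyGenerator;
import javax.crypto.SecretKey;
import javax.crypto.spec.GCMParameterSpec;
import java.security.SecureRandom;

public class CipherMisuseExample {
    // Misuse: ECB mode encrypts identical plaintext blocks to identical
    // ciphertext blocks, leaking data patterns. Misuse detectors commonly
    // flag this transformation string.
    static byte[] encryptInsecure(SecretKey key, byte[] plaintext) throws Exception {
        Cipher cipher = Cipher.getInstance("AES/ECB/PKCS5Padding"); // misuse: ECB
        cipher.init(Cipher.ENCRYPT_MODE, key);
        return cipher.doFinal(plaintext);
    }

    // Commonly recommended alternative: AES-GCM with a fresh random IV.
    static byte[] encryptSecure(SecretKey key, byte[] plaintext) throws Exception {
        byte[] iv = new byte[12];
        new SecureRandom().nextBytes(iv);
        Cipher cipher = Cipher.getInstance("AES/GCM/NoPadding");
        cipher.init(Cipher.ENCRYPT_MODE, key, new GCMParameterSpec(128, iv));
        // Callers must store the IV alongside the ciphertext for decryption.
        return cipher.doFinal(plaintext);
    }

    public static void main(String[] args) throws Exception {
        KeyGenerator keyGen = KeyGenerator.getInstance("AES");
        keyGen.init(256);
        SecretKey key = keyGen.generateKey();
        encryptSecure(key, "hello".getBytes());
    }
}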