检索结果-内蒙古大学图书馆

The Effectiveness of Hidden Dependence Metrics in Bug Prediction

IEEE ACCESS 2024年 12卷 77214-77225页

作者： Jasz, Judit Univ Szeged Dept Software Engn H-6720 Szeged Hungary

Finding and fixing bugs in programs is perhaps one of the most difficult, yet most important, tasks in software maintenance. This is why in the last decades, a lot of work has been done on this topic, most of which is based on machine learning methods. Studies on bug prediction can be found for almost all programming languages. The solutions presented generally try to predict bugs based on information that can be easily extracted from the source code, rather than more expensive solutions that require a deeper understanding of the program. Another feature of these solutions is that they usually try to predict faults at a high level (module/file/class), which is useful, but locating the bug itself is still a difficult task. This work presents a solution that attempts to predict bugs at the method level, while also tracking the dependencies in the program using an efficient algorithm, resulting in an approach that can predict bugs more accurately. The practical measurements show that the defined approach really outperforms predictions based on traditional metrics in most cases, and with proper filtering, the best-performing RandomForest algorithm according to the F-measure can even achieve an improvement of up to 11%. Finally, it is proven that the introduced metrics are even suitable for predicting bugs that will appear later in a given project if sufficient learning data is available.

关键词： Computer bugs Measurement Software Databases source coding Software measurement Sea measurements Predictive models Bug prediction method level hidden dependencies metrics

来源：评论

学校读者我要写书评

暂无评论

New Proofs of Gaussian Extremal Inequalities With Applications

引用

IEEE TRANSACTIONS ON INFORMATION THEORY 2024年第5期70卷 3082-3099页

作者： Xu, Yinfei Chen, Guojun Chen, Jun Jin, Shi Southeast Univ Sch Informat Sci & Engn Nanjing 210096 Peoples R China Southeast Univ Natl Mobile Commun Res Lab Nanjing 210096 Peoples R China McMaster Univ Dept Elect & Comp Engn Hamilton ON L8S 4K1 Canada

The conventional enhancement-and-perturbation approach to establishing Gaussian extremal inequalities is refined via a novel monotone path argument in the product probability space. This refined approach is illustrated with simplified/corrected proofs of the Liu-Viswanath extremal inequality and a vector generalization of Costa's entropy power inequality. The power of this refinement is further demonstrated by characterizing two information-theoretic limits, namely, the capacity region of the multiple-input multiple-output (MIMO) Gaussian broadcast channel with private and common messages and the rate-distortion-equivocation function of vector Gaussian secure source coding, which have previously resisted the attack of the conventional approach.

关键词： Vectors MIMO communication Covariance matrices source coding Gaussian distribution Perturbation methods Entropy Entropy power inequality extremal inequality Fisher information mean squared error MIMO Gaussian broadcast channel vector Gaussian source coding

来源：评论

学校读者我要写书评

暂无评论

Measuring and Characterizing (Mis)compliance of the Android Permission System

引用

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 2024年第4期50卷 742-764页

作者： Barzolevskaia, Anna Branca, Enrico Stakhanova, Natalia Univ Saskatchewan Dept Comp Sci Saskatoon SK S7N 5C9 Canada

Within the Android mobile operating system, Android permissions act as a system of safeguards designed to restrict access to potentially sensitive data and privileged components. Multiple research studies indicate flaws and limitations of the Android permission system, prompting Google to implement a more regulated and fine-grained permission model. This newly-introduced complexity creates confusion for developers leading to incorrect permissions and a significant risk to users security and privacy. We present a systematic study of theoretical and practical misuse of permissions. For this analysis we derive the unified permissions and call mappings that represent theoretical requirements of permissions and calls. We develop PChecker, an approach that identifies the discrepancies between the official Android permissions documentation and permission implementation in the Android platform source code based on these mappings. We evaluate four versions of the Android Open source Project code (major versions 10-13) and shed light on the prevalence of discrepancies between the official Android guidelines for permissions and their implementation in the Android platform source code. We further show that these discrepancies result in miscompliance in third-party Android apps.

关键词： Operating systems Smart phones Documentation source coding Codes Guidelines Runtime Android documentation app permissions non-SDK restriction lists security

来源：评论

学校读者我要写书评

暂无评论

Free and Open source Software

引用

COMPUTER 2024年第8期57卷 114-118页

作者： Riehle, Dirk Friedrich Alexander Univ Erlangen Nurnberg D-91054 Erlangen Germany

Free software is software that gives users the right to use the software, to modify the software, and to pass on the software, modified or not, all free of charge and without restrictions on what the software is used for. Open source software provides users with the same rights as free software. For all practical purposes, they are the same.

关键词： Software Packages Open source Software Licenses Codes source coding Patents Intellectual Property Trademarks Companies Guidelines Copyright Protection Business Performance Analysis Economics Open source Open coding Free Package Obligations Non Profit source Code Non English Speaking Massachusetts Institute Of Technology Open Projects Open source License Open License Free Software Foundation Software Vendors Intellectual Property Property Rights Trademark Intellectual Property Rights Collective Work Open source Projects Copyright Law Patent Rights

来源：评论

学校读者我要写书评

暂无评论

A Retrospective on the source Code Control System

引用

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 2025年第3期51卷 695-699页

作者： Rochkind, Marc J. Bell Labs PISCATAWAY NJ 08854 USA

The source Code Control System (SCCS) was first introduced in 1975 (Rochkind, 1975). It controlled computer program source code by tracking versions and recording who made changes, when, and why. The present retrospective paper assesses the strengths and weaknesses of SCCS and traces its influence on software engineering over the past fifty years.

关键词： Software Software engineering Merging Control systems Programming source coding Mainframes Codes Software reliability Software development management software engineering software reliability software tools version control systems software configuration management source control management

来源：评论

学校读者我要写书评

暂无评论

MAGECODE: Machine-Generated Code Detection Method Using Large Language Models

引用

IEEE ACCESS 2024年 12卷 190186-190202页

作者： Pham, Hung Ha, Huyen Tong, Van Hoang, Dung Tran, Duc Le, Tuyen Ngoc Hanoi Univ Sci & Technol Sch Informat & Commun Technol Hanoi 100000 Vietnam Ming Chi Univ Technol Dept Elect Engn Taipei 24301 Taiwan Ming Chi Univ Technol Ctr Reliabil Engn Taipei 24301 Taiwan

The widespread use of virtual assistants (e.g., GPT4 and Gemini, etc.) by students in their academic assignments raises concerns about academic integrity. Consequently, various machine-generated text (MGT) detection methods, developed from metric-based and model-based approaches, were proposed and shown to be highly effective. The model-based MGT methods often encounter difficulties when dealing with source code due to disparities in semantics compared to natural languages. Meanwhile, the efficacy of metric-based MGT methods on source code has not been investigated. Moreover, the challenge of identifying machine-generated codes (MGC) has received less attention, and existing solutions demonstrate low accuracy and high false positive rates across diverse human-written codes. In this paper, we take into account both semantic features extracted from Large Language Models (LLMs) and the applicability of metrics (e.g., Log-Likelihood, Rank, Log-rank, etc.) for source code analysis. Concretely, we propose MageCode, a novel method for identifying machine-generated codes. MageCode utilizes the pre-trained model CodeT5+ to extract semantic features from source code inputs and incorporates metric-based techniques to enhance accuracy. In order to assess the proposed method, we introduce a new dataset comprising more than 45,000 code solutions generated by LLMs for programming problems. The solutions for these programming problems which were obtained from three advanced LLMs (GPT4, Gemini, and Code-bison-32k), were written in Python, Java, and C++. The evaluation of MageCode on this dataset demonstrates superior performance compared to existing baselines, achieving up to 98.46% accuracy while maintaining a low false positive rate of less than 1%.

关键词： Codes source coding Computer architecture Transformers Programming profession Large language models Feature extraction Measurement Computational modeling Python Machine-generated code detection large language model metrics CodeT5+

来源：评论

学校读者我要写书评

暂无评论

Supersonic: Learning to Generate source Code Optimizations in C/C plus

引用

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 2024年第11期50卷 2849-2864页

作者： Chen, Zimin Fang, Sen Monperrus, Martin KTH Royal Inst Technol S-11428 Stockholm Sweden

Software optimization refines programs for resource efficiency while preserving functionality. Traditionally, it is a process done by developers and compilers. This paper introduces a third option, automated optimization at the source code level. We present Supersonic , a neural approach targeting minor source code modifications for optimization. Using a seq2seq model, Supersonic is trained on C/C++ program pairs ( x(t) , x(t+1) ), where x(t+1) is an optimized version of x(t) , and outputs a diff. Supersonic 's performance is benchmarked against OpenAI's GPT-3.5-Turbo and GPT-4 on competitive programming tasks. The experiments show that Supersonic not only outperforms both models on the code optimization task but also minimizes the extent of the change with a model more than 600x smaller than GPT-3.5-Turbo and 3700x smaller than GPT-4.

关键词： Optimization Codes Training source coding Task analysis Decoding Vectors Code optimization Seq2Seq learning large language model

来源：评论

学校读者我要写书评

暂无评论

Innovating Industry With Research: eknows and Sysparency

引用

IEEE SOFTWARE 2024年第3期41卷 41-48页

作者： Geist, Verena Moser, Michael Pichler, Josef Schnitzhofer, Florian Software Competence Ctr Hagenberg GmbH A-4232 Hagenberg Im Muhlkreis Austria Univ Appl Sci Upper Austria Programming & Project Dev Campus Hagenberg A-4232 Hagenberg Im Muhlkreis Austria Sysparency GmbH A-4040 Linz Austria

We present the multi-language software platform eknows for building reverse engineering tools and documentation generators as a concrete example of how to successfully translate research on software analysis into innovative products and services. Platform development includes domain-specific requirements and an architecture supporting reuse of components.

关键词： Software Documentation Codes Industries Stakeholders source coding Business

来源：评论

学校读者我要写书评

暂无评论

CRPWarner: Warning the Risk of Contract-Related Rug Pull in DeFi Smart Contracts

引用

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 2024年第6期50卷 1534-1547页

作者： Lin, Zewei Chen, Jiachi Wu, Jiajing Zhang, Weizhe Wang, Yongjuan Zheng, Zibin Sun Yat Sen Univ Software Engn Zhuhai 519082 Peoples R China Peng Cheng Lab Shenzhen 518000 Peoples R China Sun Yat Sen Univ Sch Software Engn Zhuhai 519082 Peoples R China Harbin Inst Technol Sch Comp Sci & Technol Shenzhen 518055 Peoples R China Peng Cheng Lab Shenzhen 518000 Peoples R China Henan Key Lab Network Cryptog Technol Zhengzhou 450000 Peoples R China

In recent years, Decentralized Finance (DeFi) has grown rapidly due to the development of blockchain technology and smart contracts. As of March 2023, the estimated global cryptocurrency market cap has reached approximately $949 billion. However, security incidents continue to plague the DeFi ecosystem, and one of the most notorious examples is the "Rug Pull" scam. This type of cryptocurrency scam occurs when the developer of a particular token project intentionally abandons the project and disappears with investors' funds. Despite only emerging in recent years, Rug Pull events have already caused significant financial losses. In this work, we manually collected and analyzed 103 real-world rug pull events, categorizing them based on their scam methods. Two primary categories were identified: Contract-related Rug Pull (through malicious functions in smart contracts) and Transaction-related Rug Pull (through cryptocurrency trading without utilizing malicious functions). Based on the analysis of rug pull events, we propose CRPWarner (short for Contract-related Rug Pull Risk Warner) to identify malicious functions in smart contracts and issue warnings regarding potential rug pulls. We evaluated CRPWarner on 69 open-source smart contracts related to rug pull events and achieved a 91.8% precision, 85.9% recall, and 88.7% F1-score. Additionally, when evaluating CRPWarner on 13,484 real-world token contracts on Ethereum, it successfully detected 4168 smart contracts with malicious functions, including zero-day examples. The precision of large-scale experiments reaches 84.9%.

关键词： Smart contracts Cryptocurrency Finance Decentralized applications Blockchains Ecosystems source coding decentralized finance rug pull datalog analysis

来源：评论

学校读者我要写书评

暂无评论

HeVulD: A Static Vulnerability Detection Method Using Heterogeneous Graph Code Representation

引用

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2024年 19卷 9129-9144页

作者： Huang, Yuanming He, Mingshu Wang, Xiaojuan Zhang, Jie Beijing Univ Posts & Telecommun Sch Integrated Circuits Beijing 100876 Peoples R China Beijing Univ Posts & Telecommun Sch Cyberspace Secur Beijing 100876 Peoples R China Beijing Univ Posts & Telecommun Sch Elect Engn Beijing 100876 Peoples R China

Vulnerability detection in source code has been a focal point of research in recent years. Traditional rule-based methods fail to identify complex and unknown vulnerabilities, leading to poor performance. While deep learning (DL)-based methods have improved these shortcomings, there is still room for enhancement. For C/C++ source code, effective vulnerability detection requires considering both the information in code statements and the structural information of the code. Graph-based code representation methods can address this need, but existing approaches often use homogeneous graphs that do not differentiate between various types of code statements or dependencies. Few methods use heterogeneous graphs for C/C++ code representation. This study explores this potential and proposes a new C/C++ vulnerability detection method named HeVulD. HeVulD introduces two node definition approaches and a key-node-based program slicing method, generating heterogeneous graph representations for source code. These representations consist of both heterogeneous nodes and edges, providing a more precise representation of source code. HeVulD achieves an F1-score of 96.4% on the SARD dataset, outperforming nine baseline C/C++ vulnerability detection methods. HeVulD has been tested under adversarial attack scenarios to assess its robustness. Additionally, HeVulD has been tested on ten open-source software projects and the latest CVEs, demonstrating its detection and generalization capabilities in real-world scenarios and its ability to identify unknown vulnerabilities.

关键词： Codes source coding Software Image edge detection Syntactics Semantics Security Software security vulnerability detection deep learning program analysis heterogeneous graph representation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：