检索结果-内蒙古大学图书馆

The significance of user-defined identifiers in java source code authorship identification

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Frantzeskou, Georgia MacDonell, Stephen G. Stamatatos, Efstathios Georgiou, Stelios Gritzalis, Stefanos Dept of Info. and Comm. Systems Engineering University of the Aegean Samos83200 Greece SERL School of Comp. and Math. Sciences Auckland University of Technology Private Bag 92006 Auckland1142 New Zealand Department of Statistics and Actuarial - Financial Mathematics University of the Aegean Samos83200 Greece

When writing source code, programmers have varying levels of freedom when it comes to the creation and use of identifiers. Do they habitually use the same identifiers, names that are different to those used by others? Is it then possible to tell who the author of a piece of code is by examining these identifiers? If so, can we use the presence or absence of identifiers to assist in correctly classifying programs to authors? Is it possible to hide the provenance of programs by identifier renaming? In this study, we assess the importance of three types of identifiers in source code author classification for two different java program data sets. We do this through a sequence of experiments in which we disguise one type of identifier at a time. These experiments are performed using as a tool the Source Code Author Profiles (SCAP) method. The results show that, although identifiers when examined as a whole do not seem to reflect program authorship for these data sets, when examined separately there is evidence that class names do signal the author of the program. In contrast, simple variables and method names used in java programs do not appear to reflect program authorship. On the contrary, our analysis suggests that such identifiers are so common as to mask authorship. We believe that these results have applicability in relation to the robustness of code plagiarism analysis and that the underlying methods could be valuable in cases of litigation arising from disputes over program authorship. Copyright © 2021, The Authors. All rights reserved.

关键词： java programming language

Embracing Objects Over Statics: An Analysis of Method Preferences in Open Source java Frameworks

学校读者我要写书评

暂无评论

arXiv 2024年

作者： Zakharov, Vladimir Bugayenko, Yegor Huawei Russia Moscow Russia

In today’s software development landscape, the extent to which java applications utilize object-oriented programming paradigm remains a subject of interest. Although some researches point to the considerable overhead associated with object orientation, one might logically assume that modern java applications would lean towards a procedural style to boost performance, favoring static over instance method calls. In order to validate this assumption, this study scrutinizes the runtime behavior of 28 open-source java frameworks using the YourKit profiler. Contrary to expectations, our findings reveal a predominant use of instance methods and constructors over static methods. This suggests that developers still favor an object-oriented approach, despite its potential drawbacks. © 2024, CC BY.

关键词： java programming language

A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Method-Level Code Smell Detection

学校读者我要写书评

暂无评论

arXiv 2024年

作者： Zhang, Beiqi Liang, Peng Zhou, Xin Zhou, Xiyu Lo, David Feng, Qiong Li, Zengyang Li, Lin School of Computer Science Wuhan University Wuhan China School of Computing and Information Systems Singapore Management University Singapore School of Computer Science Nanjing University of Science and Technology Nanjing China School of Computer Science Central China Normal University Wuhan China School of Computer Science and Artificial Intelligence Wuhan University of Technology Wuhan China

Code smells, which are suboptimal coding practices that can potentially lead to defects or maintenance issues, can negatively impact the quality of software systems. Most existing code smell detection methods rely on heuristics-based or machine learning (ML) and deep learning (DL)-based techniques. However, these techniques have several drawbacks (e.g., unsatisfactory performance). Large language Models (LLMs) have garnered significant attention in the software engineering (SE) field, achieving state-of-the-art performance across a wide range of SE tasks. Parameter-Efficient Fine-Tuning (PEFT) methods, which are commonly used to adapt LLMs to specific tasks with fewer parameters and reduced computational resources, have emerged as a promising approach for enhancing the performance of LLMs in various SE tasks. However, LLMs have not yet been explored for code smell detection, and their effectiveness for this task remains unclear. Furthermore, no comprehensive investigation has been conducted on the efficiency of PEFT methods for method-level code smell detection. In this regard, we systematically evaluate the effectiveness of state-of-the-art PEFT methods on both small and large language Models (LMs) for method-level code smell detection. To begin, we constructed high-quality java code smell datasets sourced from GitHub. We then fine-tuned four small LMs and six LLMs using various PEFT techniques, including prompt tuning, prefix tuning, LoRA, and (IA)3, for code smell detection. Our comparison against full fine-tuning revealed that PEFT methods not only achieve comparable or better effectiveness but also consume less peak GPU memory. Our analysis further explored the performance of small LMs versus LLMs in the context of code smell detection. Surprisingly, we found that LLMs did not outperform small LMs in this specific task, suggesting that smaller models may be more suited for method-level code smell detection. We also investigated the impact of varying hyper-param

关键词： java programming language

java 5.0 perks up with new language constructs

学校读者我要写书评

暂无评论

Electronic Design 2004年第26期52卷 44-44页

作者： Wong, William

The modifications and changes proposed for java language in its new version java 5.0 are discussed. The expected changes include new virtual machine and complier. The new version is focusing on more real time and safety-critical applications. The changes which include generics, an enhanced loop construct and variable length argument lists are expected to simplify the programming.

关键词： java programming language

Executable trigger-action comments

学校读者我要写书评

暂无评论

arXiv 2018年

作者： Nie, Pengyu Rai, Rishabh Li, Junyi Jessy Khurshid, Sarfraz Mooney, Raymond J. Gligoric, Milos University of Texas at Austin

Natural language elements, e.g., todo comments, are frequently used to communicate among the developers and to describe tasks that need to be performed (actions) when specific conditions hold in the code repository (triggers). As projects evolve, development processes change, and development teams reorganize, these comments, because of their informal nature, frequently become irrelevant or forgotten. We present the first technique, dubbed TrigIt, to specify triggeraction todo comments as executable statements. Thus, actions are executed automaticallywhen triggers evaluate to true. TrigIt specifications are written in the host language (e.g., java) and are evaluated as part of the build process. The triggers are specified as query statements over abstract syntax trees and abstract representation of build configuration scripts, and the actions are specified as code transformation steps. We implemented TrigIt for the java programming language and migrated 20 existing trigger-action comments from 8 popular open-source projects. We evaluate the cost of using TrigIt in terms of the number of tokens in the executable comments and the time overhead introduced in the build process. Copyright © 2018, The Authors. All rights reserved.

关键词： java programming language

TestBench: Evaluating Class-Level Test Case Generation Capability of Large language Models

学校读者我要写书评

暂无评论

arXiv 2024年

作者： Zhang, Quanjun Shang, Ye Fang, Chunrong Gu, Siqi Zhou, Jianyi Chen, Zhenyu The State Key Laboratory for Novel Software Technology Nanjing University China Huawei Cloud Computing Technologies Co. Ltd. China

In this paper, we introduce TestBench, a benchmark for class-level LLM-based test case generation. We construct a dataset of 108 java programs from 9 real-world, large-scale projects on GitHub, each representing a different thematic domain. We then design three distinct types of prompts based on context descriptions, including self-contained context, full context, and simple context. Besides, we propose a fine-grained evaluation framework that considers five aspects of test cases: syntactic correctness, compilation correctness, test correctness, code coverage rate, and defect detection rate. Furthermore, we propose a heuristic algorithm to repair erroneous test cases generated by LLMs. We evaluate CodeLlama-13b, GPT-3.5, and GPT-4 on the TestBench, and our experimental results indicate that larger models demonstrate a greater ability to effectively utilize contextual information, leading to generate higher-quality test cases. Smaller models may struggle with the noise introduced by the extensive information contained within the full context. However, when using the simplified version, namely the simple context, which is derived from the full context via abstract syntax tree analysis, the performance of these models improves significantly. Our analysis highlights the current progress and pinpoints future directions to further enhance the effectiveness of models by handling contextual information for test case generation. © 2024, CC BY.

关键词： java programming language

Encapsulating objects with confined types 01

学校读者我要写书评

暂无评论

Encapsulating objects with confined types

ACM SIGPLAN Conference on Object-oriented programming, Systems, languages, and Applications

作者： Christian Grothoff Jens Palsberg Jan Vitek Slt supgt 3lt/supgt Lab Department of Computer Sciences Purdue University

ISBN: (纸本)1581133359

Object-oriented languages provide little support for encapsulating objects. Reference semantics allows objects to escape their defining scope. The pervasive aliasing that ensues remains a major source of software defects. This paper introduces Kacheck/J a tool for inferring object encapulation properties in large java programs. Our goal is to develop practical tools to assist software engineers, thus we focus on simple and scalable techniques. Kacheck/J is able to infer confinement for java classes. A class and its sublasses are confined if all of their instances are encapsulated in their defining package. This simple property can be used to identify accidental leaks of sensitive objects. The analysis is scalable and efficient; Kacheck/J is able t infer confinement on a corpus of 46,000 classes (115 MB) in 6 minutes

关键词： encapsulating Object oriented programming languages Scalability Software engineering Confined Semantics java java programming language

An on-the-fly reference counting garbage collector for java 01

学校读者我要写书评

暂无评论

An on-the-fly reference counting garbage collector for Java

ACM SIGPLAN Conference on Object-oriented programming, Systems, languages, and Applications

作者： Yossi Levanoni Erez Petrank Microsoft Corporation One Microsoft Way Redmond WA Dept. of Computer Science Technion - Israel Institute of Technology Haifa 32000 Israel

ISBN: (纸本)1581133359

Reference counting is not naturally suitable for running on multiprocessors. The update of pointers and reference counts requires atomic and synchronized operations. We present a novel reference counting algorithm suitable for a multiprocessor that does not require any synchronized operation in its write barrier (not even a compare-and-swap type of synchronization). The algorithm is efficient and may complete with any tracing algorithm.

关键词： waste collection java java programming language synchronized operation Multiprocessor Atomicity algorithms

A high speed algorithm for identifying hand gestures for an ATM input system for the blind

学校读者我要写书评

暂无评论

A high speed algorithm for identifying hand gestures for an ...

IEEE Bombay Section Symposium (IBSS)

作者： Sudhir Rao Rupanagudi B. S. Ranjani Varsha G. Bhat K. Surabhi P. R. Reshma G. Shruthi K. P. Sarayu R Sangeetha B Rajesh Rao S Vasanti World Serve Education Bengaluru India Department of Electronics & Communication Jyothy Institute of Technology Bengaluru India Department of Telecommunication Atria Institute of Technology Bengaluru India

With the evolution in science and technology, a lot has been done over the past few years to make the lives of the differently-abled more comfortable and easy. This paper concentrates on a novel methodology to ease the use of an ATM machine for the blind. It describes an approach wherein both the username and PIN for the ATM machine can be input using British Sign language. A cost effective setup and also a high speed algorithm for hand gesture recognition has been elaborated. In comparison with previous algorithms, the method explained in this paper is 1.65 times faster thus proving its efficacy and efficiency. All algorithms were first designed and developed in MATLAB 2011b and then later deployed as software using the java programming language.

关键词： ATM British Sign language java Sign language recognition hand gesture recognition image processing security video processing java programming language Gesture recognition Automatic Teller Machines java Image processing sign language recognition input system evolution Security Blind