The authors have developed an approach to the search for structurally similar projects of software systems. Teachers can use the proposed approach to search for borrowings in the works of students. The concept behind ...
详细信息
Bias is a common problem in both human cognition and machinelearning tasks. However, machines struggle more than humans with bias reduction, mainly because most algorithms rely on the assumption that the training dat...
详细信息
This paper proposes an intelligent and machine-learning based optimization method that targets to optimal windings layer setup for LLC converter transformer with small number of optimizing iterations. The research uti...
详细信息
Computational notebooks have become the go-to way for solving data-science problems. While they are designed to combine code and documentation, prior work shows that documentation is largely ignored by the developers ...
详细信息
ISBN:
(纸本)9798350329964
Computational notebooks have become the go-to way for solving data-science problems. While they are designed to combine code and documentation, prior work shows that documentation is largely ignored by the developers because of the manual effort. Automated documentation generation can help, but existing techniques fail to capture algorithmic details and developers often end up editing the generated text to provide more explanation and sub-steps. This paper proposes a novel machine-learning pipeline, Cell2Doc, for code cell documentation in Python data science notebooks. Our approach works by identifying different logical contexts within a code cell, generating documentation for them separately, and finally combining them to arrive at the documentation for the entire code cell. Cell2Doc takes advantage of the capabilities of existing pre-trained language models and improves their efficiency for code cell documentation. We also provide a new benchmark dataset for this task, along with a data-preprocessing pipeline that can be used to create new datasets. We also investigate an appropriate input representation for this task. Our automated evaluation suggests that our best input representation improves the pre-trained model's performance by 2.5x on average. Further, Cell2Doc achieves 1.33x improvement during human evaluation in terms of correctness, informativeness, and readability against the corresponding standalone pre-trained model.
Identifying testable quality attribute scenarios (QASs) and generating test cases is challenging due to subjectivity, complexity, interdependency, and lack of well-defined metrics in the quality attributes. In this st...
详细信息
Molds have an immense number of species with different classifications and characteristics. Each one of them has its own human health-related effects and conditions after being consumed. Some species are good, most of...
详细信息
The application of pre-trained models in continual learning has become the norm, with most methods relying on them. Many approaches freeze or fine-tune the backbone network during training to combat catastrophic forge...
详细信息
The skin, being the largest organ in the human body, plays a crucial role in protecting and covering the body while performing various functions. However, skin diseases, such as vitiligo, can result in changes to the ...
详细信息
In recent years, deep learning has become an important research method in different applications, including the field of medical and health care, especially in the detection of abnormal ECG signals. However, the exist...
详细信息
This paper proposes a novel solution to the common problem of knee stiffness experienced by patients following knee replacement surgery. The paper suggests designing a wearable knee pad that is fitted with three 6-axi...
详细信息
暂无评论