The proceedings contain 32 papers. The topics discussed include: a comparative analysis of unsupervised language adaptation methods;a logical-based corpus for cross-lingual evaluation;bad form: comparing context-based...
ISBN:
(纸本)9781950737789
The proceedings contain 32 papers. The topics discussed include: a comparative analysis of unsupervised language adaptation methods;a logical-based corpus for cross-lingual evaluation;bad form: comparing context-based and form-based few-shot learning in distributional semantic models;bag-of-words transfer: non-contextual techniques for multi-task learning;BERT is not an interlingua and the bias of tokenization;cross-lingual joint entity and word embedding to improve entity linking and parallel sentence mining;deep bidirectional transformers for relation extraction without supervision;domain adaptation with BERT-based domain classification and data selection;and X-WikiRE: a large, multilingual resource for relation extraction as machine comprehension.
Deep Learning models are at the core of research in Artificial Intelligence research today. A tide in research for deep learning on graphs or graph neural networks. This wave of research at the intersection of graph t...
详细信息
ISBN:
(纸本)9781450383325
Deep Learning models are at the core of research in Artificial Intelligence research today. A tide in research for deep learning on graphs or graph neural networks. This wave of research at the intersection of graph theory and deep learning has also influenced other fields of science, including computer vision, naturallanguageprocessing, program synthesis and analysis, financial security, Drug Discovery and so on. However, there are still many challenges regarding a broad range of the topics in deep learning on graphs, from methodologies to applications, and from foundations to the new frontiers of GNNs. This international workshop on "Deep Learning on graphs: Method and Applications (DLG-KDD'21)" aims to bring together both academic researchers and industrial practitioners from different backgrounds and perspectives to above challenges.
In the realm of software engineering, collaborative efforts among development teams are essential, and comments play a crucial role in maintaining and enhancing software quality. These comments serve various purposes,...
详细信息
In the realm of software engineering, collaborative efforts among development teams are essential, and comments play a crucial role in maintaining and enhancing software quality. These comments serve various purposes, from clarifying complex code logic to aiding in debugging and providing insights into design decisions. However, distinguishing between useful and redundant comments can be a challenging task. This paper explores the use of language Model-based (LLM) approaches, specifically advanced models like GPT-3, to automate comment classification and assess their utility. By harnessing the contextual understanding and generation capabilities of these models, the research hypothesizes significant improvements in comment analysis. Extensive experiments using both real-world human-labeled comments and synthetic comments generated by ChatGPT demonstrate that LLMs can classify comments with remarkable accuracy, surpassing previous methods reliant on surface features. Additionally, the study critically examines factors such as pre-training data, comment coverage, and model architectures, shedding light on their impact on comment analysis. In summary, this research makes several notable contributions. It thoroughly explores the use of state-of-the-art LLMs for comment classification across diverse settings, provides a benchmark dataset of human-annotated comments, and shows that LLMs can substantially enhance codebase documentation by automatically identifying low-quality comments. These techniques have the potential to be integrated into Integrated Development Environments (IDEs) to offer developers continuous feedback. Finally, the paper opens up new possibilities for leveraging advanced naturallanguageprocessing (NLP) in software engineering tasks that demand deeper code comprehension, despite lingering questions about model robustness and the nature of human-AI collaboration. This work highlights the immense potential of LLMs in transforming programming by master
In the realm of software development, collaboration among development teams is vital, and comments play a pivotal role in maintaining and improving software quality. Comments serve diverse purposes, from elucidating c...
详细信息
In the realm of software development, collaboration among development teams is vital, and comments play a pivotal role in maintaining and improving software quality. Comments serve diverse purposes, from elucidating complex code logic to aiding in debugging and offering insights into design decisions. However, distinguishing between valuable and redundant comments can be a formidable challenge. This paper explores the potential of language Model-based (LLM) approaches, particularly advanced models such as GPT-3, to automate the classification of comments and evaluate their effectiveness. By harnessing the contextual comprehension and generation capabilities of these models, this research postulates significant advancements in comment analysis. Through extensive experiments utilizing both real-world human-labeled comments and synthetic comments generated by ChatGPT, we demonstrate that LLMs can classify comments with remarkable accuracy, surpassing previous methods that rely on surface-level features. Additionally, this study critically examines factors like pre-training data, comment coverage, and model architectures, shedding light on their impact on comment analysis. In summary, this research makes several substantial contributions. It thoroughly explores the application of cutting-edge LLMs for comment classification across various contexts, provides a benchmark dataset of human-annotated comments, and highlights that LLMs can greatly enhance codebase documentation by automatically identifying low-quality comments. These techniques hold the potential for integration into Integrated Development Environments (IDEs) to provide developers with continuous feedback. Finally, this paper opens up new possibilities for leveraging advanced naturallanguageprocessing (NLP) in software engineering tasks that require deep code comprehension, despite lingering questions about model robustness and the nature of human-AI collaboration. This work underscores the enormous potenti
Analysing long text documents of political discourse to identify check-worthy claims (claim detection) is known to be an important task in automated fact-checking systems, as it saves the precious time of fact-checker...
详细信息
Moral AI has been studied in the fields of philosophy and artificial intelligence. Although most existing studies are only theoretical, recent developments in AI have made it increasingly necessary to implement AI wit...
详细信息
In the interest of interpreting neural NLI models and their reasoning strategies, we carry out a systematic probing study which investigates whether these models capture the crucial semantic features central to natura...
详细信息
Causality detection draws plenty of attention in the field of naturallanguageprocessing and linguistics research. It has essential applications in information retrieval, event prediction, question answering, financi...
详细信息
naturallanguageprocessing offers new insights into language data across almost all disciplines and domains, and allows us to corroborate and/or challenge existing knowledge. The primary hurdles to widening participa...
详细信息
Estimating the software projects’ efforts developed by agile methods is important for project managers or technical leads. It provides a summary as a first view of how many hours and developers are required to comple...
详细信息
ISBN:
(纸本)9781665462310
Estimating the software projects’ efforts developed by agile methods is important for project managers or technical leads. It provides a summary as a first view of how many hours and developers are required to complete the tasks. There are research works on automatic predicting the software efforts, including Term Frequency - Inverse Document Frequency (TFIDF) as the traditional approach for this problem. graph Neural Network is a new approach that has been applied in naturallanguageprocessing for text classification. The advantages of graph Neural Network are based on the ability to learn information via graph data structure, which has more representations such as the relationships between words compared to approaches of vectorizing sequence of words. In this paper, we show the potential and possible challenges of graph Neural Network text classification in story point level estimation. By the experiments, we show that the GNN Text Level Classification can achieve as high accuracy as about 80% for story points level classification, which is comparable to the traditional approach. We also analyze the GNN approach and point out several current disadvantages that the GNN approach can improve for this problem or other problems in software engineering.
暂无评论