Developers rely on software logs for a variety of tasks, such as debugging, testing, program comprehension, verification, and performance analysis. Despite the importance of logs, prior studies show that there is no i...
详细信息
Developers rely on software logs for a variety of tasks, such as debugging, testing, program comprehension, verification, and performance analysis. Despite the importance of logs, prior studies show that there is no industrial standard on how to write logging statements. In this paper, we focus on studying duplicate logging statements, which are logging statements that have the same static text message. Such duplications in the text message are potential indications of logging code smells, which may affect developers' understanding of the dynamic view of the system. We manually studied over 4K duplicate logging statements and their surrounding code in five large-scale open source systems: Hadoop, CloudStack, Elasticsearch, Cassandra, and Flink. We uncovered five patterns of duplicate logging code smells. For each instance of the duplicate logging code smell, we further manually identify the potentially problematic (i.e., require fixes) and justifiable (i.e., do not require fixes) cases. Then, we contact developers to verify our manual study result. We integrated our manual study result and developers' feedback into our automated static analysis tool, DLFinder, which automatically detects problematic duplicate logging code smells. We evaluated DLFinder on the five manually studied systems and three additional systems: Camel, Kafka and Wicket. In total, combining the results of DLFinder and our manual analysis, we reported 91 problematic duplicate logging code smell instances to developers and all of them have been fixed. We further study the relationship between duplicate logging statements, including the problematic instances of duplicate logging code smells, and code clones. We find that 83% of the duplicate logging code smell instances reside in cloned code, but 17% of them reside in micro-clones that are difficult to detect using automated clone detection tools. We also find that more than half of the duplicate logging statements reside in cloned code snippets, a
Loss functions are at the heart of deep learning, shaping how models learn and perform across diverse tasks. They are used to quantify the difference between predicted outputs and ground truth labels, guiding the opti...
详细信息
Scientific modeling provides mathematical abstractions of real-world systems and builds software as implementations of these mathematical abstractions. Ocean science is a multidisciplinary discipline developing scient...
详细信息
In the era of rapid development of intelligent technology, the children's digital publishing industry is facing a disruptive change, and this change may be realized through the reconstruction of the supply and dem...
详细信息
Meta-learning aims to learn general knowledge with diverse training tasks conducted from limited data, and then transfer it to new tasks. It is commonly believed that increasing task diversity will enhance the general...
详细信息
This paper investigates group distributionally robust optimization (GDRO), with the purpose to learn a model that performs well over m different distributions. First, we formulate GDRO as a stochastic convex-concave s...
This paper investigates group distributionally robust optimization (GDRO), with the purpose to learn a model that performs well over m different distributions. First, we formulate GDRO as a stochastic convex-concave saddle-point problem, and demonstrate that stochastic mirror descent (SMD), using m samples in each iteration, achieves an O(m(log m)/ε2) sample complexity for finding an ε-optimal solution, which matches the Ω(m/ε2) lower bound up to a logarithmic factor. Then, we make use of techniques from online learning to reduce the number of samples required in each round from m to 1, keeping the same sample complexity. Specifically, we cast GDRO as a two-players game where one player simply performs SMD and the other executes an online algorithm for non-oblivious multi-armed bandits. Next, we consider a more practical scenario where the number of samples that can be drawn from each distribution is different, and propose a novel formulation of weighted GDRO, which allows us to derive distribution-dependent convergence rates. Denote by ni the sample budget for the i-th distribution, and assume n1 ≥ n2 ≥ ··· ≥ nm. In the first approach, we incorporate non-uniform sampling into SMD such that the sample budget is satisfied in expectation, and prove that the excess risk of the i-th distribution decreases at an $O(\sqrt{n_1 \log m}/n_i)$ rate. In the second approach, we use mini-batches to meet the budget exactly and also reduce the variance in stochastic gradients, and then leverage stochastic mirror-prox algorithm, which can exploit small variances, to optimize a carefully designed weighted GDRO problem. Under appropriate conditions, it attains an O((log m) / √ni) convergence rate, which almost matches the optimal O(√1/ni) rate of only learning from the i-th distribution with ni samples.
This research paper introduces “Leaf Guard,” a mobile app designed for disease detection and management in banana trees, specifically tailored for Sri Lanka's agricultural landscape. Through the amalgamation of ...
This research paper introduces “Leaf Guard,” a mobile app designed for disease detection and management in banana trees, specifically tailored for Sri Lanka's agricultural landscape. Through the amalgamation of image processing, deep learning, and customized chemical recommendations, this app proficiently identifies early-stage diseases, offering tailored remedies. Catering to farmers and agricultural experts, ‘Leaf Guard’ provides an intuitive tool, revolutionizing disease management and bolstering sustainable banana cultivation practices. By seamlessly integrating these cutting-edge technologies, this research significantly advances disease detection methods, paving the way for environmentally friendly and economical interventions. The study's findings not only enrich the realm of disease detection technology but also hold promise for future innovations, fostering sustainable approaches to enhance banana cultivation.
Reconfigurable intelligent surface (RIS)-aided backscatter communication (BackCom) has recently attracted a lot of attention due to its potential to enhance energy efficiency (EE) and transmission robustness by recons...
详细信息
Accurate segmentation of pulmonary structures is crucial in clinical diagnosis, disease study, and treatment planning. Significant progress has been made in deep learning-based segmentation techniques, but most requir...
详细信息
Human-computer interaction (HCI) is an evolving field of research that focuses on understanding and improving the communication and interaction between humans and computers. Over the past decades, we have seen many si...
Human-computer interaction (HCI) is an evolving field of research that focuses on understanding and improving the communication and interaction between humans and computers. Over the past decades, we have seen many significant advances in this field, which have contributed to the widespread adoption and integration of technology into our daily lives. HCI research and development aims to design information technology systems to meet the needs and preferences of users. Usability, efficiency, accessibility and user satisfaction are important considerations in HCI design. This paper presents the results of a pilot study on user acceptance of HCIs. The results show that users are positive about and willing to use HCIs.
暂无评论