Rapid development of software significantly facilitates developers in all phases of software development. Developers can easily leverage frameworks, content management systems (CMS), and libraries available in each pr...
详细信息
ISBN:
(数字)9798350376210
ISBN:
(纸本)9798350376227
Rapid development of software significantly facilitates developers in all phases of software development. Developers can easily leverage frameworks, content management systems (CMS), and libraries available in each programming language. With the large and rapid addition of frameworks, CMS, and libraries, developers must be able to choose precisely which ones to use based on the requirements of their software development projects. The utilization of machine learning for segmentation or clustering enables developers to identify groups of frameworks, CMS, and libraries by leveraging features obtained from static analysis, such as the size of codebases, complexities, structures, and other code characteristics. One algorithm that can be used for segmentation is K-means clustering. However, the availability of datasets is crucial as it can offer information about the code characteristics of each software. The use of machine learning on datasets can be an ongoing research area in software development related to utilizing code characteristics in software analysis. This study provides a novel dataset for the domains of software development and machine learning. The dataset we created contains 30 instances and 68 features that can be utilized for related or future research. This research utilizes the Silhouette score to determine the optimal $\boldsymbol{k}$ value.
The lung is one of the prime organs, and any disease in the lung causes mild to severe breathing problems;untreated lung disease will lead to several complications. Tuberculosis (TB) is a lung ailment that needs prema...
详细信息
As a result of ML, the healthcare industry undergoes substantial innovation and improvement. As a result, data management, clinical operations, drug research, and surgery are all progressing more quickly. The healthca...
详细信息
Lymphoma diagnosis, particularly distinguishing between subtypes, is critical for effective treatment but remains challenging due to the subtle morphological differences in histopathological images. This study present...
详细信息
On the one hand, to model experts' preferences in group decision-making, intuitionistic reciprocal preference relations have widely been used because they allow for accommodating hesitation degrees, which are inhe...
详细信息
ISBN:
(数字)9781665410205
ISBN:
(纸本)9781665410212
On the one hand, to model experts' preferences in group decision-making, intuitionistic reciprocal preference relations have widely been used because they allow for accommodating hesitation degrees, which are inherent to all decision-making processes. On the other hand, an optimization of information granularity distribution has recently been applied to establish consensus during group decision-making processes. Concretely, a symmetric and uniform distribution of information granularity has been considered for intuitionistic reciprocal preference relations. However, there exist other protocols of information granularity distribution that could be used. Therefore, we aim to analyze all the information granularity distribution protocols and determine their effectiveness in building consensus through intuitionistic reciprocal preference relations. The performance of the different protocols is discussed by conducting some numerical experiments that help provide insights into the effectiveness of the protocols to build consensus.
Myocarditis, characterized by inflammation of the heart muscle, has seen a notable 62.2% increase in incidence over the past three decades, leading to 324,490 deaths in 2019. Despite advancements in understanding its ...
详细信息
Large Language Models (LLMs) are leading the Generative Artificial Intelligence transformation in natural language understanding. Beyond language understanding, LLMs have demonstrated capabilities in reasoning tasks, ...
详细信息
ISBN:
(数字)9798350362916
ISBN:
(纸本)9798350362923
Large Language Models (LLMs) are leading the Generative Artificial Intelligence transformation in natural language understanding. Beyond language understanding, LLMs have demonstrated capabilities in reasoning tasks, including commonsense, logical, and mathematical reasoning. However, their proficiency in causal understanding has been limited due to the complex nature of causal reasoning. Several recent studies have discussed the role of external causal models for improved causal understanding. Building on the success of Retrieval-Augmented Generation (RAG) for factual reasoning in LLMs, this paper introduces a novel approach that utilizes Causal Graphs as external sources for establishing causal relationships between complex vectors. This method is empirically evaluated using two benchmark datasets across the metrics of Context Relevance, Answer Relevance, and Grounding, in its ability to retrieve relevant context with causal alignment. The retrieval effectiveness is further compared with traditional RAG methods that are based on semantic proximity.
This study introduces a system designed to identify pests in crops and classify them as either beneficial or harmful. The project begins by providing a comprehensive overview of various pest identification methods, an...
详细信息
Generative modeling is a powerful technique that involves creating machine learning models capable of creating new data similar to the data it was trained on. Generative Adversarial Networks (GANs) are a leading appro...
Generative modeling is a powerful technique that involves creating machine learning models capable of creating new data similar to the data it was trained on. Generative Adversarial Networks (GANs) are a leading approach for generative modeling. However, GAN training is known to be a notoriously difficult task. GAN convergence issues are largely caused by the supports of the real and generated distributions being disjoint. To tackle this open problem, we propose a novel GAN pre-training process that effectively aligns the supports of the generated and real data prior to applying traditional adversarial GAN training. The key component of our method, called AlignGAN, is learning a mapping between the input data distribution and a latent representation defined over a hypersphere, regularized by a One Class Classifier. This successfully encourages the generator to produce samples throughout the support of the real data, while not generating samples outside the support. We maintain support alignment through low-bandwidth noise convolutions and additional One Class regularization, leading to continued stable GAN training. We validate our approach against leading stabilization methods on three benchmark datasets, showing AlignGAN routinely produces the best results.
Decompilation aims to convert binary code to high-level source code, but traditional tools like Ghidra often produce results that are difficult to read and execute. Motivated by the advancements in Large Language Mode...
详细信息
暂无评论