As artificial intelligence and human life become increasingly inseparable, the legal or ethical issues faced by artificial intelligence systems in autonomous decision-making are also increasing. During the training pr...
详细信息
As artificial intelligence and human life become increasingly inseparable, the legal or ethical issues faced by artificial intelligence systems in autonomous decision-making are also increasing. During the training process, the algorithms may be influenced by human biases such as gender, race, and other factors, leading to discrimination and affecting fairness. To establish secure intelligent systems, fair machinelearning has become a popular research direction. This work demonstrates the existing definitions of fairness and designs experiments to show that combining clustering algorithms into the data handling process can effectively improve the classification accuracy and fairness on bank loan dataset. In the case study, K-means clustering, hierarchical clustering and Gaussian Mixture Model are used, proving that the clustering algorithms can significantly improve the accuracy of the model and ensure the relative fairness of the classification results.
This study identified common genes associated with various primary cancers, including prostate cancer, and established a predictive model to accurately forecast the likelihood of cancer occurrence and its specific typ...
详细信息
ISBN:
(纸本)9798350388732;9798350388725
This study identified common genes associated with various primary cancers, including prostate cancer, and established a predictive model to accurately forecast the likelihood of cancer occurrence and its specific type. The aim is to offer physicians assistance in treatment decisions and meticulous follow-up, ultimately improving patient prognosis. Tumor sample data were collected from TCGA database, and differential expression analysis was employed for feature gene selection. machinelearning models were constructed to trace the origin of cancer genes. The results revealed 663 differentially expressed genes exhibiting characteristic expression in prostate cancer, squamous cell lung cancer, thyroid cancer, clear cell renal cell carcinoma, and bladder urothelial carcinoma. Logistic regression demonstrated superior stability and performance, with an average accuracy increase of 4% compared to other models. Therefore, precise prediction of cancer occurrence and its specific type based on gene expression status can be achieved, providing robust support for physicians' diagnosis and treatment decisions. This approach has the potential to enhance patient prognosis by enabling accurate predictions and targeted interventions.
Deploying machinelearning (ML) applications over distributed stream processing engines (DSPEs) such as Apache Spark Streaming is a complex procedure that requires extensive tuning along two dimensions. First, DSPEs h...
详细信息
ISBN:
(纸本)9781665481045
Deploying machinelearning (ML) applications over distributed stream processing engines (DSPEs) such as Apache Spark Streaming is a complex procedure that requires extensive tuning along two dimensions. First, DSPEs have a vast array of system configuration parameters (such as degree of parallelism, memory buffer sizes, etc.) that need to be optimized to achieve the desired levels of latency and/or throughput. Second, each ML model has its own set of hyper-parameters that need to be tuned as they significantly impact the overall prediction accuracy of the trained model. These two forms of tuning have been studied extensively in the literature but only in isolation from each other. This position paper identifies the necessity for a combined system and ML model tuning approach based on a thorough experimental study. In particular, experimental results have revealed unexpected and complex interactions between the choices of system configuration and hyper-parameters, and their impact on both application and model performance. These findings open up new research directions in the field of self-managing stream processing systems.
Generalization problems are common in machinelearning models, particularly in healthcare applications. This study addresses the issue of real-world generalization and its challenges by analyzing a specific use case: ...
详细信息
This paper considers the general k-μ fading distribution, for which fading model this distribution is proposed. First, an expression for the channel capacity (CC) of multi-branch selection combiner influenced by k-μ...
详细信息
Dementia is a chronic and degenerative condition, which has become a major health concern among the elderly. With ever-continuing cases of dementia, it has become a very challenging task in the 21st century to provide...
详细信息
An intelligent agent is required to fuse heterogeneous sources of information together for which it should be equipped with both the data-driven (statistical) and knowledge-driven (symbolic) AI disciplines. Semantic T...
详细信息
An intelligent agent is required to fuse heterogeneous sources of information together for which it should be equipped with both the data-driven (statistical) and knowledge-driven (symbolic) AI disciplines. Semantic Technologies make it possible by creating links between disparate and heterogeneous data. When the data is linked as well as open, it is termed as Linked Open data (LOD). The Symbolic AI and sub-symbolic AI have to go together. The symbolist approach nowadays is manifested as a knowledge graph that advanced statistics and machinelearning can run on top of.
Federated learning is a new paradigm on the machinelearning system that uses the traditional system of machinelearning but implements privacy features on top of it. The implementation of federated learning is done i...
详细信息
The usage of social networks is growing exponentially day by day. Users can obtain a lot of information from social networks where some genuine information as well as some misinformation are there that mislead users a...
详细信息
The motion picture industry, commonly known as the film industry, is a huge investment sector. Furthermore, there is a massive amount of data related to movies on the internet. As a result, it became an interesting to...
详细信息
暂无评论