ISBN: 9781643685434 (print)
When a Knowledge Discovery from Data (KDD) process (Fayyad, Piatetsky-Shapiro, & Smyth, 1996) is applied to extract knowledge, several methods can be used (Gibert et al., 2018). A simple and fast way to obtain preliminary insights from data before using KDD models is to generate a basic descriptive analysis. It is one of the most popular ways to describe experimental data and should be the starting point of every data project. Nevertheless, some of the main knowledge that a descriptive analysis could yield remains hidden because of underlying multivariate structures, which can be elicited through multivariate analysis techniques. Moreover, the domain expert is key to a proper interpretation of descriptive results. At the same time, there is a lack of automatic reporting techniques that can report and help interpret complex patterns and the results of advanced multivariate techniques. This paper presents a tool developed to generate automatic interpretations of Multiple Correspondence Analysis (MCA) and Principal Components Analysis (PCA) using RMarkdown. The tool generates a Word document containing the automatic interpretation of the results, built on regular expressions that elaborate on the R analytical outputs (either numerical or graphical). The proposal is being applied to real data, such as the INSESS database on social vulnerabilities of the Catalan population. In conclusion, the developed tool facilitates the interpretation of factorial method results, avoiding misinterpretation and the involuntary skipping of conclusions caused by the large amount of knowledge that can be extracted from a complete factorial analysis. The software also enables non-expert users to read multivariate analysis results in a friendly way. Moreover, the tool saves time in the interpretation step and gives the expert a basis for starting the report; the output of the software could even become the report or
This study focuses on developing an intelligent decision support system (IDSS) that helps a human operator make data-driven decisions. To put IDSS in production, it is necessary to develop two additional components: o...
Securing Internet of Things (IoT) devices is paramount to mitigate unauthorised access and potential cyber threats, safeguarding the integrity of transmitted and processed data within interconnected devices. Identifyi...
Real-world datasets are often high-dimensional and affected by the curse of dimensionality. This hinders their comprehensibility and interpretability. To reduce the complexity, feature selection aims to identify featu...
This research presents the HardnessTesterV app, a web application for predicting the Vickers hardness of laser-welded metallic alloy, using Flask API, HTML, and CSS to build the front end and back end. Vickers hardness ...
Topic models are a popular tool for clustering and analyzing textual data. They allow texts to be classified on the basis of their affiliation to the previously calculated topics. Despite their widespread use in resea...
A recent trend in Natural Language Processing is the exponential growth in Language Model (LM) size, which prevents research groups without the necessary hardware infrastructure from taking part in the development proce...
We introduce the Birkhoff completion as the smallest distributive lattice in which a given finite lattice can be embedded as a semi-lattice. We discuss its relationship to implicational theories, in particular to R. Wil...
In the realm of multi-intent spoken language understanding, recent advancements have leveraged the potential of prompt learning frameworks. However, critical gaps exist in these frameworks: the lack of explicit modeli...