Water is an important substance for the human body. Clean water is important for not just the human body, but also for the environment. In this paper, Prisma is used to filter many of the reference paper where the pap...
Water is an important substance for the human body. Clean water is important for not just the human body, but also for the environment. In this paper, Prisma is used to filter many of the reference paper where the paper left are used for the research. Machine learning is one of many ways to predict water quality. Using algorithms to process large amounts of data and patterns, water quality can be predicted. Prediction on water temperature, pH, and others can be used to find bad quality water and potentially find the cure for it. There are also different kinds of environment in predicting water quality, where depending on the environment certain machine learning method is better than the others. This paper contains machine learning methods, water parameters, and areas of water for water quality prediction.
The application of information technology in addition to making internal business processes more effective and efficient, is also to improve customer service. This role is important for companies to maintain the susta...
详细信息
The ability to identify cancer at an early stage is critical to increase the likelihood of effective treatment or stopping the progression of the disease in the body. Lung cancer is one of the most common deadly disea...
The ability to identify cancer at an early stage is critical to increase the likelihood of effective treatment or stopping the progression of the disease in the body. Lung cancer is one of the most common deadly diseases and quickly kills the patient. The number of deaths caused by lung cancer surpasses those of colon, rectal, breast, and prostate cancers combined. Unfortunately, only two percent of patients with advanced lung cancer survive for five years or more. However, the survival rates are better, with 49 % of the patients surviving for five years or more, if the disease is detected early. On the other hand, in this modern era, machine learning has become one of the most reliable tools in the world for healthcare, as machine learning can learn from the data obtained and process the data that will later be used to help complete certain tasks. Therefore, in this research, machine learning or especially classification algorithms such as K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Random Forest, Logistic Regression, and XGBoost are used to identify three main symptoms that serve as markers for early detection of lung cancer. Determining the three main symptoms is done by combining the results of the feature importance score on each model using the rank-averaging method. The result is an average ranking of each feature based on its combined importance with an accuracy of 93.5 %.
Protein structure prediction in three dimensions represents a fundamental challenge in Structural Bioinformatics. Leveraging problem-specific information such as fragment insertion, secondary structure, and contact ma...
详细信息
This research focused on social media applications that had been used by large-scale users. Data might be in the form of text, image, video, each with its own data processing complexity. In this study, the researchers...
详细信息
In this paper, we exploit caches on intermediate nodes for QoE enhancement of multi-view video and audio transmission over ICN/CCN by controlling the content request start timing of consumers. We assume the selected s...
Within observational Data science workloads, Berkson's paradox can lead to false causal inferences. One of the prominent quasi-experimental methods to mitigate this selection bias is Propensity Score Matching (PSM...
详细信息
ISBN:
(数字)9798331539603
ISBN:
(纸本)9798331539610
Within observational Data science workloads, Berkson's paradox can lead to false causal inferences. One of the prominent quasi-experimental methods to mitigate this selection bias is Propensity Score Matching (PSM). An approach called Neural PSM (NPSM) was developed to overcome the drawbacks of conventional regression-based PSM, including its limited flexibility to model high-dimensional data and non-linear relationships that could cause imperfect covariate balance. In this study, a three-layer depth of Deep Neural Networks was designed to estimate propensity scores and finally balance both control and treatment groups of the Groupon dataset. An unsupervised k-Nearest Neighbor algorithm then helped the model to efficiently detect and cluster similar matching points. From the five salient features presented, NPSM successfully achieved lower differences in Cohen's d effect size, i.e., 0.313 for coupon duration, 0.017 for promotion length, 0.425 for quantity sold, -0.199 for limited supply, and 0.395 for Facebook likes. While these results mostly outperformed Linear Regression (LR) and Random Forest (RF) models, further evaluation is needed to verify the true effectiveness of NPSM in mitigating Berkson's paradox in broader e-commerce contexts.
Enzymes are biocatalysts with vital roles in biological functions and many industrial applications. Diverse enzymes are classified using Enzyme Commission (EC) nomenclature, making differentiation challenging. On the ...
详细信息
ISBN:
(数字)9798331520311
ISBN:
(纸本)9798331520328
Enzymes are biocatalysts with vital roles in biological functions and many industrial applications. Diverse enzymes are classified using Enzyme Commission (EC) nomenclature, making differentiation challenging. On the other hand, another biological information, gene ontology (GO), can describe the biological aspects of enzymes, covering related biological processes (BP), molecular functions (MF), and their locations within cells (CC). This study proposes a novel EC class and subclass classification of enzymes within the ontology subclass based on their GO semantics using a Bidirectional Encoder Representation of Transformer (BERT). The BERT model is first fine-tuned using the preprocessed GO term name and definition, with the enzymes in each ontology class (BP, MF, or CC) are also divided based on how the GO assigned, either through manual annotation (NONIEA) or electronically inferred (IEA). BERT successfully obtained 0.93, 0.60, 0.99, 0.90, 0.40, and 0.35 F1 scores during fine-tuning for BP IEA, BP NONIEA, MF IEA, MF NONIEA, CC IEA, and CC NONIEA, respectively. On the test set, the fine-tuned BERT significantly outperformed GOntoSim, a framework to calculate semantic similarity based on classical information theory, in EC class classification across all metrics with less inference time in all ontology subclass. Expanded further to the EC subclass, BERT can classify the enzyme on the EC subclass level in BP IEA and MF IEA ontology subclass. However, longer epochs are needed in fine-tuning. This result shows that the names and definitions of GO terms are distinguishable features in classifying enzymes as an alternative to the information content approach.
This paper investigates the effect of bitrate control methods on QoE of multi-view video and audio streaming with MPEG-DASH. We adopt three bitrate control methods for conventional single-view video streaming to the M...
详细信息
This paper evaluates the QoE of video and audio transmission over a full-duplex wireless LAN with interference traffic through a computer simulation and a subjective experiment. We employ a simulation environment with...
详细信息
暂无评论