This systematic literature review explores the application of transformer models in early detection of human depression, encompassing text, audio, and video data modalities. Transformer architectures, notably BERT for...
详细信息
Recent advances in deep learning not only facilitate the implementation of zero-shot singing voice synthesis (SVS) and singing voice conversion (SVC) tasks but also provide the opportunity to unify these two tasks int...
详细信息
Objectives: This study aims to conduct a gap analysis to determine the feasibility of mapping electronic health record data from the Clinical Emergency data Registry (CEDR) to the Observational Medical Outcomes Partne...
详细信息
When processing datasets in diabetes classification, common problems included a large number of missing values, outliers, and dataset imbalance. To deal with those issues, this study analyzed 18 studies on diabetes cl...
详细信息
In this paper, we extend a recently developed machine-learning (ML) based CREASE-2D method to analyze the entire two-dimensional (2D) scattering pattern obtained from small angle X-ray scattering measurements of supra...
详细信息
K-Nearest Neighbor (KNN) is a widely used algorithm to gain an accurate and efficient classification. One of the drawbacks of the algorithm is the time required to calculate the distance for each point. In this paper,...
详细信息
data mining is utilized to explore banks' data to unravel any hidden scams and detect potential frauds. The aim of this paper is to compare between the Naïve Bayes, Decision Tree and Logistic Regression in fr...
详细信息
The characterization and classification of purity of limestone at Madura Island was investigated. Sampling was taken from nine quarries from different areas. The chemical analysis was carried out by X-ray fluorescence...
详细信息
In this research, we explore the technical and computational merits of a machine learning algorithm on a large data set, employing distributed systems. Using 167 million (10 GB) energy consumption observations collect...
详细信息
In this research, we explore the technical and computational merits of a machine learning algorithm on a large data set, employing distributed systems. Using 167 million (10 GB) energy consumption observations collected by smart meters from residential consumers in London, England, we predict future residential energy consumption using a Random Forest machine learning algorithm. Distributed systems such as AWS S3 and EMR, MongoDB and Apache Spark are used. Computational times and predictive accuracy are evaluated. We conclude that there are significant computational advantages to using distributed systems when applying machine learning algorithms on large-scale data. We also observe that distributed systems can be computationally burdensome when the amount of data being processed is below a threshold at which it can leverage the computational efficiencies provided by distributed systems.
暂无评论