This paper presents an improved, unsupervised machinelearning image segmentation system with density-based noisy spatial hierarchical clustering (IS-HDBSCAN). Compared with the conventional image segmentation cluster...
详细信息
This paper presents an improved, unsupervised machinelearning image segmentation system with density-based noisy spatial hierarchical clustering (IS-HDBSCAN). Compared with the conventional image segmentation clustering algorithm K-means, IS-HDBSCAN has better segmentation results, higher adaptivity and lower data pre-processing requirements in complex unstructured environments;compared with convolutional neural network FCN, IS-HDBSCAN does not rely on dataset and model training and has better noise robustness. The article firstly introduces the key issues in the field of image segmentation, then proposes the IS-HDBSCAN image segmentation system and introduces its principles and advantages. Finally, the comparison of image segmentation results in the field environment demonstrates that the IS-HDBSCAN-based image segmentation system has better segmentation results.
As a novel fast and effective intelligent algorithm, Broad learning System (BLS) is popularly used in data classification task. However, the real data are often imbalanced, which makes BLS have limited performance in ...
详细信息
The changes in temperature may arise risks in many industries. To solve this problem, the National Meteorological Center and Dalian Commodity Exchange jointly compiled a temperature index which includes 5 cities. Ther...
详细信息
Sneaker culture is a self-organizing culture spontaneously formed by teenagers. In the Internet and digital age, sports shoes are endowed with appreciation and financial attributes through manual speculation, which ha...
详细信息
The image shaping of tourist attractions is one of the important factors to attract tourists. This paper explores the perceived impression of Fuzhou in tourists' mind and the perceived dimensions of tourists' ...
详细信息
The high-temperature superheater outlet header (Outlet Header) in ultra-supercritical (USC) thermal power plants is subjected to high temperatures and pressures, which increases the risk of creep failure. To assess th...
详细信息
The high-temperature superheater outlet header (Outlet Header) in ultra-supercritical (USC) thermal power plants is subjected to high temperatures and pressures, which increases the risk of creep failure. To assess the structural reliability of the Outlet Header, it is necessary to consider the impact of uncertainty factors. Furthermore, the diverse operating conditions make reliability assessment inconvenient. This study evaluates the creep life reliability of the Outlet Header based on material uncertainty and simplifies the assessment process using machinelearning methods. Considering the scatter of creep rupture data, the uncertainty of material constants in the Larson-Miller (LM) model is quantified by randomly sampling a specific number of creep rupture life data. Based on the results of uncertainty quantification and finite element analysis, the distribution of the Outlet Header's creep life is obtained to calculate its reliability under design life. machinelearning is employed to assist in the reliability assessment of creep life under different operating conditions of Outlet Header. The results indicate that Artificial Neural Network (ANN) demonstrates good performance in this study, and an assessment diagram based on the ANN has been constructed. This approach provides a practical solution for assessing the reliability of high-temperature components in engineering.
The preparation of relational data for machinelearning (ML) has largely remained a manual, labor-intensive process, while automated machinelearning has made great strides in recent years. Long-standing challenges, s...
详细信息
ISBN:
(纸本)9781665481045
The preparation of relational data for machinelearning (ML) has largely remained a manual, labor-intensive process, while automated machinelearning has made great strides in recent years. Long-standing challenges, such as reliable foreign key detection still pose a major hurdle towards more automation of data integration and preparation tasks. We created a new dataset aimed at increasing the level of automation of data preparation tasks for relational data. The dataset, called GITSCHEMAS, consists of schema metadata for almost 50k real-world databases, collected from public GitHub repositories. To our knowledge, this is the largest dataset of such kind, containing approximately 300k table names, 2M column names including data types, and 100k real (not semantically inferred) foreign key relationships. In this paper, we describe how GITSCHEMAS was created, and provide key insights into the dataset. Furthermore, we show how GITSCHEMAS can be used to find relevant tables for data augmentation in an AutoML setting.
Generative adversarial networks (GANs) are powerful deep learning models for synthesizing realistic data. However, their performance critically depends on curating optimal training data. This research conducts a compr...
详细信息
Traffic congestion has become a critical issue in metropolitan areas, inflicting significant hardships on individuals on a daily basis. This adverse situation poses multifaceted challenges, including increased air pol...
详细信息
This tutorial presents a design and implementation of a scikit-compatible system for detecting anomalies from time series data for the purpose of offering a broad range of algorithms to the end user, with special focu...
详细信息
ISBN:
(数字)9781665408837
ISBN:
(纸本)9781665408837
This tutorial presents a design and implementation of a scikit-compatible system for detecting anomalies from time series data for the purpose of offering a broad range of algorithms to the end user, with special focus on unsupervised/semi-supervised learning. Given an input time series, we discuss how data scientist can construct four categories of anomaly pipelines followed by an enrichment module that helps to label anomaly. The tutorial provides an hand-on-experience using a deployed system on IBM API Hub for developer communities that aim to support a wide range of execution engines to meet the diverse need of anomaly workloads such as Serveless for CPU intensive work, GPU for deep-learning model training, etc.
暂无评论