The freely available tabular data represented in various digital formats, such as print-oriented documents, spreadsheets, and web pages, are a valuable source to populate knowledge graphs. However, difficulties that i...
详细信息
We present a robust, parallel primal-dual heuristic algorithm for the k-medoids clustering problem, a widely utilized method in data mining and machine learning. Our approach surpasses current algorithms by effectivel...
We present a robust, parallel primal-dual heuristic algorithm for the k-medoids clustering problem, a widely utilized method in data mining and machine learning. Our approach surpasses current algorithms by effectively addressing their limitations, such as time-consuming distance matrix calculations, inefficient nearest-neighbor searches, and difficulties in handling large-scale datasets. To overcome these challenges, we employ an efficient parallel implementation, combined with a pioneering subgradient search algorithm. We evaluate our algorithm on the BIRCH and Stanford Dog datasets and demonstrate its superiority over existing k-medoids clustering algorithms in terms of solution quality and run time. Additionally, we introduce a novel vectorization technique that enables our algorithm to handle various types of data, such as images, text, and point data. Overall, our work contributes to the field of data mining and machine learning by providing an efficient and effective solution for the k-medoids clustering problem. The proposed algorithm offers improved performance, and versatility, making it a valuable tool for a wide range of applications.
This work addresses the optimistic statement of a bilevel optimization problem with a general d.c. optimization problem at the upper level and a convex optimization problem at the lower level. First, we use the reduct...
详细信息
The paper addresses relevant issues applying the concept of Industry 4.0 in related to modeling infrastructure objects at the Baikal natural territory that use environmentally friendly technologies. In particular, the...
详细信息
This paper addresses the numerical solution of fractional programs with quadratic functions in the ratios. Instead of considering a sum-of-ratios problem directly, we developed an efficient global search algorithm, wh...
详细信息
We offer a specialized toolkit for automating both knowledge management when creating an applied microservices package and data accumulating during its application for scientific computations in a hybrid computing env...
详细信息
This paper addresses the general optimization problem ($$\mathcal P$$) with equality and inequality constraints and the cost function given by d.c. functions. We reduce the problem to a penalized problem ($$\mathcal P...
详细信息
Spreadsheets are one of the most convenient ways to structure and represent statistical and other data. In this connection, automatic processing and semantic interpretation of spreadsheets data have become an active a...
详细信息
The paper discusses the problem of preparing a computing environment for large-scale scientific experiments in the process of continuous integration of applied and system software. A comparative analysis of software c...
详细信息
Facility disruptions or failures may occur due to natural disasters or a deliberate man-made attack. Such an attack is known as interdiction. Recently, facility location problems, addressing intentional strikes agains...
详细信息
暂无评论