检索结果-内蒙古大学图书馆

Join query optimization in distributed database based on multi-source mating selection evolutionary algorithm

CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS 2025年第5期28卷 1-23页

作者： Du, Yan Ding, Zhiming Cai, Zhi Chi, Yuanying Beijing Univ Technol Beijing 100124 Peoples R China Chinese Acad Sci Inst Software Beijing 100190 Peoples R China

In a distributed database system, the data is distributed on multiple sites in the cluster. So for join queries involving large amount of data access and complex computation, how to efficiently use each site to complete data reading and computation is one of the key issues in query optimization. With the development of network communication technology, the cost of data transmission in network is no longer the only factor limiting the query efficiency, especially for distributed databases deployed in high-speed local area networks, the cost of CPU computation of local sites and the cost of data I/O also need to be considered. In this regard, a multi-source mating selection based differential evolutionary artificial bee colony algorithm is proposed in this paper to solve the distributed database query optimization problem under high-speed local area network deployment. In this algorithm, the population is first initialized using the good node set method so that the population can be more evenly distributed in the feasible domain, and then the genetic algorithm is combined with the artificial bee colony algorithm to improve the performance of the algorithm. At the same time, spectral clustering is introduced to mine the regular characteristics of the population, and a multi-source mating selection and recombination operator is designed to guide the algorithm search based on the obtained structured information of the population, which can accelerate the convergence of the algorithm by using the recombination of similar individuals while maintaining the diversity of the population by setting multiple sources of mating selection for each individual. Finally, simulation comparison experiments are conducted with other methods under different query sizes, and the results show that the proposed method is able to produce less costly query execution plans. And to a certain extent, it is able to reduce the query response time and improve the query efficiency.

关键词： distributed database High-speed local area network Artificial bee colony algorithm Good node set Spectral clustering Multi-source recombination operator

来源：评论

学校读者我要写书评

暂无评论

The Conception of distributed database of Video Data of the Educational Process 27th

The Conception of Distributed Database of Video Data of the ...

引用

27th International Conference on Interactive Collaborative Learning-ICL

作者： Kirilova, Galia I. Khasanova, Gulnara F. Levina, Elena U. Garifullina, Rezeda R. Kazan Volga Region Fed Univ Kazan Russia A N Tupolev Kazan Natl Res Technol Univ Kazan Russia Kazan Natl Res Tech Univ KAI Kazan Russia Kazan State Power Engn Univ Kazan Russia

ISBN: (纸本)9783031856518;9783031856525

The paper presents a conception of a distributed database of video data for educational purpose. The study was conducted at the Kazan Federal University, the Kazan National Research Technological University, and the Kazan State Power Engineering University. Sharing the distributed data to improve educational content and digital resources designed to serve the processes of digitalization in the Russian educational system is under analysis. Features of this conception include: stack metadata structure that provides inheritance and personalized liability in the educational information database;secure storage of texts, audio and video coding using watermarks;taking into account the structure of the speech signal based on the algorithms of its selection;connecting and maintaining the work of an intelligent analyzer. The experimental study (N = 350) involving teachers and higher education students working in schools included initial and formative stages. The result of the initial stage made it possible to reveal fragmented experience, low motivation and unformed readiness of the majority of practicing teachers for the use of a distributed educational database. Students from control and experimental groups were involved in the formative stage. Accordingly, statistically significant positive results were obtained, indicating the effectiveness of the proposed concept.

关键词： distributed database educational videos stack metadata

来源：评论

学校读者我要写书评

暂无评论

Research on distributed database Stability Testing Platform based on Chaos Engineering 22

Research on Distributed Database Stability Testing Platform ...

引用

IEEE 22nd International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom) / BigDataSE Conference / CSE Conference / EUC Conference / ISCI Conference

作者： Wang, Chaolun Yang, Jiaxing Han, Xiaolu Ma, Jianrui Liu, Siyuan Ma, Pengwei China Acad Informat & Commun Technol Beijing Peoples R China

ISBN: (纸本)9798350381993;9798350382006

With the fast development of information technology, the databases used as the fundamental storage and computing component of information system have to deal with much more complicated scenarios with high frequency and concurrency. distributed databases are becoming more and more common in data-intensive industries such as banking and telecommunications. The selection of distributed database products requires comprehensively consider the function, performance, security, ease of use and stability of the product. The stability of distributed database is known to be difficult in testing and product selection. The method widely used in database testing is TPC-DS (transaction processing performance council-decision support) benchmark, which is only suitable for functional and performance testing. To deal with this drawback, a distributed database stability testing platform based on chaos engineering methodology is developed Through the perturbation injection by using the stability testing platform, the performance fluctuation of the database under pressure condition can be observed, and the stability of the database can then be evaluated

关键词： database distributed database database testing database product selection big data chaos engineering

来源：评论

学校读者我要写书评

暂无评论

Security and Efficient Data Verification Protocol for distributed database based on Zero-knowledge Proof 27

Security and Efficient Data Verification Protocol for Distri...

引用

27th International Conference on Computer Supported Cooperative Work in Design (CSCWD)

作者： Liu, Han Bai, YunXu Cent South Univ Sch Mech & Elect Engn Changsha Peoples R China

ISBN: (纸本)9798350349184;9798350349191

Recently, distributed databases have achieved tremendous realistic performances and developed one of the most essentially utilized tools in society communication applications. However, the existing distributed databases often contain users' sensitive information and are vulnerable to web attackers, which may cause severe privacy issues and economic loss. In this paper, we first attempt to propose a novel protocol to dispose of the potential verification risks in distributed databases. Compared with currently distributed databases, the requester can steal important data without any payment. Therefore, our model faces two primary challenges including guaranteeing the efficiency and security of the distributed databases, the data verification procedure may lead to data leakage. To address the above problems, we utilize zero-knowledge proof to dispose of the data verification issue for the requester. Moreover, a secure and effective proof protocol is established to achieve database responses the privacy data access. From our extensive experimental results, we can conclude that our developed framework can achieve an effective performance with reasonable communication costs.

关键词： distributed database Data Verification Security Zero-knowledge Proof

来源：评论

学校读者我要写书评

暂无评论

Multivariate Log-based Anomaly Detection for distributed database 24

Multivariate Log-based Anomaly Detection for Distributed Dat...

引用

30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

作者： Zhang, Lingzhe Jia, Tong Jia, Mengxi Li, Ying Yang, Yong Wu, Zhonghai Peking Univ Beijing Peoples R China

ISBN: (纸本)9798400704901

distributed databases are fundamental infrastructures of today's large-scale software systems such as cloud systems. Detecting anomalies in distributed databases is essential for maintaining software availability. Existing approaches, predominantly developed using Loghub-a comprehensive collection of log datasets from various systems-lack datasets specifically tailored to distributed databases, which exhibit unique anomalies. Additionally, there's a notable absence of datasets encompassing multi-anomaly, multinode logs. Consequently, models built upon these datasets, primarily designed for standalone systems, are inadequate for distributed databases, and the prevalent method of deeming an entire cluster anomalous based on irregularities in a single node leads to a high false-positive rate. This paper addresses the unique anomalies and multivariate nature of logs in distributed databases. We expose the first open-sourced, comprehensive dataset with multivariate logs from distributed databases. Utilizing this dataset, we conduct an extensive study to identify multiple database anomalies and to assess the effectiveness of state-of-the-art anomaly detection using multivariate log data. Our findings reveal that relying solely on logs from a single node is insufficient for accurate anomaly detection on distributed database. Leveraging these insights, we propose MultiLog, an innovative multivariate log-based anomaly detection approach tailored for distributed databases. Our experiments, based on this novel dataset, demonstrate MultiLog's superiority, outperforming existing state-of-the-art methods by approximately 12%.

关键词： Anomaly Detection distributed database Anomaly Injection Multivariate Log Analysis

来源：评论

学校读者我要写书评

暂无评论

Construction of distributed database Prototype for Real time data Synchronization Method under Improved ABC Algorithm 2

Construction of Distributed Database Prototype for Real time...

引用

2nd IEEE International Conference on Integrated Circuits and Communication Systems, ICICACS 2024

作者： Xu, Huan Liang, Yingwei Chen, Bin Digital Department of China Southern Power Grid Guangzhou Guangzhou China Information Center of Guangdong Power Grid Guangzhou Guangzhou China

ISBN: (纸本)9798350317558

At present, with the rapid development of the information industry, database technology has been greatly affected, and distributed database systems have gradually emerged. This article explains in detail the implementation of the ABC (Artistic Bee Colony) algorithm, starting from the overall system diagram, and the data processing of the system is explained in detail. On this basis, this paper proposes a data warehouse model based on XML (eXtensible Markup Language), and simulates the model. This article also expounds the application of ETL (Extract Transform Load) and XML technology in data transmission, and points out that this mechanism has the characteristics of high efficiency and reliability. In a normal network, the number of failures is 3, and the average time between failures is 999 hours. The synchronization mechanism in this article can reduce the pressure on computers that use technologies such as data warehouses for synchronization. © 2024 IEEE.

关键词： Data Warehouse Model distributed database Improved ABC Algorithm Prototype Construction Real-time Data Synchronization

来源：评论

学校读者我要写书评

暂无评论

A Query-Level distributed database Tuning System with Machine Learning 13

A Query-Level Distributed Database Tuning System with Machin...

引用

13th IEEE International Conference on Joint Cloud Computing (JCC)

作者： Fang, Xiang Zou, Yi Fang, Yange Tang, Zhen Li, Hui Wang, Wei Chinese Acad Sci Inst Software State Key Lab Comp Sci Beijing Peoples R China Univ Chinese Acad Sci Beijing Peoples R China Nanjing Inst Software Technol Nanjing Peoples R China Univ Chinese Acad Sci Nanjing Coll Beijing Beijing Peoples R China

ISBN: (数字)9781665462853

ISBN: (纸本)9781665462853

Knob tuning is important to improve the performance of database management system. However, the traditional manual tuning method by DBA is time-consuming and error-prone, and can not meet the requirements of different database instances. In recent years, the research on automatic knob tuning using machine learning algorithm has gradually sprung up, but most of them only support workload-level knob tuning, and the studies on query-level tuning is still in the initial stage. Furthermore, few works are focus on the knob tuning for distributed database. In this paper, we propose a query-level tuning system for distribute database with the machine learning method. This system can efficiently recommend knobs according to the feature of the query. We deployed our techniques onto CockroachDB, a distribute database, and experimental results show that our system achieves higher performance under typical OLAP workload. For all categories of queries, our system reduces the latency by 9.2% on average, and for some categories of queries, this system reduces the latency by more than 60%.

关键词： query-level knob tuning distributed database machine learning

来源：评论

学校读者我要写书评

暂无评论

Research and Implementation of Parallel CART Algorithm Based on distributed database 6

Research and Implementation of Parallel CART Algorithm Based...

引用

2023 IEEE 6th International Conference on Information Systems and Computer Aided Education, ICISCAE 2023

作者： Wang, Jie The Open University of Guangdong School of Artificial Intelligence Guangzhou China

ISBN: (纸本)9798350313444

This paper presents a parallel CART method for multivariate association query. The amount of communication information is reduced by performing two semi-joins. This method adopts a multi-node parallel method, which greatly speeds up the execution efficiency of the system. The sequence of operations generated by this algorithm has global optimization characteristics. One of its important development trends is to realize effective data query by optimizing the database. The simulation experiment proves that the method has fast processing speed, high node utilization rate and strong practicability. This method is suitable for large data processing. © 2023 IEEE.

关键词： distributed database optimization algorithm parallel CART algorithm query processing

来源：评论

学校读者我要写书评

暂无评论

Research and Application of Data Partition Technology in distributed database 3

Research and Application of Data Partition Technology in Dis...

引用

3rd Information Communication Technologies Conference (ICTC)

作者： Zhu, Mingying Liu, Zhiqiong Chi, Weicheng Zhang, Jinjuan Hua, Zhuxuan Shi, Lixue China Telecom Co Ltd Res Inst Guangzhou Peoples R China China Telecom Co Ltd Beijing Peoples R China

ISBN: (纸本)9781665495080

distributed database has the characteristics of high scalability, high availability, low cost and performance improvement. How to build an appropriate data partition is the core problem for distributed database to solve the storage problem and improve the performance at the same time. By studying the data partition technology of distributed database, this paper gives the principles and design methods of database partition in distributed database design, and puts forward formulae of aggregation and balance that needs to be paid attention to in database partition, which provides a new idea for the transformation of database partition design from qualitative analysis to quantitative analysis. Moreover, why aggregation should be considered as one of the principles of data partitioning in distributed database is verified by experiments, and the application of database partition technology is illustrated by an example in telecom business support system.

关键词： distributed database data partition distributed transaction aggregation balance

来源：评论

学校读者我要写书评

暂无评论

Fuzzy c-Lines for Vertically distributed database with Missing Values 12

Fuzzy c-Lines for Vertically Distributed Database with Missi...

引用

Joint 12th International Conference on Soft Computing and Intelligent Systems / 23rd International Symposium on Advanced Intelligent Systems (SCIS and ISIS)

作者： Kunisawa, Kohei Honda, Katsuhiro Ubukata, Seiki Notsu, Akira Osaka Prefecture Univ Grad Sch Engn Sakai Osaka 5998531 Japan Osaka Metropolitan Univ Grad Sch Informat Sakai Osaka 5998531 Japan Osaka Metropolitan Univ Grad Sch Sustainable Syst Sci Sakai Osaka 5998531 Japan

ISBN: (纸本)9781665499248

Privacy preserving data clustering is a useful method for extracting intrinsic cluster structures from distributed databases keeping personal privacy. In a previous research, a model of performing Fuzzy c-Lines clustering was proposed, where a privacy preserving scheme of k-means-type model was adopted with cryptographic calculation. This paper further improves the model for handling incomplete data ignoring the influences of missing values. The element-wise clustering criterion enables to derive local principal component vectors in each data sources by considering minimization of low-rank approximation of observed elements only. Then, fuzzy memberships of each object are calculated in a collaborative manner among organizations, where partial distances between objects and prototypes are derived with cryptographic framework so that intra-organization information is kept secret. The characteristic features of the proposed method are demonstrated through numerical experiments.

关键词： Fuzzy clustering Linear clustering distributed database

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：