The traditional database technology can't meet the rapidly growing information and the demands of extreme scalability, high availability and reliability for the mass data. In this paper, we have designed a distrib...
详细信息
ISBN:
(纸本)9780769548975;9781467348256
The traditional database technology can't meet the rapidly growing information and the demands of extreme scalability, high availability and reliability for the mass data. In this paper, we have designed a distributed storage system based MPP (massively parallel processing) architecutre on RDBMS to solve these problems. First, we present the main framework and the function architecutre of the desiging. Then we do some experiments and performance tests. At last, we conclude that with the good scalability and massively parallel processing advantage, the system can solve the mass data storage problem. The idea of distributed storage system based MPP architecture in relational databases speeds up reading and writing, improving the shortcomings of traditional database.
Locality-Sensitive Hashing (LSH) is a well-known approximate nearest-neighbor search algorithm for high-dimensional data. Though LSH searches nearest-neighbor points for a query very fast, LSH has a drawback that the ...
详细信息
ISBN:
(纸本)9780769546513
Locality-Sensitive Hashing (LSH) is a well-known approximate nearest-neighbor search algorithm for high-dimensional data. Though LSH searches nearest-neighbor points for a query very fast, LSH has a drawback that the space complexity is very large. For this reason, so as to apply LSH to a large dataset, it is crucial to implement LSH in distributed environments which consist of multiple nodes. One simple and natural method to implement LSH in the distributed environment is to have every node keep the same number of hash tables. However, this method increases remote accesses, because many nodes are accessed to access all the hash tables. Thus, this simple method will suffer from the long query response time, if the communication delay is the bottleneck. This paper proposes to reduce remote accesses by assigning hash buckets smartly to the nodes. In particular, our method assigns hash buckets from different hash tables to the same node, if the hash buckets store the same points. Due to this strategy, our method can access multiple hash buckets that should be accessed in processing a query with a single remote access, thereby decreasing remote accesses.
In the past, the MFFP-tree algorithm was proposed to handle the quantitative database for efficiently mining the complete fuzzy frequent itemsets. In this paper, we propose an integrated MFFP (called iMFFP)-tree algor...
详细信息
ISBN:
(纸本)9783642284878
In the past, the MFFP-tree algorithm was proposed to handle the quantitative database for efficiently mining the complete fuzzy frequent itemsets. In this paper, we propose an integrated MFFP (called iMFFP)-tree algorithm for merging several individual MFFP trees into an integrated one. It can help derive global fuzzy rules among distributed databases, thus allowing managers to make more sophisticated decisions. Experimental results also showed the performance of the proposed approach.
This paper proposes an approach to improve the level of Quality of Experience (QoE) that distributed database systems provide. Quality of Experience is a measure of users' satisfaction when using a certain service...
详细信息
This paper proposes an approach to improve the level of Quality of Experience (QoE) that distributed database systems provide. Quality of Experience is a measure of users' satisfaction when using a certain service or application. Therefore, the main objective of this paper is to provide mechanisms to increase users' satisfaction when accessing distributed database systems. In traditional database systems, users cannot specify execution-related constraints. Then, the database system cannot evaluate if user expectations are satisfied and neither the system can take corrective actions when necessary. In this work, we present the QoE-oriented distributed database system (QoE-DDB). It allow users to specify Data Access Requirements (DARs) and aims to please users by satisfying the DARs they define. We define a set of types of Data Access Requirements and propose some SQL extensions that enable users to specify execution-related requirements. Proposed types of DARs include execution deadline and priority, execution start and finish times, data availability and freshness degrees, and disconnected execution mode. In our QoE-DDB, each user's command is transformed into one or more tasks that are executed by data services. Community modules and local data services negotiate Service Level Objectives (SLOs) for each task, which improves the system's dependability. We propose both QoE-oriented scheduling and dynamic data placement strategies. Proposed architecture and scheduling strategies enable the system to be used in a wide range of distributed environments, from tightly-coupled homogeneous environments (e.g. composed by off-the-shelf computers connected by a LAN) to highly heterogeneous and geographically distributed systems, where data services have some degree of autonomy. Traditional performance indicators (e.g. throughput and response time) are not adequate to measure the QoE a system provides. We also propose some specialized Key Performance Indicators (KPIs) to estima
Considering the expense, the security of communication and demand in practice, a strategy is proposed, which propagates replicated data based on message and transaction-log table to update and maintain the database. I...
详细信息
To realize consistency and security of the system database, the method to ensure data synchronization of distributed database is discussed. Firstly the whole system requirement and network environment is introduced, a...
详细信息
ISBN:
(纸本)9781612848334
To realize consistency and security of the system database, the method to ensure data synchronization of distributed database is discussed. Firstly the whole system requirement and network environment is introduced, and then describes the distributed structure model of database. Using the advanced replication feature in Oracle database, basing on the "Commerce and Industry business data center" of a special province industrial and commercial system, data synchronization for data center of commerce and industry management system is designed and implemented. The approach used in Commerce and Industry business data center is also suitable for business organizations, taxation, public security, and some industries.
Based on the study about the common methods of speeding queries, a new project of using data fragment technology to speed queries of mass data has been advanced. At the same time, the technologies of database synchron...
详细信息
Based on the study about the common methods of speeding queries, a new project of using data fragment technology to speed queries of mass data has been advanced. At the same time, the technologies of database synchronization and load balance scheduling are also solved in the project. As an example, the solution of the public security management information system in hotels shows how to realize it.
In the framework of the African Monsoon Multidisciplinary Analyses (AMMA) programme, several tools have been developed in order to facilitate and speed up data and information exchange between researchers from differe...
详细信息
In the framework of the African Monsoon Multidisciplinary Analyses (AMMA) programme, several tools have been developed in order to facilitate and speed up data and information exchange between researchers from different disciplines. The AMMA information system includes a multidisciplinary user-friendly distributed data management and distribution system, a reports and quick looks archive associated with a display website and scientific papers exchange systems. All the applications have been developed by several French institutions and fully duplicated in Niamey, Niger. Copyright (C) 2010 Royal Meteorological Society
Several national and global initiatives aim to increase access to biodiversity information worldwide. The Italian National Biodiversity Network, started in the framework of the project "Sistema Ambiente 2010"...
详细信息
Several national and global initiatives aim to increase access to biodiversity information worldwide. The Italian National Biodiversity Network, started in the framework of the project "Sistema Ambiente 2010", will organise and manage biodiversity data hosted by museums, universities and research centres in Italy, in order to make them widely available on the Web.
Due to covert channel problem, traditional concurrency control protocols cannot be used in multilevel secure database system. Concurrency control protocol for multilevel secure database should ensure that transactions...
详细信息
ISBN:
(纸本)0769530508
Due to covert channel problem, traditional concurrency control protocols cannot be used in multilevel secure database system. Concurrency control protocol for multilevel secure database should ensure that transactions at lower security level are never delayed by higher security level transactions in the event of a data conflict. This may lead to the abortion or re-execution of high security level transactions over and over again making the secure concurrency control protocol unfair towards high security level transactions. In this paper we propose a feedback based secure concurrency control protocol for multilevel secure distributed databases, which ensure fairness property for transactions at different security level. Through a detailed simulation study we examine the fairness and performance of our feedback based secure concurrency control for a variety of workloads.
暂无评论