Distributed database systems offer scalability and fault tolerance by replicating databases across geographically dispersed nodes. This redundancy aims to ensure data availability even during failures and allows for b...
详细信息
Distributed database systems offer scalability and fault tolerance by replicating databases across geographically dispersed nodes. This redundancy aims to ensure data availability even during failures and allows for backups in case of disasters. However, maintaining strong data consistency, where all nodes reflect the latest data simultaneously, becomes a challenge in such geographically distributed setups since database systems need to prioritize availability or performance over strong data consistency. This study examines how different data consistency configurations affect the performance of popular nosql (Not only SQL) databases, namely Cassandra, MongoDB, and Redis, in a multi-region cloud environment. We adopt the Yahoo! Cloud Serving Benchmark (YCSB) tool to simulate various workloads, measure performance metrics, and compare the results. Our findings reveal significant performance degradation associated with strong data consistency configurations. For instance, in Cassandra, the number of writing/reading operations processed per second can decrease by up to 95% for specific workloads. Similarly, enforcing strong data consistency in Redis can result in execution times that are over 20 times slower on writing/reading operations.
nosql databases, known for their flexibility and scalability, have become pivotal in handling diverse and unstructured data. However, their schema-less nature introduces significant challenges in metadata management, ...
详细信息
For over forty years, relational databases have been the leading model for data storage, retrieval and management. However, due to increasing needs for scalability and performance, alternative systems have started bei...
详细信息
ISBN:
(纸本)9783319164861;9783319164854
For over forty years, relational databases have been the leading model for data storage, retrieval and management. However, due to increasing needs for scalability and performance, alternative systems have started being developed, namely nosql technology. With increased interest in nosql technology, as well as more use case scenarios, over the last few years these databases have been more frequently evaluated and compared. It is necessary to find if all the possibilities and characteristics of non-relational technology have been disclosed. While most papers perform mostly performance evaluation using standard benchmarks, it is nevertheless important to notice that real world scenarios, with real enterprise data, do not function solely based on performance. In this paper, we have gathered a concise and up-to-date comparison of nosql engines, their most beneficial use case scenarios from the software engineer viewpoint, their advantages and drawbacks by surveying the currently available literature.
Industrial Internet of Things (IIoT)-enabled production facilities generate vast amounts of data, which, if harnessed effectively, can substantially enhance manufacturing efficiency through latency reduction. The sele...
详细信息
Industrial Internet of Things (IIoT)-enabled production facilities generate vast amounts of data, which, if harnessed effectively, can substantially enhance manufacturing efficiency through latency reduction. The selection of the appropriate data storage technology is a pivotal consideration in achieving this objective. While prior studies have examined SQL and nosql databases in terms of latency and energy efficiency, these evaluations have not been conducted specifically within the context of IIoT. This paper aims to fill this research gap by conducting a rigorous comparison of SQL and nosql databases, focusing on their performance latency and energy efficiency when interfaced with IoT nodes. By elucidating these relationships, our research offers actionable insights that can guide IIoT-enabled manufacturing facilities in optimizing their operations. Specifically, the paper aids in the selection of the most suitable database technology, thereby contributing to latency minimization and efficiency maximization in industrial settings.
With the development of cloud computing and Internet;e-Commerce, e-Business and corporate world revenue are increasing with high rate. These areas not only require scalable and consistent databases but also require in...
详细信息
With the development of cloud computing and Internet;e-Commerce, e-Business and corporate world revenue are increasing with high rate. These areas not only require scalable and consistent databases but also require inter database transaction support. In this paper, we present, a scalable three-tier architecture along with a distributed middle-ware protocol to support atomic transactions across heterogeneous nosql databases. Our methodology does not compromise on any assumption on the accuracy of failure modalities. Hence, it is suitable for a class of heterogeneous distributed systems. To achieve such a target, our architectural model exploits an innovative methodology to achieve distributed atomic transactions. We simulate this architectural setup with different latency tests under different environments to produce reliable impact and correctness. (C) 2014 Elsevier Inc. All rights reserved.
nosql databases disrupted the database market when first introduced. Their contemporary relevance has increased further in the era of big data due to the demands placed on (real-time) analytics. nosql databases are we...
详细信息
nosql databases disrupted the database market when first introduced. Their contemporary relevance has increased further in the era of big data due to the demands placed on (real-time) analytics. nosql databases are well placed to meet these demands due to their performance, availability, scalability, and storage solutions. Unfortunately, to achieve these features, compromises have been made with respect to security and privacy. Growing community awareness and unease combined with increased legislative requirements around data privacy have made such compromises less palatable, risky, or downright unacceptable. And though there is a growing body of knowledge related to data privacy in nosql databases, it is diverse and fragmented, and does not adequately address the challenges arising from the current environment. This paper aims to systematically examine various privacy weaknesses of nosql databases in the form of patterns. The patterns are shown to manifest themselves in well-known nosql databases and this evaluation can be used for benchmarking purposes. Through a survey it is demonstrated that the patterns have been observed in practice and are perceived as relevant. The pattern collection forms a repository of knowledge that can serve as a starting point for future privacy-related research for nosql databases through its identification of key problems, trade-offs, existing solution mechanisms, and its provision of terminology.
The EU’s Artificial Intelligence Act(AI Act)imposes requirements for the privacy compliance of AI *** systems must comply with privacy laws such as the GDPR when providing *** laws provide users with the right to iss...
详细信息
The EU’s Artificial Intelligence Act(AI Act)imposes requirements for the privacy compliance of AI *** systems must comply with privacy laws such as the GDPR when providing *** laws provide users with the right to issue a Data Subject Access Request(DSAR).Responding to such requests requires database administrators to identify information related to an individual ***,manual compliance poses significant challenges and is *** administrators need to write queries through time-consuming *** demand for large amounts of data by AI systems has driven the development of nosql *** to the flexible schema of nosql databases,identifying personal information becomes even more *** paper develops an automated tool to identify personal information that can help organizations respond to *** tool employs a combination of various technologies,including schema extraction of nosql databases and relationship identification from query *** describe the algorithm used by our tool,detailing how it discovers and extracts implicit relationships from nosql databases and generates relationship graphs to help developers accurately identify personal *** evaluate our tool on three datasets,covering different database designs,achieving an F1 score of 0.77 to *** results demonstrate that our tool successfully identifies information relevant to the data *** tool reduces manual effort and simplifies GDPR compliance,showing practical application value in enhancing the privacy performance of nosql databases and AI systems.
nosql databases are being increasingly used for efficient management of high volumes of unstructured data in applications like information retrieval, natural language processing, social computing, etc. However, unlike...
详细信息
nosql databases are being increasingly used for efficient management of high volumes of unstructured data in applications like information retrieval, natural language processing, social computing, etc. However, unlike traditional databases, data protection measures such as access control for these databases are still in their infancy, which could lead to significant vulnerabilities and security/privacy issues as their adoption increases. Attribute-based Access Control (ABAC), which provides a flexible and dynamic solution to access control, can be effective for mediating accesses in typical usage scenarios for nosql databases. In this paper, we propose a novel methodology for enabling ABAC in nosql databases. Specifically we consider MongoDB, which is one of the most popular nosql databases in use today. We present an approach to both specify ABAC access control policies and to enforce them when an actual access request has been made. MongoDB Wire Protocol is used for extracting and processing appropriate information from the requests. We also present a method for supporting dynamic access decisions using environmental attributes and handling of ad-hoc access requests through digitally signed user attributes. Results from an extensive set of experiments on the Enron corpus as well as on synthetically generated data demonstrate the scalability of our approach. Finally, we provide details of our implementation on MongoDB and share a Github repository so that any organization can download and deploy the same for enabling ABAC in their own MongoDB installations.
In this work we target horizontal scaling of nosql databases, which exhibit highly varying, unpredictable and difficult to model behavior coupled with transient phenomena during VM removals and/or additions. We propos...
详细信息
In this work we target horizontal scaling of nosql databases, which exhibit highly varying, unpredictable and difficult to model behavior coupled with transient phenomena during VM removals and/or additions. We propose a solution that is cost-aware, systematic, dependable while it accounts for performance unpredictability and volatility. To this end, we model the elasticity as a dynamically instantiated Markov decision process, which can be both solved and verified using probabilistic model checking. Further, we propose a range of complementary decision making policies, which are thoroughly evaluated in workloads from real traces. The evaluation provides strong insights into the trade-offs between performance and cost that our policies can achieve and prove that we can avoid both over- and under-provisioning.
Relational databases are well suited for vertical scaling;however, specialized hardware can be expensive. Conversely, NewSQL and nosql data stores are designed to scale horizontally. NewSQL databases provide ACID tran...
详细信息
ISBN:
(纸本)9781538623268
Relational databases are well suited for vertical scaling;however, specialized hardware can be expensive. Conversely, NewSQL and nosql data stores are designed to scale horizontally. NewSQL databases provide ACID transaction support;however, joins are limited to the partition keys, resulting in restricted query expressiveness. On the other hand, nosql databases are designed to scale out on commodity hardware;however, they are limited by slow join performance. Hence, we consider if the nosql join performance can be improved while ensuring ACID semantics and without drastically sacrificing write performance, disk utilization and query expressiveness. This paper presents the Synergy system that leverages schema and workload driven mechanisms to identify materialized views, and a specialized concurrency control system on top of a nosql database to enable scalable data management with familiar relational conventions. Synergy trades slight write performance degradation and increased disk utilization for faster join performance (compared to standard nosql databases) and improved query expressiveness (compared to NewSQL databases).
暂无评论