In order to perfectly meet the needs of business leaders, decision-makers have resorted to the integration of external sources (such as Linked Open data) in the decision-making system in order to enrich their existing...
详细信息
In order to perfectly meet the needs of business leaders, decision-makers have resorted to the integration of external sources (such as Linked Open data) in the decision-making system in order to enrich their existing datawarehouses with new concepts contributing to bring added value to their organizations, enhance its productivity and retain its customers. However, the traditional datawarehouse environment is not suitable to support external Big data. To deal with this new challenge, several researches are oriented towards the direct conversion of classical relational datawarehouse to a columnar nosql data warehouse, whereas the existing advanced works based on clustering algorithms are very limited and have several shortcomings. In this context, our paper proposes a new solution that conceives an optimized columnardatawarehouse based on CLARANS clustering algorithm that has proven its effectiveness in generating optimal column families. Experimental results improve the validity of our system by performing a detailed comparative study between the existing advanced approaches and our proposed optimized method.
The emergence of large volumes of data imposed by the major players of the web requires new management models and new data storage architectures and treatment able to find information quickly in a large volume of data...
详细信息
ISBN:
(纸本)9781479938407
The emergence of large volumes of data imposed by the major players of the web requires new management models and new data storage architectures and treatment able to find information quickly in a large volume of data. The column-oriented nosql (Not Only SQL) database provide for big data the most suitable model to the datawarehouse and the structure of multidimensional data in OLAP cube form. However, in the absence of OLAP cube computation operators, we propose in this paper, a new aggregation operator called CN-CUBE (columnarnosql CUBE), which allows data cubes to be computed from datawarehouses stored in column-oriented nosqldatabase management system. We implemented the CN-CUBE operator using the SQL Phoenix interface of HBase DBMS and conducted experiments on a public datawarehouse in a distributed environment produced using the Hadoop platform. Thus we have shown that our CN-CUBE operator has OLAP cubes computation times very suitable for nosqlwarehouses.
暂无评论