The size of the databases used in today39;s enterprises has been growing at exponential rates day by day. Simultaneously, the need to process and analyze the large volumes of data for business decision making has al...
详细信息
ISBN:
(纸本)9781467317191;9781467317207
The size of the databases used in today's enterprises has been growing at exponential rates day by day. Simultaneously, the need to process and analyze the large volumes of data for business decision making has also increased. In several business and scientific applications, there is a need to process terabytes of data in efficient manner on daily bases. This has contributed to the big data problem faced by the industry due to the inability of conventional database systems and software tools to manage or process the big data sets within tolerable time limits. processing of data can include various operations depending on usage like culling, tagging, highlighting, indexing, searching, faceting, etc operations. It is not possible for single or few machines to store or process this huge amount of data in a finite time period. This paper reports the experimental work on big data problem and its optimal solution using Hadoop cluster, Hadoop distributed File System (HDFS) for storage and using parallelprocessing to process large data sets using Map Reduce programming framework. We have done prototype implementation of Hadoop cluster, HDFS storage and Map Reduce framework for processing large data sets by considering prototype of big data application scenarios. The results obtained from various experiments indicate favorable results of above approach to address big data problem.
This article presents a method for evaluating the CPU power, independently from the system used, in heterogeneous networks of work stations. It is based on the use of Java language in order to ensure application porta...
详细信息
ISBN:
(纸本)0769522106
This article presents a method for evaluating the CPU power, independently from the system used, in heterogeneous networks of work stations. It is based on the use of Java language in order to ensure application portability and more particularly on the mechanism of thread CPU processing time measurement introduced in the version 1.5 of Sun Java. That tool will be integrated into the load balancing mechanism which is totally written in Java and that we developed in the LIFL project ADAJ. We show how to evaluate the potential power of the CPU with a software totally written in Java. Moreover, we will justify the results provided by our approach. We will also analyse the exploitation of the calibration tool in order to improve the execution time of parallel and distributedapplications in the context of load balancing in a network of workstations.
Decision support systems are characterized by large data sets and complex queries with multi-way joins, aggregation and nesting. SQLmpp is a highly parallel database server designed to efficiently support these applic...
详细信息
Decision support systems are characterized by large data sets and complex queries with multi-way joins, aggregation and nesting. SQLmpp is a highly parallel database server designed to efficiently support these applications. SQLmpp's query processing strategy is driven by three principal goals: 1) data parallelism in all operations, 2) maximal use of any relevant indexes, and 3) minimal processing of complete relations. This paper describes the general query processing techniques and the specific operations used to achieve these goals.
Managing distributed ontologies is a challenging issue in the Semantic Web area. Different to most current distributed ontologies management researches, which focus on ontologies maintenance, evolutions, and versionin...
详细信息
ISBN:
(纸本)3540297693
Managing distributed ontologies is a challenging issue in the Semantic Web area. Different to most current distributed ontologies management researches, which focus on ontologies maintenance, evolutions, and versioning, this paper proposes a new distributed ontologies management framework based on the function-oriented perspective, and its goal is to bring multiple distributed ontologies together to provide more powerful capabilities. Ontology mapping is the key factor for manage distributed ontologies. This management framework also proposes a novel approach to eliminate the redundancies and errors of mappings in distributed ontologies.
The proceedings contains 35 papers. Topics discussed include databases, parallelprocessing systems, distributed computer systems, data processing, large scale systems, data transfer, data storage, storage allocation,...
详细信息
The proceedings contains 35 papers. Topics discussed include databases, parallelprocessing systems, distributed computer systems, data processing, large scale systems, data transfer, data storage, storage allocation, information management, computer architectures, computer operating systems and data structures.
Computing paradigms are introduced for solving complex problems by analyzing, designing and implementing by complex systems. Computing can be defined as the effective use of computer or computer technology to solve ta...
详细信息
ISBN:
(纸本)9781665428644
Computing paradigms are introduced for solving complex problems by analyzing, designing and implementing by complex systems. Computing can be defined as the effective use of computer or computer technology to solve tasks that are goal oriented. Computing is used in development of producing scientific studies, building intelligent systems, channeling different media for communication. Over the last few years, internet became so popular which lead to the increase in computer processing capacity, data storage and communication with one another. Computing has evolved from one technology to another in its field and formed a robust framework over the years. In this paper a survey on different computing paradigms like evergreen computing is cloud computing, to deal with basic scheduling is grid computing, for multi task handing is parallel computing, to handle smart phone data's that is mobile computing, cluster computing, and distributed computing is carried out. These technologies improved the way computing functions and made it easier to the computer world. The applications and research issues of the most of the computing paradigms are discussed in this article. The recent research issues in computing platform are scheduling and security. The scheduling is dealing with data processing from one computing platform to other computing device. Security is one of the important research issues.
This paper presents a real-time solution to the problem of obstacle detection in automotive applications using image processing techniques. To speed-up the processing a massively parallel engine has been used and the ...
详细信息
ISBN:
(纸本)0780342291
This paper presents a real-time solution to the problem of obstacle detection in automotive applications using image processing techniques. To speed-up the processing a massively parallel engine has been used and the algorithms tuned to match the specific features of the computing architecture. The system acquires pairs of stereo images, checks for correspondences, and remaps the resulting image in a new domain to ease the following processing steps. The whole processing is performed on PAPRICA-3, a massively parallel system whose processing elements are disposed on a linear array;the proposed system allows to reach video rate performance. The whole system is currently simulated and is expected to be available by mid '98.
This paper proposes a workflow management mechanism to address a neglected aspect of existing P2P computing platforms - the lack of support for various computational models. In the workflow management mechanism, a wor...
详细信息
ISBN:
(纸本)3540297693
This paper proposes a workflow management mechanism to address a neglected aspect of existing P2P computing platforms - the lack of support for various computational models. In the workflow management mechanism, a workflow description file is used to define the workflow diagram of the target application. We develop a prototype system, and evaluate it using a test program to demonstrate how the workflow management mechanism effectively works.
The introduction of SRAM-based field programmable gate arrays (FPGAs) has opened-up a new dimension to parallel computing architectures. This paper describes an alternative approach to parallel computing - reconfigura...
详细信息
ISBN:
(纸本)0780342291
The introduction of SRAM-based field programmable gate arrays (FPGAs) has opened-up a new dimension to parallel computing architectures. This paper describes an alternative approach to parallel computing - reconfigurable or virtual parallelprocessing (VPP). Rather than mapping an application onto a given par allel machine, the VPP approach synthesizes the appropriate type and number of processing elements, as well as the interconnection topology, that is optimal for the application. For each application, configuration data is downloaded to the machine that personalizes the hardware for the task at hand. The paper provides a brief description of the authors reconfigurable computer, Archimedes. The benefits of the VPP approach are highlighted by an example application - the 2-D FFT. A novel parallel implementation of a polynomial transform based 2-D transform is described and compared to results for distributed memory parallel machines that have been reported in the literature. The comparison highlights the computational advantage provided by reconfigurable computing.
The ShanghaiGrid, as a Grid Computing Environment, is an information Grid to serve the public in the city, and all resources are regarded as Grid Services in the Open Grid Services Architecture (OGSA). The primary goa...
详细信息
ISBN:
(纸本)3540297693
The ShanghaiGrid, as a Grid Computing Environment, is an information Grid to serve the public in the city, and all resources are regarded as Grid Services in the Open Grid Services Architecture (OGSA). The primary goal of the ShanghaiGrid is to build a generally shared information grid platform. Charging and accounting are an important part of the grid computing system in the ShanghaiGrid. This paper discusses an accounting services model and accounting life cycle that will be used in the ShanghaiGrid. We will analyze the charging and accounting process in detail based on this model and cycle.
暂无评论