this paper discusses a parallel genetic algorithm (GA) which focuses on the local operator for Traveling salesman problem (TSP). the local operator is a simple GA named as Local Genetic Algorithm (LGA). the LGA is com...
详细信息
ISBN:
(纸本)0769524052
this paper discusses a parallel genetic algorithm (GA) which focuses on the local operator for Traveling salesman problem (TSP). the local operator is a simple GA named as Local Genetic Algorithm (LGA). the LGA is combined to another GA named as Global Genetic Algorithm (GGA). It increases the computational time running a GA as a local operator in another one. To solve this problem, we build a parallel system based on our previous works for running the LGA to speed up the process. the results show that LGA improve the search quality significantly and it is more efficient running LGA withparallel system than single CPU.
Withthe increasing popularity of shared-memory programming model, especially at the advent of multicore processors, applications need to become more concurrent to take advantage of the increased computational power p...
详细信息
A parallel algorithm, namely parallel block diagonal dominant (PBDD) algorithm, is proposed to solve block tridiagonal linear systems on multi-computers. this algorithm is based on divided-and-conquer idea of the PDD ...
详细信息
ISBN:
(纸本)9780769534435
A parallel algorithm, namely parallel block diagonal dominant (PBDD) algorithm, is proposed to solve block tridiagonal linear systems on multi-computers. this algorithm is based on divided-and-conquer idea of the PDD method. When the systems is strictly block diagonal dominant, the PBDD is highly parallel and provides approximate solutions that equals to the exact solutions within machine accuracy. the PBDD method has been implemented on a 64-node multi-computer. the analytic results match closely withthe results measured from the numerical experiments.
In this paper, we study the dependency between MapReduce configuration parameters and network load of fixed-size MapReduce jobs during the shuffle phase;then we propose an analytical method to model this dependency. O...
详细信息
ISBN:
(纸本)9780769548791
In this paper, we study the dependency between MapReduce configuration parameters and network load of fixed-size MapReduce jobs during the shuffle phase;then we propose an analytical method to model this dependency. Our approach consists of three key phases: profiling, modeling, and prediction. In the first stage, an application is run several times with different sets of MapReduce configuration parameters (here number of map tasks and number of reduce tasks) to profile the network load of an application in the shuffle phase on a given cluster. then, the relation between these parameters and the network load is modeled by multivariate linear regression. For evaluation, three applications (WordCount, Exim Mainlog parsing, and TeraSort) are utilized to evaluate our technique on a 5-node MapReduce private cluster.
the Uniform Manifold Projection and Approximation (UMAP) assisted Feature-type distributed Clustering (FDC) workflow, contrary to the traditional UMAP algorithm, was found to be more informative for dimensionality red...
详细信息
ISBN:
(纸本)9798350385939;9798350385922
the Uniform Manifold Projection and Approximation (UMAP) assisted Feature-type distributed Clustering (FDC) workflow, contrary to the traditional UMAP algorithm, was found to be more informative for dimensionality reduction in tabular Clinical and Biomedical Routine Data (CBRD) due to the presence of diverse feature-types in such datasets. However, the prospect of using powerful neural network models for the Feature-type distributed dimensionality reduction task is yet to be explored. Considering the broader applicative potential of neural networks, we compared some autoencoder-based neural networks as an alternative to the UMAP algorithm for the FDC workflow. the study uses standard objective measures such as Silhouette Score and DB-Index to compare the quality of clusters and embeddings generated by the Autoencoder-assisted FDC (FDC-AE) approaches withthe established UMAP-assisted FDC (FDC-UMAP) workflow. the evaluation involved two datasets in each size category from the medical field. the results indicate that for a dimensionality reduction and cluster identification task, the FDC-AE can be more effective compared to the FDC-UMAP, especially on larger datasets. Our research opens up the possibility of feature-distributed multi-label supervised dimensionality reduction, as well as the usage of pre-trained networks for such complex dimensionality reduction tasks.
With extensive use of Internet of Vehicle (IoV) technologies in vehicle traffic management, real-time analysis of vehicle behavior trajectories is of great significance to the assessment of traffic conditions and the ...
详细信息
ISBN:
(纸本)9781538637906
With extensive use of Internet of Vehicle (IoV) technologies in vehicle traffic management, real-time analysis of vehicle behavior trajectories is of great significance to the assessment of traffic conditions and the avoidance of abnormal conditions. this paper presents a solution which can efficiently deal with real-time streaming data of trajectory and excavate temporal and spatial abnormal information. In order to represent the local feature information of the trajectory and solve the problem of large loss of information in the feature point extraction algorithm, a trajectory partitioning strategy based on multi-motion feature and a similarity measure method based on trajectory structure are proposed. And based on the proposed strategy and method, a distributed clustering algorithm is designed for streaming trajectories to improve the efficiency of clustering algorithm. In order to solve the problem of massive calculation of distance and neighborhood density in trajectory anomaly detection algorithm, the data set is pruned by track clustering results, and the efficiency of the algorithm increases the real-time performance of abnormal trajectory detection.
the amount of high-quality data determines the performance of the deep learning model. In reality, the data is often physically distributed in different organizations, and model averaging can train a deep model on the...
详细信息
ISBN:
(数字)9781728114422
ISBN:
(纸本)9781728114422
the amount of high-quality data determines the performance of the deep learning model. In reality, the data is often physically distributed in different organizations, and model averaging can train a deep model on the distributed data, while providing competitive performance compared with training a model on the centralized data. However, it cannot prevent inversion attack, as the intermediate parameters are transmitted during training. Some data enhancement methods, such as mixup, can effectively enhance the data privacy. In this paper, we propose a novel model averaging method combined with mixup, which provides protection against inversion attack. Besides we conduct experiments using state-of-the-art deep network architectures on multiple types of dataset to show that our method improves the classification accuracy of models.
the increasing interest in utilizing distributed Generation (DG) in transmission system is due to rise in energy demand, restriction in new transmission lines, deregulated power and utility limitations. Introduction D...
详细信息
ISBN:
(纸本)9781538659069
the increasing interest in utilizing distributed Generation (DG) in transmission system is due to rise in energy demand, restriction in new transmission lines, deregulated power and utility limitations. Introduction DG into the electrical network can improve its performance technically. Both DG placement and its penetration level become critical issue for both utility and DG owners. this paper presents the extent to which DG effect the power losses depending on the type of DG, its penetration level (PL) and its optimal location. In this paper, optimal DG placement and sizing in electrical network is found by using a Directed Bee Colony (DBC) optimization algorithm. the objective is to reduce the power losses and enhance voltage profile of the electrical system. the effectiveness of the DBC had been tested on IEEE 14-bus sub-transmission system and is compared with results obtained with Particle Swarm Optimization (PSO) method. As the PL of 55-60% and 70-75% is achieved the real and reactive power losses begin to increase respectively for DGs of type-2 in the electrical system. It is also observed that the net saving cost due to DG placement is beneficial up to 65% PL of DG.
Large-scale interactive applications and realtime data-processing are facing problems with traditional disk-based storage solutions. Because of the often irregular access patterns they must keep almost all data in RAM...
详细信息
ISBN:
(纸本)9781479924189
Large-scale interactive applications and realtime data-processing are facing problems with traditional disk-based storage solutions. Because of the often irregular access patterns they must keep almost all data in RAM caches, which need to be manually synchronized with secondary storage and need a lot of time to be re-loaded in case of power outages. In this paper we propose a novel key-value storage keeping all data always in RAM by aggregating resources of potentially many nodes in a data center. We aim at supporting management of billions of small data objects (16-64 byte) like for example needed for storing graphs. A scalable low-overhead meta-data management is realized using a novel range-based ID approach combined with a super-overlay network. Furthermore, we provide persistence by a novel SSD-aware logging approach allowing to recover failed nodes very fast.
the purpose of this paper is to design and implement a hybrid compiler that combines JOMP's directives with javar's annotations in order to obtain a more performing compiler. this is an original approach and c...
详细信息
ISBN:
(纸本)9780769548791
the purpose of this paper is to design and implement a hybrid compiler that combines JOMP's directives with javar's annotations in order to obtain a more performing compiler. this is an original approach and consists of pooling the advantages of those two compilers while fixing some of their issues. However the achievement of this aim is facing the issue of the difference of implementation of these two compilers because JOMP is implemented in Java while javar is implemented in C language. We propose to entirely re-implement javar in Java by using JavaCC. thereafter, we present the implementation of the hybrid compiler. In the experiments, we propose to parallelize the matrix sort program by using this hybrid compiler. the results of experiments and the mathematical demonstration lead us to state that dealing withthis hybrid compiler gives performances better or equal to the best one between javar and JOMP.
暂无评论