We consider the problem of finding a spanning tree that maximizes the number of leaves (MAXLEAF). We provide a 3/2-approximation algorithm for this problem when restricted to cubic graphs, improving on the previous 5/...
详细信息
ISBN:
(纸本)9783540922476
We consider the problem of finding a spanning tree that maximizes the number of leaves (MAXLEAF). We provide a 3/2-approximation algorithm for this problem when restricted to cubic graphs, improving on the previous 5/3-approximation for this class. To obtain this approximation we define a graph parameter x(G), and construct a tree with at least (n - x(G) + 4)/3 leaves, and prove that no tree with more than (n - x(G) + 2)/2 leaves exists. In contrast to previous approximation algorithms for MAXLEAF;our algorithm works with connected dominating sets instead of constructing a tree directly. The algorithm also yields a 4/3-approximation for Minimum Connected Dominating Set in cubic graphs.
The input to the MINIMUM LATENCY SET COVER PROBLEM consists of a set of jobs and a set of tools. Each job j needs a specific subset S-j of the tools in order to be processed. It is possible to install a single tool in...
详细信息
ISBN:
(纸本)3540291180
The input to the MINIMUM LATENCY SET COVER PROBLEM consists of a set of jobs and a set of tools. Each job j needs a specific subset S-j of the tools in order to be processed. It is possible to install a single tool in every time unit. Once the entire subset S-j has been installed, job j can be processed instantly. The problem is to determine an order of job installations which minimizes the weighted sum of job completion times. We show that this problem is NP-hard in the strong sense and provide an e-approximation algorithm. Our approximation algorithm uses a framework of approximation algorithms which were developed for the minimum latency problem.
DNA Sequencing is a process where we determine and identify every single DNA base and element that is in the genome of an individual. There are six billion of those in every normal cell in every person. When we apply ...
详细信息
ISBN:
(纸本)9781467395458
DNA Sequencing is a process where we determine and identify every single DNA base and element that is in the genome of an individual. There are six billion of those in every normal cell in every person. When we apply DNA sequencing in the Cancer project. We first figure out what those six billion DNA bases are in the normal cells in that person and then we take some of the tumor in that person and figure out what the DNA bases are in the tumor. In this paper, Here we discuss implements and future works in bioinformatics with Hadoop. We study the MapReduce algorithm from algorithm lay by point and demonstrate the appropriates of our approach by tracing and analyzing efficient MapReduce algorithms for sorting and simulation problem of parallel algorithms specified in the help of pigeonhole principle. The big data is a great computational challenge to statistical analysis of DNA big data. We can get general statistical analysis through R language. After studying the survey paper various approaches of using GPU and MapReduce. We adopted the best solution to using R with MapReduce. An R package is created to shift a set of critical R functions on GPU card. It allows users to run R code with GPU spread that enable much faster large data set of computation.
In this paper, we consider the B-prize-collecting multicut problem in trees. In this problem, we are given a tree T = (V, E), a set of k source-sink pairs P = {(s(1), t(1)), (s(2), t(2)),..., (s(k), t(k))} and a profi...
详细信息
ISBN:
(纸本)9783031203497;9783031203503
In this paper, we consider the B-prize-collecting multicut problem in trees. In this problem, we are given a tree T = (V, E), a set of k source-sink pairs P = {(s(1), t(1)), (s(2), t(2)),..., (s(k), t(k))} and a profit bound B. Every edge e is an element of E has a cost c(e), and every source-sink pair (s(j), t(j)). P has a profit p(j) and a penalty pi(j). This problem is to find a multicut M subset of E such that the total cost, which consists of the total cost of the edges in M and the total penalty of the pairs still connected after removing M, is minimized and the total profit of the disconnected pairs by removing M is at least B. Based on the primal-dual scheme, we present an (8/3 + epsilon)-approximation algorithm by carefully increasing the penalty, where epsilon is any fixed positive number.
The goal in topological network design is to build a minimum-cost topology meeting specific real-life constraints. There is a cost-robustness trade-off under single and multiple failures. Previous works in the field s...
详细信息
Given a terrain and a query point p on or above it, we want to count the number of triangles of terrain that are visible from p. We present an approximation algorithm to solve this problem. We implement the algorithm ...
详细信息
ISBN:
(纸本)9789897581335
Given a terrain and a query point p on or above it, we want to count the number of triangles of terrain that are visible from p. We present an approximation algorithm to solve this problem. We implement the algorithm and then we run it on the real data sets. The experimental results show that our approximation solution is very close to the real solution and compare to the other similar works, the running time of our algorithm is better than their algorithm. The analysis of time complexity of algorithm is also presented. Also, we consider visibility testing problem, where the goal is to test whether p and a given triangle of train are visible or not. We propose an algorithm for this problem and show that the average running time of this algorithm will be the same as running time of the case where we want to test the visibility between two query point p and q.
Genome Rearrangements affect large stretches of genomes during evolution. One of the most studied genome rearrangement is the transposition, which occurs when a sequence of genes is moved to another position inside th...
详细信息
ISBN:
(纸本)9783030422660;9783030422653
Genome Rearrangements affect large stretches of genomes during evolution. One of the most studied genome rearrangement is the transposition, which occurs when a sequence of genes is moved to another position inside the genome. Mathematical models have been used to estimate the evolutionary distance between two different genomes based on genome rearrangements. However, many of these models have focused only on the (order of the) genes of a genome, disregarding other important elements in it. Recently, researchers have shown that considering existing regions between each pair of genes, called intergenic regions, can enhance the distance estimation in realistic data. In this work, we study the transposition distance between two genomes, but we also consider intergenic regions, a problem we name Sorting Permutations by Intergenic Transpositions (SbIT). We show that this problem is NP-hard and propose a 3.5-approximation algorithm for it.
Proposed algorithms for calculating the shortest paths such as Dijikstra and Flowd-Warshall's algorithms are limited to small networks due to computational complexity and cost. We propose an efficient and a more a...
详细信息
Proposed algorithms for calculating the shortest paths such as Dijikstra and Flowd-Warshall's algorithms are limited to small networks due to computational complexity and cost. We propose an efficient and a more accurate approximation algorithm that is applicable to large scale networks. Our algorithm iteratively constructs levels of hierarchical networks by a node condensing procedure to construct hierarchical graphs until threshold. The shortest paths between nodes in the original network are approximated by considering their corresponding shortest paths in the highest hierarchy. Experiments on real life data show that our algorithm records high efficiency and accuracy compared with other algorithms.
In this paper, we introduce a squared metric k-facility location problem (SM-k-FLP) which is a generalization of the squared metric facility location problem (SMFLP) and k-facility location problem (k-FLP). In the SM-...
详细信息
ISBN:
(纸本)9783319711508;9783319711492
In this paper, we introduce a squared metric k-facility location problem (SM-k-FLP) which is a generalization of the squared metric facility location problem (SMFLP) and k-facility location problem (k-FLP). In the SM-k-FLP, we are given a client set C and a facility set F from a metric space, a facility opening cost f(i) >= 0 for each i is an element of F, and an integer k. The goal is to open a facility subset F subset of F with vertical bar F vertical bar <= k and to connect each client to the nearest open facility such that the total cost (including facility opening cost and the sum of squares of distances) is minimized. Using local search and scaling techniques, we offer a constant approximation algorithm for the SM-k-FLP.
In this paper, we introduce a model of distributionally robust facility location problem (DRFLP) under moment constraints up to the second order. We show, via duality theory of moment problems, that the linear relaxat...
详细信息
ISBN:
(纸本)9783319083773;9783319083766
In this paper, we introduce a model of distributionally robust facility location problem (DRFLP) under moment constraints up to the second order. We show, via duality theory of moment problems, that the linear relaxation of the DRFLP is equivalent to that of the standard uncapacitated facility location problem (UFLP). Consequently, any LP-based approximation algorithm for the UFLP implies an approximation algorithm for the DRFLP with the same approximation ratio.
暂无评论