In this paper, we study the problem of point-to-point shortest path query in massive complex networks. Nowadays a breadth first search in a network containing millions of vertices may cost a few seconds and it can not...
详细信息
ISBN:
(数字)9783030340292
ISBN:
(纸本)9783030340292;9783030340285
In this paper, we study the problem of point-to-point shortest path query in massive complex networks. Nowadays a breadth first search in a network containing millions of vertices may cost a few seconds and it can not meet the demands of real-time applications. Some existing landmark-based methods have been proposed to solve this problem in sacrifice of precision. However, their query precision and efficiency is not high enough. We first present a notion of navigator, which is a data structure constructed from the input network. Then navigation algorithm based on the navigator is proposed to solve this problem. It effectively navigates a path only using local information of each vertex by interacting with navigator. We conduct extensive experiments in massive real-world networks containing hundreds of millions of vertices. The results demonstrate the efficiency of our methods. Compared with previous methods, ours can navigate a shortest path with higher probability in less time.
In this work, we fabricated a miniaturized water quality monitoring system with microfluidic analysis chip to meet the high density and network requirements of wide area water system monitoring. The framework of the s...
详细信息
ISBN:
(纸本)9781728118536
In this work, we fabricated a miniaturized water quality monitoring system with microfluidic analysis chip to meet the high density and network requirements of wide area water system monitoring. The framework of the system is a layered tower structure, including human-computer interaction module, micro-control module, microfluidic chip and master control module. The miniaturized water quality monitoring system optimizes the integration of microfluidic technology and absorption photometric detection methods, and the design of the micro control system ensures automatic analysis of water quality. Experiments show that the system has excellent detection performance while achieving miniaturization. The detection limit (LOD) of the miniatured analyzer was 3.4 mu M, and the reaction time was between 100 s and 300 s. The reagent consumption is only 6 mu L per sample.
For web servers, the most typical and common failure is that the client's web page requests surge in a certain period of time, resulting in the server's collapse under extreme pressure. The early warning of se...
详细信息
ISBN:
(数字)9781728189154
ISBN:
(纸本)9781728189161
For web servers, the most typical and common failure is that the client's web page requests surge in a certain period of time, resulting in the server's collapse under extreme pressure. The early warning of server crash time provides the possibility to avoid crash loss. In this paper, we propose an early-warning method for Web service failure. Firstly, we obtain the key nodes on the running path of the server, collect and analyze the running status data of the path under various pressures through program instrumentation, and inform the occurrence of failure in advance through LSTM-SVM (the algorithm combining SVM and LSTM). We apply this method to Nginx, a widely used server, and the accuracy of crash warning is over 95%. Experiments show that the method of acquiring target data has little effect on the performance of server, achieves high degree of automation, and realizes high-precision fault warning. The innovation of this paper is that we realize the fault warning through the change of the parameters in the software.
The development of the next generation sequencing technology (NGS) has advanced the genomics research in many application domains. Metagenomics is one such powerful approach to study large community of microbial speci...
详细信息
ISBN:
(数字)9783030234072
ISBN:
(纸本)9783030234072;9783030234065
The development of the next generation sequencing technology (NGS) has advanced the genomics research in many application domains. Metagenomics is one such powerful approach to study large community of microbial species. For the unknown species in the metagenomic samples, gene assembly and identification without a reference genome is a very challenging problem. To overcome this issue, distributed gene assembly software handling multiple metagenome samples can be used. In this paper, based on our previously developed highly scalable gene assembly software SWAP, we present a work flow called WFswap to assemble large genomic data based on many samples and to identify more genes. Our results suggested that WFswap is able to identify 94.2% of the bench-mark genes when tested on the 19 metagenomic samples that contain Bifidobacterium animalis subsp. lactis CNCM I-2494. Our proposed work-flow WFswap showed better performance than WFsoap, a similar workflow that used SOAPdenovo2 for gene assembly.
Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record.
Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record.
This paper investigates the performance of the ON-Grid solar PV system using three different Maximum Power Point Tracking (MPPT) techniques (Perturbation and Observation (P&O), Incremental Conductance (I.C.) and G...
详细信息
ISBN:
(数字)9781728170688
ISBN:
(纸本)9781728170695
This paper investigates the performance of the ON-Grid solar PV system using three different Maximum Power Point Tracking (MPPT) techniques (Perturbation and Observation (P&O), Incremental Conductance (I.C.) and Grey-Wolf technique). The performance of the MPPT techniques has been scrutinized while the change in the external environment such as partial shading conditions and the variation of the temperature has been applied. The purpose of this research is regulating the output voltage and output power with a very high-speed response. A comparison between the proposed MPPT techniques has been discussed and it is concluded that the Grey-Wolf MPPT technique ensures a reduction in the time response by 96% and 71.4% corresponding to the I.C. and P&O techniques respectively. All the simulations have been implemented through the MATLAB/Simulink computerprogram.
The integration of high-performance computing jobs and big data processing jobs has developed into a significant trend in the industry. Since distributed big data clusters are slightly inferior to supercomputers in te...
详细信息
ISBN:
(数字)9781665422321
ISBN:
(纸本)9781665446662
The integration of high-performance computing jobs and big data processing jobs has developed into a significant trend in the industry. Since distributed big data clusters are slightly inferior to supercomputers in terms of performance, more and more big data jobs are operated on supercomputers, such as Tianhe-2. These two types of jobs have different characteristics and resource requirements, which makes it difficult for the job scheduler currently used in supercomputers to make a perfect connection between jobs and resources. To this end, this paper proposes a resource scheduling framework based on the combination of Push and Pull. According to the node resource usage and job resource request amount, two different scheduling strategies, Push or Pull, can be implemented for the job. Among them, Push means that the management node dispatches a task to the work node, and Pull means that the work node applies to the management node a task to execute. According to the experiments, compared to the Push-based scheduling mode, our Push/Pull-based scheduling mode has about 3 times the job throughput without affecting the scalability and increases the resource utilization 20.4%.
Space information network (SIN) consisting of communication satellites plays an important role in information acquisition and transmission. An increasing volume of data produced by different space missions is forwarde...
详细信息
ISBN:
(纸本)9783030191535;9783030191528
Space information network (SIN) consisting of communication satellites plays an important role in information acquisition and transmission. An increasing volume of data produced by different space missions is forwarded by satellites to ground stations (GSs), which leads to satellites that are responsible for forwarding being overload and data cannot be timely downloaded to GS. Moreover the dynamic and complex SIN operating environment deteriorates the performance of data downloading. Thus, for improving data downloading, it is a key to realize data load balance. That means extra data is offloaded to other satellites having extra downloading capacity with an effective scheduling method. To this end, we modeled collaborative data offloading problem as multi-objective mixed integer nonlinear programming (MOMINLP) problems based on developing time-evolving graph (TEG) and contact plan. Due to its computational complexity, we proposed a heuristic approach with phasing based on contact plan, i.e., phased offloading algorithm (POA) operating on a slot-by-slot basis, to jointly schedule data offloading among the satellites and data downloading from satellites to the GS. Simulation results demonstrate that, in many cases, the proposed algorithms can guarantee relatively high data down-loading throughput and low energy consumption produced by data offloading.
Distributed training is a novel approach to accelerating training of Deep Neural Networks (DNN), but common training libraries fall short of addressing the distributed nature of heterogeneous processors or interruptio...
详细信息
ISBN:
(数字)9781665423243
Distributed training is a novel approach to accelerating training of Deep Neural Networks (DNN), but common training libraries fall short of addressing the distributed nature of heterogeneous processors or interruption by other workloads on the shared processing nodes. This paper describes distributed training of DNN on computational storage devices (CSD), which are NAND flash-based, high-capacity data storage with internal processing engines. A CSD-based distributed architecture incorporates the advantages of federated learning in terms of performance scalability, resiliency, and data privacy by eliminating the unnecessary data movement between the storage device and the host processor. The paper also describes Stannis, a DNN training framework that improves on the shortcomings of existing distributed training frameworks by dynamically tuning the training hyperparameters in heterogeneous systems to maintain the maximum overall processing speed in term of processed images per second and energy efficiency. Experimental results on image classification training benchmarks show up to 3.1x improvement in performance and 2.45x reduction in energy consumption when using Stannis plus CSD compare to the generic systems.
Undergraduate programs in computer science (CS) face high dropout rates, and many students struggle while learning to program. Studies show that perceived programming ability is a significant factor in students' d...
详细信息
ISBN:
(纸本)9781450361859
Undergraduate programs in computer science (CS) face high dropout rates, and many students struggle while learning to program. Studies show that perceived programming ability is a significant factor in students' decision to major in CS. Fortunately, psychology research shows that promoting the growth mindset, or the belief that intelligence grows with effort, can improve student persistence and performance. However, mindset interventions have been less successful in CS than in other domains. We conducted a small-scale interview study to explore how CS students talk about their intelligence, mindsets, and programming behaviors. We found that students' mindsets rarely aligned with definitions in the literature;some present mindsets that combine fixed and growth attributes, while others behave in ways that do not align with their mindsets. We also found that students frequently evaluate their self-efficacy by appraising their programming intelligence, using surprising criteria like typing speed and ease of debugging to measure ability. We conducted a survey study with 103 students to explore these self-assessment criteria further, and found that students use varying and conflicting criteria to evaluate intelligence in CS. We believe the criteria that students choose may interact with mindsets and impact their motivation and approach to programming, which could help explain the limited success of mindset interventions in CS.
暂无评论