distributed storage technology adopts an extensible system structure,shares storage load by many storage servers,locates stored information through location *** with a previous framework that financial management and ...
distributed storage technology adopts an extensible system structure,shares storage load by many storage servers,locates stored information through location *** with a previous framework that financial management and control system of electric power companies adopted centralized storage servers to store all the data,storage servers are no longer bottleneck of information system *** servers not only improve reliability,availability and access efficiency of system,but also are apt to extend.
How to model the influence diffusion process accurately is an open issue that it has attracted a lot of researchers in the field of social network analysis. The existing researches assume they have already owned the s...
详细信息
ISBN:
(纸本)9781538637913
How to model the influence diffusion process accurately is an open issue that it has attracted a lot of researchers in the field of social network analysis. The existing researches assume they have already owned the social graphs with edges labeled with the influence probability. However, the question of how to obtain these probability from social networks has been largely ignored. Thus, it is interesting to address the problem of how to model the influence diffusion based on the data of social graphs and action logs. This is the main problem we addressed in this paper, and our purpose is to solve the problem of seeds detection via the data-driven influence probability calculation. We consider the influence probability can be viewed as two parts of the influence strength and the influence threshold. For learning the influence probability, we propose a novel Data-driven Influence Learning (DIL) algorithm including three stages. The experimental results illustrate our algorithm performs better than other baselines in various datasets. In addition, our algorithm enables us to detect the seed sets in large social networks.
For the past seven decades the term Big Data is known, but due to the emerging technology shift of this era, it is captivating a lot of attention from the researchers of mathematics, computing, telecommunication, info...
详细信息
In this paper,we study the customer loyalty based on survival *** regard the time of using product as the survival time,and utilize the AFT model to analysis the influencing factors of the customer *** results of simu...
In this paper,we study the customer loyalty based on survival *** regard the time of using product as the survival time,and utilize the AFT model to analysis the influencing factors of the customer *** results of simulation show that our method is more effective than commonly regression method.A real example is also provided as an illustration.
In cloud computing, a data owner outsources a huge dataset to cloud servers, and authorizes many clients to retrieve data of interest anytime and anywhere. Data outsourcing is helpful to lessen the burden on local dat...
详细信息
ISBN:
(纸本)9781538637913
In cloud computing, a data owner outsources a huge dataset to cloud servers, and authorizes many clients to retrieve data of interest anytime and anywhere. Data outsourcing is helpful to lessen the burden on local data storage and also to improve quality of services. However, the cloud service provider (CSP) outside the data owner's trusted domain may return wrong query results to the client deliberately or ***, query authentication and query correction are of crucial importance to achieve reliable cloud services. In this paper,we consider a flow-oriented cloud environment, where query results will be transmitted in the form of data flow to the clients via networks. Specifically, we propose a two-phase query (TPQ) protocol to support efficient authentication and correction of multi-dimensional range queries. By ingeniously integrating signature chain with Reed Solomon codes, our proposed protocol allows a client to efficiently verify the correctness of query results while performing error correction in an adaptive way. Extensive experiments on a real dataset demonstrate effectiveness of our proposed protocol.
Light curves are fundamental tools for variable star astronomy. They describe the change of celestial object's light intensity as time goes on. Because of the sharp increase of data in astronomical researches, the...
详细信息
ISBN:
(纸本)9781538637913
Light curves are fundamental tools for variable star astronomy. They describe the change of celestial object's light intensity as time goes on. Because of the sharp increase of data in astronomical researches, the general methods are now utilized cannot meet the requirement of time-domain astronomical observation. In this paper, FLCGS, a Flexible Light Curves Generation System is proposed to achieve scalability and parallelism for generating light curves from astronomical catalogs. We design metadata files for light curves generation based on sky partition. Moreover, a new partition strategy is defined to keep workload balance. The function works very well via dynamic programming when the distribution of Big Data is skewed. We focus on the cross-matching between celestial objects from different catalogs and introduce a new method to determine whether they are the same celestial objects for light curves generation. Experimental results show that FLCGS is nearly 11 times faster than using MySQL, especially when the data is massive.
Proteomics is a hot pot topic in current, its development experienced from proteins qualitative research to quantitative research. Label-free quantification method is the most widely used protein quantification method...
详细信息
ISBN:
(纸本)9781538637913
Proteomics is a hot pot topic in current, its development experienced from proteins qualitative research to quantitative research. Label-free quantification method is the most widely used protein quantification method. But during the process of label-free quantification, lots of Mass spectrometry (MS) data do not be used so that cause the waste of data resource. In this paper, a quantitative algorithm of protein based on simulated annealing algorithm was proposed to improve the efficiency of the use of MS spectral. This paper is about using all possible peptide ions to extract quantitative information from MS spectral. Verified by Experimental data set about that this protein quantification algorithm can increase the coverage of proteins on the condition of ensured accuracy. This paper focused on the development of proteomics and the analysis and excavation of mass spectrum data, its study results can be widely applied in protein qualification area.
With the advent of clustered systems, more and more parallel computing is required. However a lot of programming skills is needed to write a parallel codes, especially when you want to benefit from the various paralle...
详细信息
File-based data access represents one of the most common ways of providing input data and retrieving output data in scientific applications. However, in distributedcomputing environments, the execution platform of th...
详细信息
Mobile cloud computing is a rapidly increasing technology where user's demand, for cloud contents, might exceed the capability of mobile networks even with 5G. Multicast communication can offer excellent service f...
详细信息
暂无评论