Graph sampling is a very effective method to deal with scalability issues when analyzing largescale graphs. Lots of sampling algorithms have been proposed, and sampling qualities have been quantified using explicit pr...
详细信息
Graph sampling is a very effective method to deal with scalability issues when analyzing largescale graphs. Lots of sampling algorithms have been proposed, and sampling qualities have been quantified using explicit properties(e.g., degree distribution) of the sample. However, the existing sampling techniques are inadequate for the current sampling task: sampling the clustering structure, which is a crucial property of the current networks. In this paper, using different expansion strategies, two novel top-leader sampling methods(i.e., TLS-e and TLS-i) are proposed to obtain representative samples, and they are capable of effectively preserving the clustering structure. The rationale behind them is to select top-leader nodes of most clusters into the sample and then heuristically incorporate peripheral nodes into the sample using specific expansion strategies. Extensive experiments are conducted to investigate how well sampling techniques preserve the clustering structure of graphs. Our empirical results show that the proposed sampling algorithms can preserve the population's clustering structure well and provide feasible solutions to sample the clustering structure from large-scale graphs.
Agriculture plays a major role in developing countries like India, however the food security still remains a vital issue. Most of the crops get wasted due to lack of storage facility, transportation, and plant disease...
详细信息
The development of legal records requires extra time and their absurd length raises the need for programmed legal record handling frameworks. One of the handling steps is to recognize the essence of the reports expres...
详细信息
Forecasting stock prices is crucial because even minor incremental improvements translate into significant advantages in their financial decisions. A novel method that combines ARIMA Models with Simple Moving Averages...
详细信息
The surrogate-assisted evolutionary algorithm (SAEA) is one of the most efficient approaches for addressing expensive continuous or combinatorial optimization problems. However, it encounters significant challenges in...
详细信息
Cardiovascular disease continues to be a predominant cause of mortality globally, requiring precise and effective strategies for early identification. This work examines the application of clinical datasets to explore...
详细信息
This research attempts to identify stock market bubbles using technical indicators combined with machine-learning processes. If not detected early on, stock bubbles arising from overvaluation and speculation can incur...
详细信息
This research analyzes the effectiveness of the K-Nearest Neighbors algorithm combined with moving average techniques, Five-day and Ten-day Exponential Moving Averages, specifically EMA-5 and EMA-10, to predict stock ...
详细信息
The increasing adoption of hybrid clouds in organizations stems from their ability to bolster private cloud resources with additional public cloud capacity when required. However, scheduling distributed applications, ...
详细信息
Healthcare data is most unorganized and decentralized in many countries, including India. EHR (Electronic Health Record) has increased its acceptability and importance as it assists in medical research and helps backt...
详细信息
暂无评论