The Internet is fertile ground for free speech: everyone can express their emotions, perceptions, and views online. Especially in the Web 2.0 era, the emergence of blogs provides a wider space ...
Elasticsearch, an open-source distributed search and analytics engine, has been widely used in recent years owing to its characteristics. However, across its wide range of uses and deployments, it is not suitable for all scenarios and requirements. Therefore, this paper proposes a method for optimizing the number of Elasticsearch index shards, based on Elasticsearch full-text retrieval technology and the data features found in practical applications. The method analyzes the remaining storage space and the index shard size on each node of the distributed cluster to determine the optimal number of index shards for the system, which improves data retrieval efficiency. Experimental results show that, compared with traditional methods, the proposed method improves system performance in data distribution, data writing efficiency, and data query latency.
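The abstract does not give the paper's formula, so the following is only a minimal sketch of the general idea of deriving a shard count from per-node free space and shard size. The function name, the 30 GB target shard size, the 0.8 headroom factor, and the rule of rounding up to a multiple of the node count are all assumptions made for illustration, not the authors' method. Replicas are ignored.

```python
import math


def suggest_primary_shards(expected_index_gb, node_free_gb,
                           target_shard_gb=30.0, headroom=0.8):
    """Hypothetical heuristic for a primary shard count.

    expected_index_gb: estimated total size of the index (assumed input)
    node_free_gb:      remaining storage per data node, in GB
    target_shard_gb:   assumed upper bound on the size of one shard
    headroom:          assumed fraction of free space that may be used
    """
    if expected_index_gb <= 0 or target_shard_gb <= 0:
        raise ValueError("sizes must be positive")
    if not node_free_gb:
        raise ValueError("at least one data node is required")

    # Only a fraction of the free space on each node is treated as usable.
    usable_gb = sum(free * headroom for free in node_free_gb)
    if usable_gb < expected_index_gb:
        raise ValueError("cluster does not have enough usable space")

    # Enough shards that no shard exceeds the target size.
    by_size = math.ceil(expected_index_gb / target_shard_gb)

    # Round up to a multiple of the node count so shards can spread evenly.
    nodes = len(node_free_gb)
    return math.ceil(by_size / nodes) * nodes


# Illustrative numbers only: a 300 GB index on three nodes.
shards = suggest_primary_shards(300, [500, 400, 450])
```

A value computed this way would typically be supplied as the `number_of_shards` index setting when the index is created, since the primary shard count of an existing index cannot be changed in place.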
ISBN: (print) 9781665404457; 9781665447119
Data mining is the focus of big data applications in various fields, and data preprocessing is a crucial step in the data mining process. With the development of the information society and the spread of database applications, educational data has grown explosively, and the data on poor students has become informative. However, the data on poor students collected by real student financial aid management systems generally suffers from missing values, attribute redundancy, and noise. To solve this problem, we propose a novel data preprocessing method called DPBP. The DPBP approach consists of four stages: data preparation, characteristic scoping, characteristic combination, and missing-number filtering. First, we prepare the dataset by extracting data. Next, the characteristic range is limited according to the experimental results of feature selection algorithms. Then, the third stage performs feature combination to obtain the feature decomposition sets. Finally, based on accuracy and missing number, we obtain the optimal dataset. A series of experimental results shows that the proposed method significantly improves data quality and stability.
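The final stage, choosing a dataset "based on accuracy and missing number", can be illustrated with a small sketch. The abstract does not say how the two criteria are combined, so the rule below (highest accuracy first, fewer incomplete rows as the tie-breaker) is an assumption, as are the function names, the field names, and the accuracy values, which are placeholders rather than results from the paper.

```python
def missing_count(rows, features):
    """Count rows that have a missing (None) value in any of `features`."""
    return sum(any(row.get(f) is None for f in features) for row in rows)


def pick_feature_set(rows, candidates, accuracy):
    """Pick the candidate feature set with the highest accuracy.

    Ties are broken in favour of the set with fewer incomplete rows.
    `accuracy` maps each candidate (a tuple of feature names) to a score
    obtained elsewhere, e.g. from a classifier trained on that subset.
    """
    return max(candidates,
               key=lambda fs: (accuracy[fs], -missing_count(rows, fs)))


# Hypothetical records and scores, for illustration only.
rows = [
    {"income": 1200, "family_size": 4, "region": "A"},
    {"income": None, "family_size": 3, "region": "B"},
    {"income": 900, "family_size": None, "region": "A"},
]
candidates = [("income", "family_size"),
              ("income", "region"),
              ("family_size", "region")]
accuracy = {candidates[0]: 0.80, candidates[1]: 0.80, candidates[2]: 0.75}

best = pick_feature_set(rows, candidates, accuracy)
```

Here the first two candidates tie on the placeholder accuracy, so the one with fewer incomplete rows is selected.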
Rapidly growing urbanization is putting more pressure on the well-being of citizens and the environment. Applying the latest ICT technology to address urban transport challenges is a key strategy to release the new in...
In this paper, we design and develop a Vehicle License Plate Recognition (VLPR) system, which is one part of a comprehensive video management platform for parking lots. Combined with an intelligent video analysis module, the ...
As the largest class of small non-coding RNAs, piRNAs are primarily present in the reproductive cells of mammals and influence post-transcriptional processes of mRNAs in multiple ways. Effective methods for predicting ...
Traditional computer portrait caricature systems mainly exaggerate and deform real images directly, which causes the background of the facial image to be deformed as well when the face is exaggerated. If in pret...
ISBN: (print) 9781509009107
This paper investigates the problem of collision-free coordination tracking control for multi-agent systems under modeling uncertainties, actuator faults and *** syncretizing the Null-Space-based Behavioral (NSB) control and finite-time control method, a novel reference velocity signal is predesigned to achieve finite-time obstacle avoidance and coordination *** a set of finite-time coordination control laws are presented to guarantee that all the agents track a dynamic target while avoiding obstacles/***, numerical simulation is presented to demonstrate the efficacy of the control strategy.
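For context, the standard two-task NSB composition can be written as below. This is the textbook form, not necessarily the reference velocity designed in the paper, and all symbols are assumptions introduced here: $v_1$ and $v_2$ are the velocity commands of the higher-priority task (e.g. obstacle avoidance) and the lower-priority task (e.g. target tracking), $J_1$ is the Jacobian of the higher-priority task, and $J_1^{\dagger}$ is its pseudoinverse.

```latex
v_d = v_1 + \left( I - J_1^{\dagger} J_1 \right) v_2
```

The lower-priority command is projected onto the null space of the higher-priority task, so it only acts in directions that do not disturb that task.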
Cloud computing is an emerging technology in computer science. To provide high performance for servers, resource scheduling is an important issue in cloud computing. To solve this problem, many load-balanc...
ISBN: (print) 9781467392013
Real-time synthetic aperture radar (SAR) imaging has recently become a research hotspot in remote sensing and military applications. Because the SAR imaging algorithm is highly data- and computation-intensive, it is well suited to acceleration on hybrid storage systems, e.g., a cluster. To design a high-performance SAR algorithm, a prerequisite is to maximize the parallelizability of the algorithm, given the multi-level parallelization features of the cluster platform. Focusing on large-scale data, we explore the concurrency characteristics of the SAR imaging algorithm on a hybrid storage system and propose several parallel optimization techniques to accelerate it. Based on this study, we implement a parallel SAR imaging algorithm and evaluate its performance. Experimental results show that the optimized SAR imaging program achieves high utilization of the high-speed network and an obvious improvement in performance.