The processing of XML queries can result in evaluation of various structural relationships. Efficient algorithms for evaluating ancestor-descendant and parent-child relationships have been proposed. Whereas the proble...
详细信息
The processing of XML queries can result in evaluation of various structural relationships. Efficient algorithms for evaluating ancestor-descendant and parent-child relationships have been proposed. Whereas the problems of evaluating preceding-sibling-following-sibling and preceding-following relationships are still open. In this paper, we studied the structural join and staircase join for sibling relationship. First, the idea of how to filter out and minimize unnecessary reads of elements using parent's structural information is introduced, which can be used to accelerate structural joins of parent-child and preceding-sibling-following-sibling relationships. Second, two efficient structural join algorithms of sibling relationship are proposed. These algorithms lead to optimal join performance: nodes that do not participate in the join can be judged beforehand and then skipped using B^+-tree index. Besides, each element list joined is scanned sequentially once at most. Furthermore, output of join results is sorted in document order. We also discussed the staircase join algorithm for sibling axes. Studies show that, staircase join for sibling axes is close to the structural join for sibling axes and shares the same characteristic of high efficiency. Our experimental results not only demonstrate the effectiveness of our optimizing techniques for sibling axes, but also validate the efficiency of our algorithms. As far as we know, this is the first work addressing this problem specially.
Geometric constraint problem can be transformed to an optimization problem which the objective function and constraints are non-convex functions. In this paper an evolutionary algorithm based on ant colony optimizatio...
详细信息
Geometric constraint problem can be transformed to an optimization problem which the objective function and constraints are non-convex functions. In this paper an evolutionary algorithm based on ant colony optimization algorithm and the immune system model is proposed to provide solution to the geometric constraints problem. In the new algorithm, affinity calculation process and pheromone trail lying is embedded to maintain diversity and carry out the global search and the local search in many directions rather than one direction around the same individual simultaneously. This new algorithm different with current optimization methods in that it gets the good solution by excluding bad solutions. The experimental results reported here will shed more light into how affects the hybrid algorithm's search power in solving geometric constraint problem.
Monitoring on data streams is an efficient method of acquiring the characters of data stream. However the available resources for each data stream are limited, so the problem of how to use the limited resources to pro...
详细信息
Monitoring on data streams is an efficient method of acquiring the characters of data stream. However the available resources for each data stream are limited, so the problem of how to use the limited resources to process infinite data stream is an open challenging problem. In this paper, we adopt the wavelet and sliding window methods to design a multi-resolution summarization data structure, the Multi-Resolution Summarization Tree (MRST) which can be updated incrementally with the incoming data and can support point queries, range queries, multi-point queries and keep the precision of queries. We use both synthetic data and real-world data to evaluate our algorithm. The results of experiment indicate that the efficiency of query and the adaptability of MRST have exceeded the current algorithm, at the same time the realization of it is simpler than others.
Advances in wireless sensor networks and positioning technologies enable new applications monitoring moving objects. Some of these applications, such as traffic management, require the possibility to query the future ...
详细信息
Advances in wireless sensor networks and positioning technologies enable new applications monitoring moving objects. Some of these applications, such as traffic management, require the possibility to query the future trajectories of the objects. In this paper, we propose an original data access method, the ANR-tree, which supports predictive queries. We focus on real life environments, where the objects move within constrained networks, such as vehicles on roads. We introduce a simulation-based prediction model based on graphs of cellular automata, which makes full use of the network constraints and the stochastic traffic behavior. Our technique differs strongly from the linear prediction model, which has low prediction accuracy and requires frequent updates when applied to real traffic with velocity changing frequently. The data structure extends the R-tree with adaptive units which group neighbor objects moving in the similar moving patterns. The predicted movement of the adaptive unit is not given by a single trajectory, but instead by two trajectory bounds based on different assumptions on the traffic conditions and obtained from the simulation. Our experiments, carried on two different datasets, show that the ANR-tree is essentially one order of magnitude more efficient than the TPR-tree, and is much more scalable.
When tri-axial magnetometer is used to estimate a magnetic field's module, different results will be obtained in different attitudes. Focusing on the diversionary errors of tri-axial magnetometer, A new adaptive c...
详细信息
When tri-axial magnetometer is used to estimate a magnetic field's module, different results will be obtained in different attitudes. Focusing on the diversionary errors of tri-axial magnetometer, A new adaptive calibration method based on artificial neural networks (ANN) was proposed. The relationships between the diversionary errors and the tri-axial magnetometer's system parameters, such as nonorthogonality between axes, different sensitivity and zero-shift of each axes, were analyzed. Through the corresponding ANN structure, a mathematical adaptive calibration model was established, which could be used to estimate the magnetic field's module. The simulation and practical experiment's results show that the diversionary errors of tri-axial magnetometer can be rectified under 10nT from 500nT. Without expensive, complex and special equipments, the presented calibration method is more suitable for tri-axial magnetometer.
Sequential pattern mining is an important method in data mining. Traditional mining algorithms are not adapted to the fast, unlimited, continuous and dynamic data stream because they are multiple pass in scanning data...
详细信息
Sequential pattern mining is an important method in data mining. Traditional mining algorithms are not adapted to the fast, unlimited, continuous and dynamic data stream because they are multiple pass in scanning database. Some approximate sequential pattern mining algorithms are proposed recently which cost too many system resources in sequence compare process. A sequential compare method based on Levenshtein-Automata is proposed in this paper. This method build state conversion model with pretreatment which can finish computing the sequences' similarity in linear time. A combination of Levenshtein-Automata computation and common computation of edit distance is presented in allusion to the Levenshtein-Automata's problem of using too much memory, so a tradeoff between time cost and space cost is implemented. The experiment result shows this method is effective and efficient.
The skyline query is frequently used to find a set of dominating data points (called skyline points) in a multidimensional dataset It is one of the most important query methods for database, datastream, P2P networks. ...
详细信息
The skyline query is frequently used to find a set of dominating data points (called skyline points) in a multidimensional dataset It is one of the most important query methods for database, datastream, P2P networks. However, it has not been implemented in sensor networks due to limited energy of the sensor nodes. This paper presents an energy-efficient approximate skyline query scheme for sensor networks. According to the experiments, this scheme can greatly improve the lifetime of sensor networks compared to the naive skyline query.
keyword Search Over Relational databases (KSORD) enables casual or Web users easily access databases through free-form keyword queries. Improving the performance of KSORD systems is a critical issue in this area. In...
详细信息
keyword Search Over Relational databases (KSORD) enables casual or Web users easily access databases through free-form keyword queries. Improving the performance of KSORD systems is a critical issue in this area. In this paper, a new approach CLASCN (Classification, Learning And Selection of Candidate Network) is developed to efficiently perform top-κ keyword queries in schema-graph-based online KSORD systems. In this approach, the Candidate Networks (CNs) from trained keyword queries or executed user queries are classified and stored in the databases, and top-κ results from the CNs are learned for constructing CN Language Models (CNLMs). The CNLMs are used to compute the similarity scores between a new user query and the CNs from the query. The CNs with relatively large similarity score, which are the most promising ones to produce top-κ results, will be selected and performed. Currently, CLASCN is only applicable for past queries and New All-keyword-Used (NAU) queries which are frequently submitted queries. Extensive experiments also show the efficiency and effectiveness of our CLASCN approach.
Load shedding has been widely used in data stream management systems (DSMSs) to keep DSMSs running steadily. One key problem in load shedding is determining how much system load to shed. Existing works tend to adapt c...
详细信息
Load shedding has been widely used in data stream management systems (DSMSs) to keep DSMSs running steadily. One key problem in load shedding is determining how much system load to shed. Existing works tend to adapt coarse algorithm (CA) to solve this problem. In this paper, we present an adaptive PI controller-based load shedding framework for data stream. The main contribution of this paper is our use of feedback control theory to design the load shedding scheme. In contrast to the existing approaches, we firstly apply system identification to establish a dynamic model to describe DSMS, which enables us analyze DSMS quantitatively. Then, based on the model, we use the Root Locus method to design the PI controller with proven performance guarantees. The adaptive framework has been implemented by modifying Borealis system. Theoretic analysis and experimental results demonstrate that our approach is robust even when system load changes frequently. Comparing to existing strategies, our approach also achieves significantly better performance.
Recently there have been growing interests in the applications of wireless sensor networks. Innovative techniques that improve energy efficiency to prolong the network lifetime are highly required. Clustering is an ef...
详细信息
暂无评论