In this paper, a framework for replacing missing values in a database is proposed since a real-world database is seldom complete. Good data quality in a database can directly improve the performance of any data mining...
详细信息
In this paper, a framework for replacing missing values in a database is proposed since a real-world database is seldom complete. Good data quality in a database can directly improve the performance of any data mining algorithm in various applications. Our proposed framework adopts the basic concepts from conditional probability theories and further develops an algorithm to facilitate the capability of handling both nominal and numerical values, which addresses the problem of the inability of handling both nominal and numerical values with a high degree of accuracy in the existing algorithms. Several experiments are conducted and the experimental results demonstrate that our framework provides a high accuracy when compared with most of the commonly used algorithms such as using the average value, using the maximum value, and using the minimum value to replace missing values.
In order to manage the whole network, a novel traffic model is proposed based the active probe method. The core of the model has three parts: first, we measure the routing probability matrix by injecting a probe packe...
详细信息
In order to manage the whole network, a novel traffic model is proposed based the active probe method. The core of the model has three parts: first, we measure the routing probability matrix by injecting a probe packet train into the network based on the Internet protocol measurement protocol (IPMP) by the Poisson law. Second, we use a passive measurement method to acquire sd traffic. Last, we can compute link traffic by means of network tomography. We prove the validity of the edge measurement method. Our computing results show the competing link traffic error within about 30% by the edge measurement model.
The large diffusion of the distributed measurement systems (DMS) aims to providing innovative solutions to different situations which appear for both industrial and educational applications. Consequently, advanced har...
详细信息
The large diffusion of the distributed measurement systems (DMS) aims to providing innovative solutions to different situations which appear for both industrial and educational applications. Consequently, advanced hardware platforms and innovative software architectures are demanded. In these situations, the development of novel approaches in software and hardware tools for DMS has fundamental importance. The paper provides an overview of the hardware and software architectures which are currently used to develop DMSs, focusing on advanced topics concerning the DMS management. In particular, the workload, the synchronization and the communication delay aspects are taken into account.
In order to manage the whole network, a novel traffic model is proposed based on the active probe method. The core of the model has three parts: first, we measure routing probability matrix by injecting probe packet t...
详细信息
In order to manage the whole network, a novel traffic model is proposed based on the active probe method. The core of the model has three parts: first, we measure routing probability matrix by injecting probe packet train into the network based on the Internet protocol measurement protocol (IPMP) by Poisson law. Second, we use passive measurement method to acquire sd traffic. Last, we can compute link traffic by means of network tomography. We prove the validity of edge measurement method. Our computing results show the competing link traffic error within about 30% by edge measurement model.
Recent research in mining user access patterns for predicting Web page requests focuses only on consecutive sequential Web page accesses, i.e., pages which are accessed by following the hyperlinks. In this paper, we p...
详细信息
Recent research in mining user access patterns for predicting Web page requests focuses only on consecutive sequential Web page accesses, i.e., pages which are accessed by following the hyperlinks. In this paper, we propose a new method for mining user access patterns that allows the prediction of multiple non-consecutive Web pages, i.e., any pages within the Web site. Our approach consists of two major steps. First, the shortest path algorithm in graph theory is applied to find the distances between Web pages. In order to capture user access behavior on the Web, the distances are derived from user access sequences, as opposed to static structural hyperlinks. We refer to these distances as minimum reaching distance (MRD) information. The association rule mining (ARM) technique is then applied to form a set of predictive rules which are further refined and pruned by using the MRD information. The proposed approach is applied as a collaborative filtering technique to recommend Web pages within a Web site. Experimental results demonstrate that our approach improves performance over the existing Markov model approach in terms of precision and recall, and also has a better potential of reducing the user access time on the Web
We demonstrate electrical properties of an advanced memory structure with high-k dielectric stack of HfAlO/HfSiO/HfXIO and p-type metal gate IrO 2 . Combining advantages of high-k HfAlO, good trapping capability of Hf...
详细信息
We demonstrate electrical properties of an advanced memory structure with high-k dielectric stack of HfAlO/HfSiO/HfXIO and p-type metal gate IrO 2 . Combining advantages of high-k HfAlO, good trapping capability of HfSiO, and high work function of the IrO 2 gate, we were able to attain much better retention with 10-year DeltaV th decay ratio within 18%, higher erasing speed with DeltaV th of 3V within 0.5ms at Vg equiv -12V, and lower operation voltage as well as lower reading voltage, compared to other contending device structures
A variety modeling approaches assume a relatively static manufacturing network structure and focus on optimizing the flow in the DMN (dynamic manufacturing network). These techniques are unable to model structural and...
详细信息
The final goal of this research is to develop a mobile system to collect 3-D information and panoramic image in rubble to search human bodies and situation in the rubble. This system has abilities to collect and deliv...
详细信息
ISBN:
(纸本)0780384636
The final goal of this research is to develop a mobile system to collect 3-D information and panoramic image in rubble to search human bodies and situation in the rubble. This system has abilities to collect and deliver infomation and cooperate with other robots etc. As a part of this project, multiple CMOS cameras are integrated in a sensor head, and software for generating panorama image and distance information is developed.
Grid services provide an important abstract layer on top of heterogeneous components (hardware and software) that take part into a Grid environment. In this scenario, applications, like scientific visualization, requi...
详细信息
ISBN:
(纸本)1581139500
Grid services provide an important abstract layer on top of heterogeneous components (hardware and software) that take part into a Grid environment. In this scenario, applications, like scientific visualization, require access to data of non-conventional data types, like fluid path geometry, and the evaluation of special user programs on these data. In order to support such applications we are developing CoDIMS-G, which is a data and program integration service for the Grid. CoDIMS-G provides users transparent access to data and programs distributed on the Grid, as well as dynamic resource allocation and management. We conceived a new node scheduling algorithm and designed an adaptive distributed query engine for the grid environment. Copyright 2004 ACM.
暂无评论