over the last few years, Grid technologies have progressed towards a service-oriented paradigm that enables a new way of service provisioning based on utility computing models. Users consume these services based on th...
详细信息
In this paper, a framework for replacing missing values in a database is proposed since a real-world database is seldom complete. Good data quality in a database can directly improve the performance of any data mining...
详细信息
It has been an important task of discovering frequent fragments as particular patterns from large sequence databases generated from a variety of applications. In general, the patterns to be discovered may partially an...
详细信息
It has been an important task of discovering frequent fragments as particular patterns from large sequence databases generated from a variety of applications. In general, the patterns to be discovered may partially and asynchronously exist in sequences, and even contain gaps. In addition, it is necessary to collect the information regarding the locations and frequencies of the patterns. How to enumerate candidate patterns for evaluation without exponentially increasing the computation is another problem. In this paper, the modified periodicity transform is proposed to meet the requirements mentioned above. Also, a distributed computing framework is implemented to perform the mining task more efficiently. Both synthetic and biological sequences are utilized to examine the approach. The experimental results demonstrate the efficiency and effectiveness the system.
Over the last few years, grid technologies have progressed towards a service-oriented paradigm that enables a new way of service provisioning based on utility computing models. Users consume these services based on th...
详细信息
Over the last few years, grid technologies have progressed towards a service-oriented paradigm that enables a new way of service provisioning based on utility computing models. Users consume these services based on their QoS (quality of service) requirements. In such "pay-per-use" grids, workflow execution cost must be considered during scheduling based on users' QoS constraints. In this paper, we propose a cost-based workflow scheduling algorithm that minimizes execution cost while meeting the deadline for delivering results. It can also adapt to the delays of service executions by rescheduling unexecuted tasks. We also attempt to optimally solve the task scheduling problem in branches with several sequential tasks by modeling the branch as a Markov decision process and using the value iteration method
The Generalized Temporal Role-Based Access Control (GTRBAC) model provides a comprehensive set of temporal constraint expressions which can facilitate the specification of fine-grained time-based access control polici...
详细信息
In this paper, a framework for replacing missing values in a database is proposed since a real-world database is seldom complete. Good data quality in a database can directly improve the performance of any data mining...
详细信息
In this paper, a framework for replacing missing values in a database is proposed since a real-world database is seldom complete. Good data quality in a database can directly improve the performance of any data mining algorithm in various applications. Our proposed framework adopts the basic concepts from conditional probability theories and further develops an algorithm to facilitate the capability of handling both nominal and numerical values, which addresses the problem of the inability of handling both nominal and numerical values with a high degree of accuracy in the existing algorithms. Several experiments are conducted and the experimental results demonstrate that our framework provides a high accuracy when compared with most of the commonly used algorithms such as using the average value, using the maximum value, and using the minimum value to replace missing values.
Development of Web-based multimedia applications is expected to hold central importance for engineering and technological progress during the rest of this decade. It is already opening up new research frontiers in a n...
详细信息
ISBN:
(纸本)1581139756
Development of Web-based multimedia applications is expected to hold central importance for engineering and technological progress during the rest of this decade. It is already opening up new research frontiers in a number of areas such as multimedia data modeling and indexing, data mining, multimedia document management, semantic Web, pervasive computing, distributed sensor networks, computer security, real-time operating systems, human-computer interaction, and storage technology etc. As a result of concerted effort in these areas, many Web-accessible multimedia applications involving different media types, e.g., video, audio, text, images, animation and graphics, are rapidly emerging Examples of such applications abound in the domains of health care, education, entertainment, manufacturing, e-commerce, digital libraries as well as military and critical national infrastructures. The premise is that the integration of Web and multimedia technologies can provide cost effective solutions for management and dissemination of information, which is a primary tool for increasing economic efficiency. Development of Web-based multimedia applications needs a broad range of technological solutions that deal with organizing, storing, and delivering multimedia information in an integrated, secure and timely manner with guaranteed quality of service (QoS). Multimedia database management, when viewed in conjunction with integration of contents from independent Web-based data sources, present formidable research and development challenges. Key challenges include: content analysis and indexing of distributed multimedia data and documents semantic modeling and knowledge-based representation of multimedia data transformation and organization of multimedia data semantics as a part of Semantic Web security, privacy and QoS related issues concerning Web-based multimedia database applications emerging Web standards and their role in managing distributed multimedia databases In this talk,
Peer-to-peer (P2P) networks are generally considered to be free havens for pirated content, in particular with respect to music. We describe a solution for the problem of copyright infringement in P2P networks for mus...
详细信息
In this paper, we present BioGrid, a novel computing resource that combines advantages of grid computing technology with bioinformatics parallel applications. The grid environment permits the sharing of a large amount...
详细信息
暂无评论