When given a small sample, we show that classification with SVM can be considerably enhanced by using a kernel function learned from the training data prior to discrimination. This kernel is also shown to enhance retr...
详细信息
ISBN:
(纸本)1595933832
When given a small sample, we show that classification with SVM can be considerably enhanced by using a kernel function learned from the training data prior to discrimination. This kernel is also shown to enhance retrieval based on data similarity. Specifically, we describe KernelBoost - a boosting algorithm which computes a kernel function as a combination of 'weak' space partitions. The kernel learning method naturally incorporates domain knowledge in the form of unlabeled data (i.e. in a semi-supervised or transductive settings), and also in the form of labeled samples from relevant related problems (i.e. in a learning-to-learn scenario). The latter goal is accomplished by learning a single kernel function for all classes. We show comparative evaluations of our method on datasets from the UCI repository. We demonstrate performance enhancement on two challenging tasks: digit classification with kernel SVM, and facial image retrieval based on image similarity as measured by the learnt kernel.
Selecting appropriate resources for running a job efficiently is one of the common objectives in a computational grid. Resource scheduling should consider the specific characteristics of the application, and decide th...
详细信息
Selecting appropriate resources for running a job efficiently is one of the common objectives in a computational grid. Resource scheduling should consider the specific characteristics of the application, and decide the metrics to be used accordingly. This paper presents a distributed resource scheduling framework mainly consisting of a job scheduler and a local scheduler. In order to meet the requirements of different applications, we adopt HGSA, a Heuristic-based Greedy Scheduling Algorithm, to schedule jobs in the grid, where the heuristic knowledge is the metric weights of the computing resources and the metric workload impact factors. The metric weight is used to control the effect of the metric on the application. For different applications, only metric weights and the metric workload impact factors need to be changed, while the scheduling algorithm remains the same. Experimental results are presented to demonstrate the adaptability of the HGSA.
In order to integrate the distributed computing resources in Zhejiang University (ZJU), a campus grid is constructed by using the hierarchical combination of middleware, Globus Toolkit and Sun Grid Engine. As one real...
详细信息
In order to integrate the distributed computing resources in Zhejiang University (ZJU), a campus grid is constructed by using the hierarchical combination of middleware, Globus Toolkit and Sun Grid Engine. As one realization of the computational grid, the foundation of the campus grid provides a large amount of scientificcomputation cycles. Furthermore, our grid system adopts two different approaches, Java 3D and ParaView, to manipulate a collaborative visualization of the computed data respectively. Therefore the users can choose the most appropriate method according to their requirements. Three cases are tested on the campus grid via portal access. The results demonstrate that the ZJU campus grid provides powerful computation cycles and an efficient collaborative visualization service
This paper describes our efforts to provide a grid-based parallel visualization environment to visualize massive dataset in parallel. The visualization environment is implemented as a visualization service on the grid...
详细信息
This paper describes our efforts to provide a grid-based parallel visualization environment to visualize massive dataset in parallel. The visualization environment is implemented as a visualization service on the grid. This paper focuses on deploying a proxy process on the master node of each cluster in the grid, to ensure that Globus jobs can be scheduled on internal nodes of clusters that have only local IP addresses. We have conducted an experiment to visualize in parallel a dataset of computational domains in a grid environment with PC clusters
A parallel plant ecosystem simulation, running on a computer cluster with commodity graphic cards, is performed to simulate and visualize large groups of plants. With a scalable architecture, the system can simulate v...
详细信息
A parallel plant ecosystem simulation, running on a computer cluster with commodity graphic cards, is performed to simulate and visualize large groups of plants. With a scalable architecture, the system can simulate very large and complex plant ecosystem in much shorter time than the traditional. The extreme complexity is first simplified by using multilevel models, and then dividing into multiple parts and simulating parallel. The entire simulation process is visualized in an immersive mode approximately in real time by rendering the results locally and concatenated to a large display wall
Developing collaborative applications presents a challenge in solving the communication and synchronization problems. In this paper, we introduce a framework created to assist the developers. The framework takes the a...
详细信息
Developing collaborative applications presents a challenge in solving the communication and synchronization problems. In this paper, we introduce a framework created to assist the developers. The framework takes the advantage of the access grid (AG) components, venue server, venue client and event channel, aiming to build a collaborative environment. In addition, the framework contains two user interface components, a wizard providing an interface to import new applications, and a main frame listing all the applications developed upon the framework. For a testing task, we implemented some collaborative applications using this framework. And the examples demonstrate that the framework enables the users to develop collaborative applications conveniently and efficiently
In this paper, we present a general framework for 2D parallel mesh generation. A prepartitioner for domain decomposition is integrated into the framework, which strives to make the resulting subdomains well-shaped and...
详细信息
In this paper, we present a general framework for 2D parallel mesh generation. A prepartitioner for domain decomposition is integrated into the framework, which strives to make the resulting subdomains well-shaped and thus guarantees the high quality of resulting meshes. Moreover, the time-consuming prepartitioning stage is parallelized in two ways with various grained levels, and the finer grained one turns out preferable. The subdomain connections are stored as a subdomain graph (SDG), which helps prevent mapping disconnected subdomains into a single processor whenever static or dynamic load balancing strategies are adopted. Well partitioned meshes could be generated simultaneously with parallel mesh generation, and hence the cost of mesh repartitioning could be eliminated or reduced. The SDG construction is simplified and independent of domain decomposition by introducing the concept of the characteristic polygon set (CPS). Shared nodes between neighboring subdomains are duplicated, that ensures subdomain meshing be completed with little or without communications, and full code-reuse of serial meshing algorithms be achieved in this framework
Embedding algorithms are a method for revealing low dimensional structure in complex data. Most embedding algorithms are designed to handle objects of a single type for which pairwise distances are specified. Here we ...
详细信息
Ontology for scientific visualization is constructed in our laboratory. Its framework, construction method and design trade-off decision are presented in this paper. The ontology is composed of four main pillars: data...
详细信息
Ontology for scientific visualization is constructed in our laboratory. Its framework, construction method and design trade-off decision are presented in this paper. The ontology is composed of four main pillars: dataset, filter, device and pipeline, and they are then further decomposed into many sub-components. Using this concept of ontology can help scientists and engineers in various aspects such as communicating and sharing, quick learning, knowledge reusing, etc. As an application, we developed a semantic grid service for scientific visualization based on this ontology
We present a general method to obtain convergent approximate value iteration algorithms with function approximation. The result is applicable to any arbitrary approximation architecture and generalizes existing result...
详细信息
暂无评论