Active Learning and co-training are cases of semi-supervised learning both are used when labeled data is scarce. Active learning attempts to improve learning model by querying over unlabeled data and the main challeng...
详细信息
ISBN:
(纸本)9781509064953
Active Learning and co-training are cases of semi-supervised learning both are used when labeled data is scarce. Active learning attempts to improve learning model by querying over unlabeled data and the main challenge there, is to find the optimum instance query. And co-training tries to exploit two different feature sets to enlarge number of labeled data without any need to get external information. Several researches tried to couple these two methods and get best out of them and they achieve noteworthy results. But we have witnessed that using co-training and active learning in sequence architecture outperforms when they are working in parallel. Using them in sequence means we have used co-training techniques to just find the best queries for active learning, and not in learning process itself. We will demonstrate that it has better results than plain active learning and co-training and even current parallel architectures. For this work we have used different techniques to split data into two distinct datasets; we will also discuss about it alongside our query selection method.
The accomplishments on classifier ensembles originate the studies of clustering ensembles. In this study the factors on performance of clustering ensembles (clustering algorithm, the number of features used in cluster...
详细信息
The accomplishments on classifier ensembles originate the studies of clustering ensembles. In this study the factors on performance of clustering ensembles (clustering algorithm, the number of features used in clustering, the size of ensemble, the decision combining algorithm) are investigated and compared on 15 benchmark datasets. The decisions of clustering algorithms based on different feature subsets are combined. On the process of decision combination, the graph partition algorithms are averaged successful while hierarchical algorithms have best individual successes. The number of features used in clustering algorithms increases the success. The size of clustering ensemble is also direct proportional with clustering performance. Kmeans and fuzzy-kmeans are best clustering algorithms over our experimented datasets.
We are proposing a novel framework that ameliorates locality-aware parallel programming models, by defining a hierarchical data locality model extension. We also propose two hierarchical thread partitioning algorithms...
详细信息
We are proposing a novel framework that ameliorates locality-aware parallel programming models, by defining a hierarchical data locality model extension. We also propose two hierarchical thread partitioning algorithms. These algorithms synthesize hierarchical thread placement layouts that targets minimizing the program's overall communication costs. We demonstrate the effectiveness of our approach using the NAS Parallel Benchmarks implemented in Unified Parallel C (UPC) using a modified Berkeley UPC Compiler and runtime system. We achieved performance gains of up to 88% in performance by applying the placement layouts our algorithms suggest.
The paper identifies a number of issues that are believed to be important for hardware/software codesign. The issues are illustrated by a small comprehensible example: a priority queue. Based on simulations of a real ...
详细信息
ISBN:
(纸本)9780818663154
The paper identifies a number of issues that are believed to be important for hardware/software codesign. The issues are illustrated by a small comprehensible example: a priority queue. Based on simulations of a real application, we suggest a combined hardware/software realization of the priority queue. A priority queue is a data structure with a simple interface which in many applications is a performance bottleneck.< >
Nature is always a source of inspiration. In last few decades, the research is stimulated on new computing paradigms and result of this effort is emergence of new problem solving techniques like Nature Inspired Comput...
详细信息
Nature is always a source of inspiration. In last few decades, the research is stimulated on new computing paradigms and result of this effort is emergence of new problem solving techniques like Nature Inspired Computing, Evolutionary Computing. Nature inspired problem solving techniques are widely used to solve complex problems. These techniques are widely used due to their decentralized and self-organized behavior. Such behavior is observed in social systems such as artificial bee colony algorithm, particle swarm optimization, ant colony optimization, bat algorithm, firefly algorithm, glowworm swarm optimization etc. In this paper we have given overview of nature inspired techniques used for data clustering, hybridization with traditional clustering techniques and their effectiveness.
The online biological data analytics tool GeneWeaver [1] uses a fast algorithm to directly compute k-cliques between different sets of data. By caching such results, we are able to compute k-clique faster in some case...
详细信息
The online biological data analytics tool GeneWeaver [1] uses a fast algorithm to directly compute k-cliques between different sets of data. By caching such results, we are able to compute k-clique faster in some cases. We derived a formula to determine whether or not to cache a result. In order to know if the cached results can be used to compute a desired k-clique, we also created a new algorithm to solve the generalized set coverage problem.
Dynamic Visual Sensors (DVS) output pixel information asynchronously as an address event, only transmitting information for pixels where the intensity change exceeds a threshold, thereby reducing redundant data genera...
详细信息
ISBN:
(数字)9798350365221
ISBN:
(纸本)9798350365238
Dynamic Visual Sensors (DVS) output pixel information asynchronously as an address event, only transmitting information for pixels where the intensity change exceeds a threshold, thereby reducing redundant data generation at the source. They possess advantages such as low-latency response and high efficiency with low power consumption. However, factors such as thermal noise and leakage current can lead to noise events even when the light intensity remains constant, which affects image quality and subsequent processing. In this paper, we propose a new method for calculating event density called Local Density Segmentation (LSD), which divides events and their surrounding neighborhoods into different regions for independent event density calculation. Based on LSD-calculated event density, we further propose a denoising method named Local Density Segmentation K-means (LSD-K), which utilizes event density as a feature to select and remove noise events. Finally, through experiments, we validate the performance of our algorithm.
To efficiently process recursive queries in a DBMS (database management system), a parallel, direct transitive closure algorithm is proposed. Efficiency is obtained by reorganizing the computation order of Warren'...
详细信息
To efficiently process recursive queries in a DBMS (database management system), a parallel, direct transitive closure algorithm is proposed. Efficiency is obtained by reorganizing the computation order of Warren's algorithm. The number of transfers among processors depends only on the number of processors and does not depend on the depth of the longest path. The evaluation shows an improvement due to the parallelism and the superiority of the proposed algorithm over recent propositions. The speed of the production of new tuples is very high and the volume of transfers between the sites is reduced.< >
This paper presents a new method for the management of buffer-stocks for textile items by a sales partition approach. The proposed algorithm enables to partition the set of items into different optimal classes, so tha...
详细信息
This paper presents a new method for the management of buffer-stocks for textile items by a sales partition approach. The proposed algorithm enables to partition the set of items into different optimal classes, so that items belonging to each class follow the same forecasting sales graph. This method uses a sequence of merging of classes. In this sequence, an internal index measuring the compactness inside each class and the separability between different classes enables to determine the best partition. Then, for each class created, the compactness is compared to the initial compactness criterion given by the industrial manager and, the class which doesn't respect the classification accuracy is rejected. A faster searching of the forecasted sales graph for an item is thus achieved by the determination of a mean graph corresponding to its family sale.
An efficient layout design algorithm is presented. The algorithm simultaneously perform the placement and the global routing for the design of using FPGA with hierarchical interconnection structure (HFPGA). It is base...
详细信息
An efficient layout design algorithm is presented. The algorithm simultaneously perform the placement and the global routing for the design of using FPGA with hierarchical interconnection structure (HFPGA). It is based on a k-way min-cut placement technique. The min-cut technique generates a partitioning tree which is then used to assign logic blocks to realize the function of modules in the netlist. The partitioning tree is also used to find the routing paths of interconnections between the logic blocks. Investigation is performed experimentally by implementing a set of industrial circuits using the algorithm.
暂无评论