Floating point computing ability is an important concern in high performance scientific application and engineering computing. Although as a fundamental operation, floating point division (or reciprocal) has long been...
详细信息
We explore four local learning versions of regularization networks. While global learning algorithms create a global model for all testing points, the local learning algorithms use neighborhoods to learn local paramet...
详细信息
Current research on audio signal processingalgorithms for digital hearing aid devices is extremely pushing the performance demands. Nowadays, there is a trend of using several microphones in such systems (e.g., binau...
详细信息
Current research on audio signal processingalgorithms for digital hearing aid devices is extremely pushing the performance demands. Nowadays, there is a trend of using several microphones in such systems (e.g., binaural systems) to improve the speech perception of a hearing impaired person. However, there is a lack of mobile platforms, capable of processing such algorithms in real-time. this paper presents a new mobile SoC-based evaluation and development platform (including a multi-channel audio extension board), specially thought not only for evaluating new hearing aid signal processingalgorithms but also to develop new hardware co-processor architectures, that could be integrated in current hearing aid devices to improve their performance with a minimal extra energy consumption.
Multi-target tracking in video has been a research focus, withthe combination of many fields, such as computer vision, artificial intelligence, pattern matching. In this paper, we present an efficient multi-target re...
详细信息
Distributed computing technology has been widely used to solve complex problems appearing in parallelprocessing systems. Job scheduling is very important in many distributed computing systems, like grid systems and h...
详细信息
Organizations have begun outsourcing management of their data to third party cloud service providers after the introduction of Database as a Service (DAS) model. A cloud database is a database that typically runs on a...
详细信息
ISBN:
(纸本)9781479917976
Organizations have begun outsourcing management of their data to third party cloud service providers after the introduction of Database as a Service (DAS) model. A cloud database is a database that typically runs on a cloud computing platform, such as Amazon EC2, GoGrid, Salesforce and Rackspace. But outsourcing the data raises concerns over privacy. A typical solution is to store databases in encrypted form on the remote server. Queried records are downloaded from the server and decrypted for further processing. Bucketization is one technique for executing queries over encrypted data on a DAS server. this paper is an extension to work done by other researchers [1-4]. Query Optimal Bucketization (QOB) algorithm [1-2] divides the server data into buckets subject to an optimality constraint. In an earlier paper [3], the authors proposed Binary Query Bucketization (BQB) to improve the search time for bucketized datasets and reduce the number of records that are processed by QOB. In this paper, we propose a parallel Binary Query Bucketization (PBQB) algorithm to query records located in the DAS. It integrates parallel search [4] and BQB. parallel search divides the search workload into chunks with each thread/processor working on a chunk. Simulation is used to assess the numerical performance of PBQB. It is shown that the proposed algorithm outperforms BQB.
ADAS (Advanced Driver Assistance Systems) algorithms increasingly use heavy image processing operations. To embed this type of algorithms, semiconductor companies offer many heterogeneous architectures. these SoCs (Sy...
详细信息
ADAS (Advanced Driver Assistance Systems) algorithms increasingly use heavy image processing operations. To embed this type of algorithms, semiconductor companies offer many heterogeneous architectures. these SoCs (System on Chip) are composed of different processing units, with different capabilities, and often with massively parallel computing unit. Due to the complexity of these SoCs, predicting if a given algorithm can be executed in real time on a given architecture is not trivial. In fact it is not a simple task for automotive industry actors to choose the most suited heterogeneous SoC for a given application. Moreover, embedding complex algorithms on these systems remains a difficult task due to heterogeneity, it is not easy to decide how to allocate parts of a given algorithm on the different computing units of a given SoC. In order to help automotive industry in embedding algorithms on heterogeneous architectures, we propose a novel approach to predict performances of image processingalgorithms applicable on different types of computing units. Our methodology is able to predict a more or less wide interval of execution time with a degree of confidence using only high level description of algorithms, and a few characteristics of computing units.
Nyström method and low-rank linearized Support Vector Machines (SVMs) are two widely used methods for scaling up kernel SVMs, both of which need to sample part of columns of the kernel matrix to reduce the size. ...
详细信息
In this paper, we parallelize the collision detection of five- axis machining as an example to show how to execute CNC applications on Graphics processing Unit (GPU). We first design and implement an efficient collisi...
详细信息
We develop an efficient parallel algorithm for answering shortest-path queries in planar graphs and implement it on a multi-node CPU-GPU clusters. the algorithm uses a divide-and-conquer approach for decomposing the i...
详细信息
ISBN:
(纸本)9783319265209;9783319265193
We develop an efficient parallel algorithm for answering shortest-path queries in planar graphs and implement it on a multi-node CPU-GPU clusters. the algorithm uses a divide-and-conquer approach for decomposing the input graph into small and roughly equal subgraphs and constructs a distributed data structure containing shortest distances within each of those subgraphs and between their boundary vertices. For a planar graph with n vertices, that data structure needs O(n) storage per processor and allows queries to be answered in O(n(1/4)) time.
暂无评论