检索结果-内蒙古大学图书馆

International Conference on Mechatronic Science, Electric Engineering and Computer

作者： Meirui Meihui Weiping Zhang Tao Wang Ning Tian Jinbao Li Longjiang Guo School of Computer Science and Technology Heilongjiang University Harbin 150080 China Harbin Research Institute of Electrical Instrumentation 150028 China School of Computer Science and Technology Heilongjiang University Harbin China Key Laboratory of Database and Parallel Computing of Heilongjiang Province Harbin China

ISBN: (纸本)9781479925667

Based on the principle of QR loop iterations, this paper implements a parallel algorithm based on the hardware of GPU (Graphic Process Unit) by using routines from CUDA (Computer Unified Device Architecture) to find the eigenvalues of general matrices. CPU and GPU computing card form a client-server computing framework. Here, CPU can be regarded as a client and GPU card is considered as a computing server. For the experiment environment, this paper chooses a GPU card with the model of NVIDIA GeForce GTX460 as a server side and a CPU with the model of Intel Core i5-760 quad-core as a client side. Win7 64-bit is selected as the operating system. The parallel implementation consists of two parts:PA H and PA QR. PA H is a procedure that transforms a matrix A into the Hessenberg matrix B. PA QR is the actual parallel algorithm of the QR iterations that is imposed on the Hessenberg matrix B for finding eigenvalues of the general matrix A. The speedup ratio of the proposed algorithm is jarless when the number of iterations becomes greater. The experimental results show that the parallel implementation with CUDA on GPU only makes use of less running time than traditional sequential algorithms. The speedup ratio of PA H is between 1.79 and 7.81. The speedup ratio of PA QR is between 3.24 and 118.9. Especially, when the order of general matrix equals 8192, the amount of iterations becomes 10000. The speedup ratio of the PA-H and the PA QR can run up to 7.81 and 118.9 respectively.

关键词： Graphics processing units Eigenvalues and eigenfunctions Hardware Kernel Instruction sets

来源：评论

学校读者我要写书评

暂无评论

parallel Algorithm for Approximate String Matching with K Differences

Parallel Algorithm for Approximate String Matching with K Di...

引用

International Conference on Networking, Architecture, and Storage (NAS)

作者： Longjiang Guo Shufang Du Meirui Ren Yu Liu Jinbao Li Jing He Ning Tian Keqin Li Key Laboratory of Database and Parallel Computing Heilongjiang China Heilongjiang University School of Computer Science and Technology Harbin China Department of Computer Science Kennesaw State University Kennesaw Georgia USA Department of Computer Science Srate University of New York New Paltz New York USA

Approximate string matching using the k-difference technique has been widely applied to many fields such as pattern recognition and computational biology. Data dependency exists in the traditional sequential algorithm. Therefore, it is hard to design a parallel algorithm for approximate string matching with k differences. This paper presents a technique to eliminate data dependency. Based on this technique, this paper also presents a parallel algorithm which can calculate the elements in the same row of the edit distance matrix in parallel by eliminating data dependency. The algorithm has high parallelism, but requires synchronization. To validate the proposed algorithm, it is implemented on GPU and multiple-core CPUs. Moreover, the CUDA optimization techniques are also presented in the paper. Finally, experimental results show that, compared with the traditional sequential algorithm on CPU with twenty-four cores, the proposed parallel algorithm achieves speedup of 7-42 on GPU.

关键词： parallel algorithms Graphics processing units Approximation algorithms Algorithm design and analysis Heuristic algorithms Pattern matching

来源：评论

学校读者我要写书评

暂无评论

An Efficient Graph Isomorphism Algorithm Based on Canonical Labeling and Its parallel Implementation on GPU

An Efficient Graph Isomorphism Algorithm Based on Canonical ...

引用

IEEE International Conference on High Performance computing and Communications (HPCC)

作者： Renda Wang Longjiang Guo Chunyu Ai Jinbao Li Meirui Ren Keqin Li School of Computer Science and Technology Heilongjiang University Harbin Heilongjiang China Key Laboratory of Database and Parallel Computing Harbin Heilongjiang China Department of Computer Science University of South Carolina Upstate Spartanburg SC USA Department of Computer Science State University of New York New Paltz New York USA

The Graph Isomorphism (GI) problem has been extensively studied due to its significant applications. The most effective class of GI algorithms, i.e., canonical labeling algorithms, are suitable for either graphs with high randomness or symmetry, or graphs for which both of them are not strongly held. Also, the core operations of canonical labeling algorithms, i.e., individuation-refinement (IR) and certificate comparison, usually occupy more than 70% of the total running time. How to weaken the limitations of structures and improve the efficiency of these operations are challenges. In this paper, we present an efficient GI algorithm called PEACE, which is particularly suitable for graphs with high randomness or symmetry. We present a parallel implementation of our algorithm on GPUs. We design some new techniques and also use some existing methods to speed up calculations under CUDA. More importantly, these techniques can be applied to all IR-based GI algorithms. We evaluate the proposed algorithm on various graphs to make comprehensive comparison with currently the most efficient canonical labeling algorithms on CPUs. Experimental results show that PEACE is superior to other algorithms on graphs with high symmetry or many automorphisms, and up to 50% performance increase can be achieved in the best case. We also apply our parallel techniques on these algorithms, and compare the performance on CPU and multiple GPU devices. The results show that the techniques make all algorithms gain 15~55 speedup.

关键词： Labeling Graphics processing units Algorithm design and analysis Indexes Arrays Instruction sets Partitioning algorithms

来源：评论

学校读者我要写书评

暂无评论

Wireless nerve: Invisible anti-theft system in wireless sensor network

Wireless nerve: Invisible anti-theft system in wireless sens...

引用

Int. Workshops on Web-Age Information Management, WAIM 2012: 1st Int. Workshop on GDMM 2012, 2nd Int. Wireless Sensor Networks Workshop, IWSN 2012, 1st Int. Workshop on MDSP 2012, 3rd Int. Workshop on USDM 2012, 4th Int. Workshop on XMLDM 2012

作者： Ren, Meirui Duan, Jinsheng Qu, Hao Wang, Xinjing Du, Lei School of Computer Science and Technology Heilongjiang University Harbin 150080 China Key Laboratory of Database and Parallel Computing Heilongjiang Province Harbin 150001 China

ISBN: (纸本)9783642330490

There are limitations in traditional anti-theft technologies. All the technologies have a common ground, that is: the devices are exposed to the space, also easily found and destroyed by adversaries. This paper demonstrates a Hidden Anti-theft System (HAS) based on wireless sensor network. The sensors can be hidden in walls and furniture. The intruder influences the transmitting link, and RSSI of the link changes. According to the changes, the system can detect the movement in monitoring area, and keep invisible. HAS is applied in indoor environment. Experiment results show that HAS achieves very low false positive and no false negative. © 2012 Springer-Verlag.

关键词： Wireless sensor networks

来源：评论

学校读者我要写书评

暂无评论

An effective top-k keyword search algorithm based on classified Steiner tree

An effective top-k keyword search algorithm based on classif...

引用

作者： Yang, Yan Tang, Mingzhu Zhong, Yingli Zhang, Zhaogong Guo, Longjiang School of Computer Science and Technology Heilongjiang University 150080 Harbin China Key Laboratory of Database and Parallel Computing Heilongjiang University 150080 Harbin China

ISBN: (纸本)9783642330490

keyword search has become one of hot topics in the field of information retrieval. It can provide users a simple and friendly interface. But the efficiency of some existing keyword search algorithms is low and there are some draws in sorting results. Most algorithms are suited for either unstructured data or structured data. This paper proposes a new kind of top-k keyword search algorithm. No matter the data is unstructured, semi-structured or structured, the algorithm is always effective. It introduces the concept of neighbor sets of nodes and uses set join algorithm to narrow the search space. We also propose the definition of classified Steiner tree, which can reduce the draw phenomenon in results. In addition, the algorithms can output the results of the classified Steiner tree at the same time of computing them. So it can reduce the waiting time of the users and improve the efficiency of keywords search. © 2012 Springer-Verlag.

关键词： Efficiency

来源：评论

学校读者我要写书评

暂无评论

Data query based on data cache and channel switch in multi-radio wireless sensor networks

引用

Jisuanji Xuebao/Chinese Journal of Computers 2012年第11期35卷 2403-2414页

作者： Zhang, Yan-Qing Li, Jin-Bao Guo, Long-Jiang Zhu, Jing-Hua School of Computer Science and Technology Heilongjiang University Harbin 150080 China Key Laboratory of Database and Parallel Computing of Heilongjiang Province Harbin 150080 China

Due to the nature of multi-radio multi-channel wireless sensor networks, such as the quality of service of the links, channel conflict etc., we investigated the problem of data query based on data cache and channel switch, and proved it to be an NP-complete problem. Firstly, we constructed a LP equation based on the data flow conservation and link-channel constraint etc. to formulate the problem, then designed a polynomial approximate algorithm. The algorithm used dynamic programming strategy to minimize the delay of unit data packet transmission from cache nodes to the query node, greedily chose a cache node with the smallest delay of unit data packet transmission, and collected the new covered data packets. Theoretical analysis and experimental results indicate that the proposed algorithm can reduce the communicate delay and improve the efficiency of query effectively.

关键词： Dynamic programming

来源：评论

学校读者我要写书评

暂无评论

MPMC: An algorithm for data aggregation scheduling in multi-channel and multi-power wireless sensor networks

引用

Jisuanji Yanjiu yu Fazhan/Computer Research and Development 2012年第7期49卷 1568-1578页

作者： Fan, Wenbin Guo, Longjiang Li, Jinbao Ren, Meirui School of Computer Science and Technology Heilongjiang University Harbin 150080 China Key Laboratory of Database and Parallel Computing Heilongjiang Province Harbin 150001 China

Data aggregation is a fundamental and yet time-consuming task in WSNs, especially in high-density WSNs. Therefore, people have focused on the problem of minimum-latency data aggregation. The problem has been already proved that it is an NP-hard. This paper proposes a cluster-based data aggregation scheduling algorithm called MPMC in multi-channel and multi-power WSNs to minimize the data aggregation latency. The paper adopts the idea of that the low power is used for packet transmission in inner-cluster and high power is used for packet transmission between clusters. This paper analyzes the number of channel under different topologies that approaches a constant. In simulation experiments, MPMC compares with the best algorithm based on single channel and the best algorithm based on multi-channel. Simulation results show that the MPMC algorithm proposed in this paper achieves the minimum average latency.

关键词： Wireless sensor networks

来源：评论

学校读者我要写书评

暂无评论

Distributed multi-dimensional probabilistic Top-k query processing in sensor networks

引用

Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition) 2012年第SUPPL.1期40卷 389-393+397页

作者： Zhu, Jinghua Guan, Xuemin Department of Computer Science and Technology Heilongjiang University Harbin 150080 China Key Laboratory of Database and Parallel Computing Heilongjiang Province Harbin 150080 China

A distributed multi-dimensional probabilistic Top-k (DMPT) query processing algorithm was proposed for sensor networks. DMPT exploited Skyline operator to get the Top-k results and reduced data transmissions and query delay through feedback and filtering mechanism. Data uncertainty, multi-dimensional attributes, distributed network and energy limitation were considered in DMPT, and the Top-k results were attained by computing Skylayers of data. Experiments on real and simulated data are conducted to demonstrate that DMPT has better energy efficiency and faster response compared with traditional algorithm.

关键词： Sensor networks

来源：评论

学校读者我要写书评

暂无评论

Implementing the jacobi algorithm for solving eigenvalues of symmetric matrices with CUDA

Implementing the jacobi algorithm for solving eigenvalues of...

引用

2012 IEEE 7th International Conference on Networking, Architecture and Storage, NAS 2012

作者： Wang, Tao Guo, Longjiang Li, Guilin Li, Jinbao Wang, Renda Ren, Meirui He, Jing School of Computer Science and Technology Heilongjiang University Harbin 150080 China Key Laboratory of Database and Parallel Computing of Heilongjiang Province Harbin 150080 China Software School of Xiamen University Xiamen Fujian 361005 China Department of Computer Science Kennesaw State University Kennesaw GA 30144 United States

ISBN: (纸本)9780769547220

Solving the eigenvalues of matrices is an open problem which is often related to scientific computation. With the increasing of the order of matrices, traditional sequential algorithms are unable to meet the needs for the calculation time. Although people can use cluster systems in a short time to solve the eigenvalues of large-scale matrices, it will bring an increase in equipment costs and power consumption. This paper proposes a parallel algorithm named Jacobi on gpu which is implemented by CUDA (Computer Unified Device Architecture) on GPU (Graphic Process Unit) to solve the eigenvalues of symmetric matrices. In our experimental environment, we have Intel Core i5-760 quad-core CPU, NVIDIA GeForce GTX460 card, and Win7 64-bit operating system. When the size of matrix is 10240×10240, the number of iterations is 10000 times, the speedup ratio is 13.71. As the size of matrices increase, the speedup ratio increases correspondingly. Moreover, as the number of iterations increases, the speedup ratio is very stable. When the size of matrix is 8192×8192, the number of iterations are 1000, 2000, 4000, 8000 and 16000 respectively, the standard deviation of the speedup ratio is 0.1161. The experimental results show that the Jacobi on gpu algorithm can save more running time than traditional sequential algorithms and the speedup ratio is 3.02∼13.71. Therefore, the computing time of traditional sequential algorithms to solve the eigenvalues of matrices is reduced significantly. © 2012 IEEE.

关键词： Iterative methods

来源：评论

学校读者我要写书评

暂无评论

Improvement on simulating TCP connection establishment in NS2

引用

Tongxin Xuebao/Journal on Communications 2012年第SUPPL.2期33卷 15-19页

作者： Jiang, Yu Ren, Jian Zhou, Li-Ming School of Computer Science and Technology Heilongjiang University Harbin 150080 China Key Laboratory of Database and Parallel Computing of Heilongjiang Province Harbin 150080 China School of Information Science and Technology Heilongjiang University Harbin 150080 China

In the procedure of three-way handshake of transmission control protocol connection establishment, it involves the management of half-connection and connection tables. However, in the famous networks simulator NS2, there is only a formal description instead of a concrete and complete implementation for the procedure of TCP connection establishment. An improvement was beed made on the point. Table structures for half-connections and connections are added, and the way of managing half-connection table in Linux kernel is also imported into NS2. Simulation results show that it is clear to monitor the change in half-connection table in the TCP connection establishment procedure, and thus the requirement of simulating the connection establishment procedure in studies such as protection from TCP SYN flooding attack can be met.

关键词： Transmission control protocol

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：