检索结果-内蒙古大学图书馆

IEEE International Conference on Progress in Informatics and Computing (PIC)

作者： Sun, Xin Wang, Pengwei Lei, Yinghui Liu, Wenqiang Yang, Lijun Zhang, Zhaohui Donghua Univ Sch Comp Sci & Technol Shanghai Peoples R China

ISBN: (纸本)9781538676721

With the rapid development and popularization, Internet is becoming the most convenient way to publish and obtain information, which causes an extremely increasing quantity and variety of data. It is difficult to find out potentially valuable information from these data, which is the primary problem of data mining. Mining company hot events from Internet news can effectively reflect how its business works. Thus, we propose a method for discovering and obtaining hot events from Internet news. In the proposed method, we use Gaussian kernel to update clustering center instead of global cluster to modify single-pass clustering algorithm. It is a dynamic incremental clustering algorithm which does not need to initialize the number of clusters. Then, Top-N hot events can be obtained through the clustering centers. Experimental comparison shows that the improved algorithm has higher clustering efficiency than the classic algorithm. Case studies from Shanghai pilot free-trade zone (FTZ) also show the effectiveness of our proposed method.

关键词： data mining company hot events single-pass algorithm clustering center

来源：评论

学校读者我要写书评

暂无评论

PRACTICAL SKETCHING algorithmS FOR LOW-RANK MATRIX APPROXIMATION

引用

SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS 2017年第4期38卷 1454-1485页

作者： Tropp, Joel A. Yurtsever, Alp Udell, Madeleine Cevher, Volkan CALTECH Comp & Math Sci Pasadena CA 91125 USA Ecole Polytech Fed Lausanne CH-1015 Lausanne Switzerland Cornell Univ Ithaca NY 14853 USA

This paper describes a suite of algorithms for constructing low-rank approximations of an input matrix from a random linear image, or sketch, of the matrix. These methods can preserve structural properties of the input matrix, such as positive-semidefiniteness, and they can produce approximations with a user-specified rank. The algorithms are simple, accurate, numerically stable, and provably correct. Moreover, each method is accompanied by an informative error bound that allows users to select parameters a priori to achieve a given approximation quality. These claims are supported by numerical experiments with real and synthetic data.

关键词： dimension reduction matrix approximation numerical linear algebra randomized algorithm single-pass algorithm sketching streaming algorithm subspace embedding

来源：评论

学校读者我要写书评

暂无评论

Demons Kernel Computation with single-pass Stream Processing on FPGA

Demons Kernel Computation with Single-pass Stream Processing...

引用

14th IEEE International Conference on High Performance Computing and Communications (HPCC) / IEEE 9th International Conference on Embedded Software and Systems (ICESS)

作者： Chiew, Wei Ming Lin, Feng Qian, Kemao Seah, Hock Soon Nanyang Technol Univ Sch Comp Engn Singapore Singapore

ISBN: (纸本)9780769547497

Non-rigid registration is crucial in imaging, in particular, to adjust deformities produced during image acquisition and improve the accuracy of datasets. However, conventional imaging systems lack the desired speed and computational bandwidth for additional non-rigid registration of the deformed images. Therefore, such functionality is usually unavailable in time-critical settings. Expensive computations and memory intensive characteristics of non-rigid image registration algorithms such as the Demons algorithm further limits the realization of such systems. In response, we propose an alternative and efficient custom hardware-based Demons registration algorithm which utilizes pipelined streaming models to minimize memory fetches for computation. Designed for highly customizable hardware, our design only requires single-pass of images to compute the Demons kernel. Implementation results on the Xilinx ML605 FPGA system is presented and quantitatively evaluated in clock cycle counts in contrast with a software-based implementation.

关键词： non-rigid image registration Demons kernel single-pass algorithm FPGA

来源：评论

学校读者我要写书评

暂无评论

DSM-FI: an efficient algorithm for mining frequent itemsets in data streams

引用

KNOWLEDGE AND INFORMATION SYSTEMS 2008年第1期17卷 79-97页

作者： Li, Hua-Fu Shan, Man-Kwan Lee, Suh-Yin Kainan Univ Dept Comp Sci Tao Yuan Taiwan Natl Chengchi Univ Dept Comp Sci Taipei 11623 Taiwan Natl Chiao Tung Univ Dept Comp Sci Hsinchu Taiwan

Online mining of data streams is an important data mining problem with broad applications. However, it is also a difficult problem since the streaming data possess some inherent characteristics. In this paper, we propose a new single-pass algorithm, called DSM-FI (data stream mining for frequent itemsets), for online incremental mining of frequent itemsets over a continuous stream of online transactions. According to the proposed algorithm, each transaction of the stream is projected into a set of sub-transactions, and these sub-transactions are inserted into a new in-memory summary data structure, called SFI-forest (summary frequent itemset forest) for maintaining the set of all frequent itemsets embedded in the transaction data stream generated so far. Finally, the set of all frequent itemsets is determined from the current SFI-forest. Theoretical analysis and experimental studies show that the proposed DSM-FI algorithm uses stable memory, makes only one pass over an online transactional data stream, and outperforms the existing algorithms of one-pass mining of frequent itemsets.

关键词： Data mining Data streams Frequent itemsets single-pass algorithm Landmark window

来源：评论

学校读者我要写书评

暂无评论

Mining top-k Hot Melody Structures over online music query streams

引用

PATTERN RECOGNITION LETTERS 2008年第16期29卷 2116-2121页

作者： Li, Hua-Fu Kainan Univ Dept Comp Sci Tao Yuan 338 Taiwan

Online mining of frequent patterns from music data is one of the most important research issues of multimedia data mining. Most previous studies require the specification of a min_support threshold and aim at mining a complete set of frequent patterns satisfying min_support. However. in practice, it is difficult for users to provide an appropriate value of min_support threshold. In this paper, we propose a new problem of multimedia data mining: online mining of top-k melody structures of length no less than min_1, where k is the desired number of hot melody structures to be mined and min_1 is the minimal length of each melody structure. An efficient single-pass algorithm, called top-k-HMS (top-k Hot Melody Structures) is developed for mining such melody structures Without min_support. In the framework of top-k-HMS algorithm, a new summary data structure, called TKM-list (top-k melody list) is developed to maintain the essential information about the top-k hot melody structures from the Current melody sequence streams. Experimental Studies show that the proposed top-k-HMS algorithm is an efficient one-pass method for mining the set of top-k Hot Melody Structures over a continuous stream of melody sequences. (C) 2008 Elsevier B.V. All rights reserved.

关键词： Data mining Multimedia data mining Music query streams Top-k Hot Melody Structures single-pass algorithm

来源：评论

学校读者我要写书评

暂无评论

DSM-PLW: single-pass mining of path traversal patterns over streaming Web click-sequences

引用

COMPUTER NETWORKS 2006年第10期50卷 1474-1487页

作者： Li, Hua-Fu Lee, Suh-Yin Shan, Man-Kwan Natl Chiao Tung Univ Dept Comp Sci & Informat Engn Hsinchu 300 Taiwan Natl Chengchi Univ Dept Comp Sci Taipei 116 Taiwan

Mining Web click streams is an important data mining problem with broad applications. However, it is also a difficult problem since the streaming data possess some interesting characteristics, such as unknown or unbounded length, possibly a very fast arrival rate, inability to backtrack over previously arrived click-sequences, and a lack of system control over the order in which the data arrive. In this paper, we propose a projection-based, single-pass algorithm, called DSM-PLW (Data Stream Mining for Path traversal patterns in a Landmark Window), for online incremental mining of path traversal patterns over a continuous stream of maximal forward references generated at a rapid rate. According to the algorithm, each maximal forward reference of the stream is projected into a set of reference-suffix maximal forward references, and these reference-suffix maximal forward references are inserted into a new in-memory summary data structure, called SP-forest (Summary Path traversal pattern forest), which is an extended prefix tree-based data structure for storing essential information about frequent reference sequences of the stream so far. The set of all maximal reference sequences is determined from the SP-forest by a depth-first-search mechanism, called MRS-mining (Maximal Reference Sequence mining). Theoretical analysis and experimental studies show that the proposed algorithm has gently growing memory requirements and makes only one pass over the streaming data. (c) 2005 Elsevier B.V. All rights reserved.

关键词： web click-sequence streams path traversal patterns single-pass algorithm

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：