检索结果-内蒙古大学图书馆

International Conference on Fuzzy Systems and knowledge Discovery (FSKD)

作者： Xiongpai Qin Huiju Wang Furong Li Jidong Chen Xuan Zhou Xiaoyong Du Shan Wang Key Laboratory of Data Engineering and Knowledge Engineering (RUC) Ministry of Education Beijing China School of Information Renmin University of China Beijing China EMC-Greenplum Research China China

In algorithm trading, computer algorithms are used to make the decision on the time, quantity, and direction of operations (buy, sell, or hold) automatically. To create a useful algorithm, the parameters of the algorithm should be optimized based on historical data. However, Parameter optimization is a time consuming task, due to the large search space. We propose to search the parameter combination space using the MapReduce framework, with the expectation that runtime of optimization be cut down by leveraging the parallel processing capability of MapReduce. This paper presents the details of our method and some experiment results to demonstrate its efficiency. We also show that a rule based strategy after being optimized performs better in terms of stability than the one whose parameters are arbitrarily preset, while making a comparable profit.

关键词： Optimization Measurement Computers Algorithm design and analysis Runtime Servers data processing

来源：评论

学校读者我要写书评

暂无评论

Single attribute Join Queries within latest sampling periods in sensor networks

Single attribute Join Queries within latest sampling periods...

引用

IEEE Symposium on Computers and Communications (ISCC)

作者： Shangfeng Mo Hong Chen Yinglong Li School of Math and Computing Science Hunan University of Science and Technology Xiangtan China Key Laboratory of Data Engineering and Knowledge Engineering of MOE School of Information Renmin University of China Beijing China

Join processing in wireless sensor networks is a challenging problem. Current solutions are not involved in the join operation among tuples of the latest sampling periods. In this article, we proposed a continuous Single attribute Join Queries within latest sampling Periods (SJQP) for wireless sensor networks. The main idea of our filter-based framework is to discard non-matching tuples, and our scheme can guarantee the result is correct independent of the filters. Experiments based on real-world sensor data show that our method performs close to a theoretical optimum and consistently outperforms the centralized join algorithm.

关键词： Base stations Wireless sensor networks Probes Filtering algorithms Temperature sensors Information filters

来源：评论

学校读者我要写书评

暂无评论

引用

Journal of Convergence Information Technology 2012年第16期7卷 87-96页

作者： Minjuan, Zhong School of Information Technology Jiangxi University of Finance and Economics Nanchang 330013 China Jiangxi Key Laboratory of Data and Knowledge Engineering Jiangxi University of Finance and Economics Nanchang 330013 China

With the increasing of XML data over the Internet, managing and analyzing huge amount of XML documents has played an important role for information management. Clustering as an intelligent technique has been utilized as an excellent way of grouping the documents by their content or structure. However, the key problem is how to measure similarity between XML documents. In this paper, we propose an extended vector space model and on this basis put forward an effective semantic similarity measurement method combining content and structure semantics, in which a variety of XML document features impacting similarity measurement, such as term element frequency, term inverse element frequency, semantic weight of tag and level information of the term, are analyzed. In addition, information gain, for clustering quality evaluation are introduced motivated by the fact that collection has no classification information in advance. Experiment results show that proposed similarity method (EVSM_SS) outperforms the content and structure integration measurement based on structure path (VSM_SP) as well as traditional document clustering measurement (CO) in information gain and produce better clustering quality.

关键词： XML

来源：评论

学校读者我要写书评

暂无评论

Combining term semantics with content and structure semantics for XML element search results clustering

引用

Journal of Convergence Information Technology 2012年第15期7卷 26-35页

The biggest characteristic of the XML retrieval is able to return the element node results. This paper studies XML element search results clustering and proposes one similarity measurement method based on term semantics, in which the "core" concept between terms is got through latent semantic indexing technology(LSI) and the same time the XML element node content and semantic structure properties(CASS) are combined. In addition, two new performance evaluation methodologies, namely R_ClusterRatio and R_DocuRatio are introduced to evaluate clustering quality. It is motivated by the observations of relevant documents distribution and the fact that the experiment data collection, IEEE CS corpus, do not provide classification information. Experiment results show that proposed similarity method combining term semantics with content and structure semantics integration(LSI-CASS) is feasible, and it produces better clustering quality than LSI-CAS and CASS.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Reducing Uncertainty of Low-Sampling-Rate Trajectories

Reducing Uncertainty of Low-Sampling-Rate Trajectories

引用

International Conference on data engineering

作者： Kai Zheng Yu Zheng Xing Xie Xiaofang Zhou School of Information Technology and Electrical Engineering University of Queensland Brisbane Australia Microsoft Research Asia Beijing China Key Laboratory of Data Engineering and Knowledge Engineering Ministry of Education China School of Information Renmin University of China China

ISBN: (纸本)9781467300421

The increasing availability of GPS-embedded mobile devices has given rise to a new spectrum of location-based services, which have accumulated a huge collection of location trajectories. In practice, a large portion of these trajectories are of low-sampling-rate. For instance, the time interval between consecutive GPS points of some trajectories can be several minutes or even hours. With such a low sampling rate, most details of their movement are lost, which makes them difficult to process effectively. In this work, we investigate how to reduce the uncertainty in such kind of trajectories. Specifically, given a low-sampling-rate trajectory, we aim to infer its possible routes. The methodology adopted in our work is to take full advantage of the rich information extracted from the historical trajectories. We propose a systematic solution, History based Route Inference System (HRIS), which covers a series of novel algorithms that can derive the travel pattern from historical data and incorporate it into the route inference process. To validate the effectiveness of the system, we apply our solution to the map-matching problem which is an important application scenario of this work, and conduct extensive experiments on a real taxi trajectory dataset. The experiment results demonstrate that HRIS can achieve higher accuracy than the existing map-matching algorithms for low-sampling-rate trajectories.

关键词： Trajectory Roads Global Positioning System Inference algorithms Uncertainty Artificial neural networks Educational institutions

来源：评论

学校读者我要写书评

暂无评论

Using a semantic knowledge base for communication service quality management in Home Area Networks

Using a semantic knowledge base for communication service qu...

引用

IEEE Symposium on Network Operations and Management

作者： Liam Fallon Declan O'Sullivan LM Ericsson Network Management Lab Athlone Co. Westmeath Ireland Knowledge & Data Engineering Group (KDEG) Trinity College Dublin Ireland

The data required for automatic optimization of user services usually exists in current systems, but that data is not modelled or linked in a way that facilitates automation. knowledge engineering is a promising approach for managing the disparate communication service quality management information data sets and the links across those data sets. Once a knowledge base is in place, semantic techniques can be used to analyse and suggest optimizations to service quality. This paper describes our work in building, populating and evaluating a knowledge base for an IPTV service in Home Area Networks. Population of the knowledge base was implemented using terminal reports. The characteristics of the approach were evaluated through experimentation and the evaluation results are presented in this paper.

关键词： Context Semantics knowledge based systems Ontologies XML Measurement Unified modeling language

来源：评论

学校读者我要写书评

暂无评论

A Framework of Multi-Stage Classifier for Identifying Criminal Law Sentences

引用

Procedia Computer Science 2012年 13卷 53-59页

作者： Sotarat Thammaboosadee Bunthit Watanapa Nipon Charoenkitkarn Data and Knowledge Engineering Laboratory (D-Lab) School of Information Technology King Mongkut's University of Technology Thonburi 126 Pracha Utit Rd. Bangmod Thungkru Bangkok 10140 Thailand

This paper proposes a framework to identify the relevant law articles consisting of sentences and range of punishments, given facts discovered in the criminal case of interest. The model is formulated as a two-stage classifier according to the concept of machine learning. The first stage is to determine a set of case diagnostic issues, using a modular Artificial Neural Network (mANN), and the second stage is to determine the relevant legal elements which lead to legal charges identification, using SVM-equipped C4.5. The integrated multi-stage model aims at achieving high accuracy of classification while reserving “arguability”. Hypothetically, mANN handles well for digesting complexity in case-level issues analysis with acceptable explanatory power and C4.5 addresses the lesser extent of contingency and provides human-interpretable logic concerning the high-level context of legal codes.

关键词： Criminal law data mining Neural network Decision tree Legal reasoning

来源：评论

学校读者我要写书评

暂无评论

Drivers for strategic choice of cloud computing as online service in SMEs

Drivers for strategic choice of cloud computing as online se...

引用

International Conference on Information Systems, ICIS 2012

作者： Li, Min Yu, Yan Zhao, J. Leon Li, Xin University of Science and Technology China-City University of Hong Kong Joint Advanced Research Center 166 Renai Road Dushu Lake Higher Education Town Suzhou China Key Laboratory of Data Engineering and Knowledge Engineering MOE Renmin University of China 59 Zhongguancun Street Haidian District Beijing China Department of Information Systems City University of Hong Kong Tat Chee Avenue Kowloon Hong Kong

ISBN: (纸本)9781627486040

Cloud Computing Service (CCS) paradigm is changing IT strategy of organizations in the digital world. CCS that requires few upfront investments and uses lease-based pricing is especially relevant to the Small and Medium Enterprises (SMEs), which have limited resources and may not know their true valuation for the IT prior to adoption. Thus, this research aims to investigate the influential factors of SMEs' strategic choice of CCS as online service. Relying upon Technology-Organization-Environment (TOE) paradigm, we identify both generic and context-specific factors from the three aspects and explain how the identified factors affect SMEs' CCS strategic choices. We hope this research can make contributions to innovation diffusion theory and IT strategy literature. We also hope the research with progress going on can generate insights for the CCS vendors who care about the sector of SME as well as the government administrators to make appropriate policies or supports for SMEs.

关键词： Cloud computing

来源：评论

学校读者我要写书评

暂无评论

Endless and Scalable knowledge Table Extraction from Semi-structured Websites

Endless and Scalable Knowledge Table Extraction from Semi-st...

引用

IEEE International Conference on data Mining Workshops (ICDM Workshops)

作者： Yingqin Gu Lei Ji Ziheng Jiang Jun He Key Labs of Data Engineering and Knowledge Engineering University of China Microsoft Research Asia Beijing P.R. China School of Computer Science Beijing Institute of Technology P. R. China

ISBN: (纸本)9781467351645

The problem of scalable knowledge extraction from the Web has attracted much attention in the past decade. However, it is under explored how to extract the structured knowledge from semi-structured Websites in a fully automatic and scalable way. In this work, we define the table-formatted structured data with clear schema as knowledge Tables and propose a scalable learning system, which is named as Kable to extract knowledge from semi-structured Websites automatically in a never ending and scalable way. Kable consists of two major components, which are auto wrapper induction and schema matching respectively. In contrast to the state of the art auto wrappers for semi-structured Web sites, our adopted approach can run around 1'000 times faster, which makes the Web scale knowledge extraction possible. On the other hand, we propose a novel schema matching solution which can work effectively on the auto-extracted structured data. With 3 months' continuous run using ten Web servers, we successfully extracted 427,105,009 knowledge facts. The manual labeling over sampled knowledge extracted show the up to 87% precision for supporting various Web applications.

关键词： data mining knowledge engineering Motion pictures Clustering algorithms knowledge based systems Algorithm design and analysis Manganese

来源：评论

学校读者我要写书评

暂无评论

Pattern mining, semantic label identification and movement prediction using mobile phone data

Pattern mining, semantic label identification and movement p...

引用

8th International Conference on Advanced data Mining and Applications, ADMA 2012

作者： Xie, Rong Luo, Jun Yue, Yang Li, Qingquan Zou, Xiaoqing International School of Software Wuhan University Wuhan 430079 China Shenzhen Institutes of Advanced Technology CAS Shenzhen 518055 China Shenzhen Key Laboratory of High Performance Data Mining Shenzhen 518055 China Shenzhen University Shenzhen China State Key Lab. of Information Engineering in Surveying Mapping and Remote Sensing Wuhan University Wuhan 430079 China Faculty of Land Resource Engineering Kunming University of Science and Technology Kunming China

ISBN: (纸本)9783642355264

data collected from mobile phones have potential knowledge to provide with important behavior patterns of individuals. In this paper, we present approaches to discovering personal mobility and characteristics based on mobile phone location information and semantic analysis. We discuss three aspects related to very common mobile phone-related applications such as pattern mining, semantic label identification and movement prediction. We use real mobile phone data to perform functions of discovering these behavior patterns and demonstrate effectiveness of our approaches. © Springer-Verlag 2012.

关键词： Cellular telephones

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：