Activity instance oriented handling is a new means for vertical optimization of process cases. Unlike our previous batch processing mechanism in workflows, it focuses on the data characteristics of activity instances ...
详细信息
Automatic analysis of sentiments expressed in large scale online reviews is very important for intelligent business applications. Sentiment classification is the most popular task of sentiment analysis, which is more ...
详细信息
There are hundreds or thousands of web data sources providing data of relevance to a particular domain on the Web, so how to find a suitable set of sources quickly to integrate from a number of sources is becoming mor...
详细信息
Domain terms play a crucial role in many research areas, which has led to a rise in demand for automatic domain terms extraction. In this paper, we present a two-level evaluation approach based on term hood and unit h...
详细信息
Domain terms play a crucial role in many research areas, which has led to a rise in demand for automatic domain terms extraction. In this paper, we present a two-level evaluation approach based on term hood and unit hood to extract Chinese domain compound terms automatically, which takes the character-level and word-level information into account. To achieve this, we incorporate semantic features by using the word segmentation to recognize single word terms, then leverage the improved C-value and heuristic methods such as word formation pattern and word formation power to evaluate candidates at both levels. By validating our approach with several existing dictionaries, a significant improvement of compound terms detection is achieved. Experiments in legal corpus show our method is superior over other compared methods.
In hospitals, there are usually many legacy equipments and software systems, which may not provide standard interface for integration or even do not have its own API for outside accessing. This makes it very hard to i...
详细信息
On the internet, all-round lawyer information is located at separated information sources, which prevent web users from effective information acquisition. In order to build a unified view of separated, heterogeneous, ...
详细信息
On the internet, all-round lawyer information is located at separated information sources, which prevent web users from effective information acquisition. In order to build a unified view of separated, heterogeneous, and often redundant lawyer information, we propose a new information integration method using multi-source information cross-validation. Based on the unified integrated data, a lawyer recommendation system is built. Several key technologies are presented and evaluated, including the multi-source information acquisition and validation. Experimental results indicate the key techniques used in the system are effective for lawyer information integration and recommendation.
Big data analysis is a main challenge we meet recently. Cloud computing is attracting more and more big data analysis applications, due to its well scalability and fault-tolerance. Some aggregation functions, like SUM...
详细信息
Big data analysis is a main challenge we meet recently. Cloud computing is attracting more and more big data analysis applications, due to its well scalability and fault-tolerance. Some aggregation functions, like SUM, can be computed in parallel, because they satisfy distributive law of addition. Unfortunately, some of statistical functions are not naturally parallelizable. That means they do not satisfy distributive law of addition. In this paper, we focus on percentile computing problem. We proposed an iterative-style prediction-based parallel algorithm in a distributed system. Prediction is done through a sampling technique. Experiment results verify the efficiency of our algorithm.
Recommender systems have been accepted as a vital application on the web by offering product advice or information that users might be interested in. Despite its success, similarity-based collaborative filtering suffe...
详细信息
More and more literature has been written on electronic records management (ERM) in e-government in recent three years, however, much is limited to experiences of one country with more focus on electronic records mana...
详细信息
More and more literature has been written on electronic records management (ERM) in e-government in recent three years, however, much is limited to experiences of one country with more focus on electronic records management systems (ERMS). This paper aims to investigate current trends and future directions of ERM including ERMS in e-government with more focus on comprehensive approaches internationally. Representative shows current ERMS have limitations either from IT perspectives or records and archives management perspectives managing records as data. Future research is called for more concerns of e-government and more involvement in efficient civil service. Case studies of representative policies, regulations, best practices guidelines from Australia, Canada, New Zealand, UK and U.S show challenges of ERM in e-government are getting more and more recognized in information risk, data governance and efficient civil service. Current trends of ERM in e-government are toward multidisciplinary and collaborative approaches to ERM managing records as information resources and business asset. Findings indicate that future directions of ERM in e-government would be toward meta-synthesis management at both organizational and national level, with integration concerns of collaboration, optimization and innovation of both ERM and e-government for civil service. The findings may have implications to be of use to both ERM and e-government professions internationally.
The logical difference is important to ontology engineers in capturing and understanding the difference between different versions of given ontology. For acyclic EL terminologies, in which the well applied medical ont...
详细信息
The logical difference is important to ontology engineers in capturing and understanding the difference between different versions of given ontology. For acyclic EL terminologies, in which the well applied medical ontology SNOMED CT is represented, there are two methods proposed in computing the logical difference between terminologies: direct computation method and uniform interpolant method. We argue that the later method outperforms the former one in showing the dependency between entailments in the logical difference through the introduction of concept difference. The resulting logical difference conveys more information to ontology engineers than direct computation method.
暂无评论