this book - in conjunction withthe double volume LNCS 9225-9226 - constitutes the refereed proceedings of the 11thinternationalconference on Intelligent Computing, ICIC 2015, held in Fuzhou, China, in August *** 84...
ISBN:
(数字)9783319220536
ISBN:
(纸本)9783319220529;9783319220536
this book - in conjunction withthe double volume LNCS 9225-9226 - constitutes the refereed proceedings of the 11thinternationalconference on Intelligent Computing, ICIC 2015, held in Fuzhou, China, in August *** 84 papers of this volume were carefully reviewed and selected from 671 submissions. Original contributions related to this theme were especially solicited, including theories, methodologies, and applications in science and technology. this year, the conference concentrated mainly on machine learning theory and methods, soft computing, image processing and computer vision, knowledge discovery and data mining, natural languageprocessing and computational linguistics, intelligent control and automation, intelligent communication networks and web applications, bioinformatics theory and methods, healthcare and medical methods, and information security.
Currently, very large data have been transferred from everywhere through World Wide web. Consequently, the information extraction systems have been arising and many researches have been focusing on those data for util...
详细信息
Currently, very large data have been transferred from everywhere through World Wide web. Consequently, the information extraction systems have been arising and many researches have been focusing on those data for utilizing them. these systems are very useful for data pre-processing and cleaning for real-time applications. Moreover, these systems can make other analyzing systems to analyze the data in real time such as social network mining, web mining, data mining, or even special tasks such as false advertisement detection, demand forecasting, and comment extraction on product and service reviews. In this paper, we focus on extracting the content data of web pages in e-commerce web sites based on subject detection and node density. In the experimental results, it can signify that our proposed method is appropriated to extract the data rich region in data-intensive pages in an automatic fashion.
In this paper, we propose a series of applications that represent a system family for processing Research Data. the whole system is a model of complete data flow of "research", from data capturing, processin...
详细信息
In this paper, we propose a series of applications that represent a system family for processing Research Data. the whole system is a model of complete data flow of "research", from data capturing, processing, until intelligent data analysis. Data engineering, Software Engineering, Domain Engineering and Ontology Engineering are used for the development of the system. We also described tools and technologies to be used in building the system. the prototype of a small scale system has been built as a proof of concept of this family of system, for the domain of "Research" in Indonesian context. the aim of building this prototype are: (a) to provide an architectural description prototype of a knowledge repository where Indonesian Context Research as a central domain (b) to comprehend a software engineering for software system family.
Explosively increased Big Data and very fast technical evolutions require an entirely new analytics that is able to precisely analyze researchers' activities until now and to provide research directions from now o...
详细信息
ISBN:
(纸本)9781479960507
Explosively increased Big Data and very fast technical evolutions require an entirely new analytics that is able to precisely analyze researchers' activities until now and to provide research directions from now on. Prescriptive analytics shows fundamental difference with descriptive/predictive analytics in that it should provide multiple strategies to achieve a given research direction. Complex event processing also shows a new way to read implicit intentions from many kinds of activities such as publishing article, travelling on business, and attending conference. thus, this talk shows a case study by explaining requirements and factors for implementing a personalized research service with InSciTe Advisory, as a data-intensive intelligent service, for helping to find plausible research directions. this talk also covers data gathering, information extraction from entities to simple events, reasoning, and Hadoop ecosystem.
Word Sense Disambiguation (WSD) has become a popular method for solving the ambiguous meaning of the words in information Retrieval (IR) field area. Under the Natural languageprocessing (NLP) community, WSD has been ...
详细信息
Word Sense Disambiguation (WSD) has become a popular method for solving the ambiguous meaning of the words in information Retrieval (IR) field area. Under the Natural languageprocessing (NLP) community, WSD has been described as the task which able to select the appropriate meaning among the ambiguous meanings to a given word. Among three approaches, supervised based, unsupervised based and knowledge based approaches to WSD, this paper focuses on both supervised based and knowledge based approaches by proposing new Jaccard coefficient-based WSD algorithm to overcome the vocabulary miss match problem. WordNet and corpus external knowledge resources are utilized as the sense repositories by linking up withthe new WSD algorithm to consider additional semantic for WSD. According to sample testing, IR system with new WSD algorithm attains more about 20 percent of total accuracy rate than traditional IR system.
Understanding lexical characteristics of clinical documents is the foundation of sublanguage based Medical languageprocessing (MLP) approach. However, there are limited studies focused on the lexical characters of Ch...
详细信息
Understanding lexical characteristics of clinical documents is the foundation of sublanguage based Medical languageprocessing (MLP) approach. However, there are limited studies focused on the lexical characters of Chinese clinical documents. In this study, a lexical characteristics analysis on both syntactic and semantic levels was conducted in a clinical corpus which contains 3,500 clinical documents generated during daily practices. the analysis was based on the automatic tagging results of a lexicon-based part-of-speech (POS) and semantic tagging method. the medical lexicon contains 237,291 entries annotated with both semantic and syntactic classes. the normalized frequency of different terms, syntactic and semantic classes was calculated and visualized. Major contribution of this paper is providing a wide-coverage Chinese medical semantic lexicon and presenting the lexical characteristics of Chinese clinical documents. Both of these will lay a good foundation for sublanguage based MLP studies in China.
there are many artificial intelligent applications. Some of them focus on the financial market. they often use a nature languageprocessing method, e.g., To predict stock prices. However, most of them are inaccurate. ...
详细信息
ISBN:
(纸本)9781479986477
there are many artificial intelligent applications. Some of them focus on the financial market. they often use a nature languageprocessing method, e.g., To predict stock prices. However, most of them are inaccurate. there are two reasons. For one thing, computer programs are more effective in the syntax analysis than semantic analysis. For another, accurately predicting stock prices is beyond our knowledge and ability today. However, there are many valuable experiences in existing studies. therefore, we propose a unified view and procedure to facilitate using these experiences. this procedure is based on the common knowledge, which is primarily expressed as keywords in this paper. It first recognizes name entities and then learns rules withthe common knowledge and last inferences crucial features. these features, with other quantitative features in the stock market, may make the prediction more accurate. As a result, this view and process can be a framework for many (but not all) nature languageprocessing applications in stock predicting.
Withthe amount of textual data available on the web, new methodologies of knowledge extraction domain are provided. Some original methods allow the users to combine different types of data in order to extract relevan...
详细信息
ISBN:
(纸本)9781509019670
Withthe amount of textual data available on the web, new methodologies of knowledge extraction domain are provided. Some original methods allow the users to combine different types of data in order to extract relevant information. In this context, we present the cornerstone of manipulations on textual documents and their preparation for extracting compatible spatial information withthose contained in satellite images. the term footprint is defined and its extraction is performed. In this paper, we describe the general process and some experiments conducted in the ANIMITEX project, which aims to match the information coming from texts withthose of satellite images.
Exploiting linguistic features is necessary on sentiment analysis in natural languageprocessing. this paper proposes a novel approach on exploiting linguistic features and SVMperf based semantic classification. the i...
详细信息
Exploiting linguistic features is necessary on sentiment analysis in natural languageprocessing. this paper proposes a novel approach on exploiting linguistic features and SVMperf based semantic classification. the innovation is that it uses the dependency relationship to do the linguistic feature extraction. In order to reduce the computational complexity, this paper uses the X2 (chi-square) and Pointwise Mutual information (PMI) metrics for feature selection. Furthermore, as for the approach on sentiment analysis, this paper uses the SVMperf based algorithm to do the alternative structural formulation of the SVM optimization problem for classification. this paper uses two different corpuses (i.e., microblogging and e-commerce data set) to evaluate the performance. Experiment results show the feasible of the approach. Existing problems and further works are also present in the end.
We can feel free to post the information such as personal events using Twitter one of the popular micro-blogging service. However, the collection of information is limited by the human power only, therefore, the metho...
详细信息
ISBN:
(纸本)9781479999590
We can feel free to post the information such as personal events using Twitter one of the popular micro-blogging service. However, the collection of information is limited by the human power only, therefore, the method of collecting trends automatically is important. Existing web services focus on the number of tweets for getting trends. However, a time lag was occurred for extracting the trends. In this paper, we propose the trend extraction method for twitter in real time by paying attention to the co-occurrence patterns. Our system can learn the new key patterns at the same time not only using the picked up trend biterms, previously. Furthermore, we evaluate the efficiency of the proposed method of extracting the trends from twitter by the comparative experiments. We demonstrate that our proposed method can extract accurately and widely without time-lags compared withthe existing service (Real time Yahoo Search).
暂无评论