This paper presents an investigation into the utility of document summarization in the context of information retrieval. The investigation explores the use of both context-independent standard summaries and query-bias...
详细信息
Particle swarm optimization (PSO) is a new stochastic population-based search methodology by simulating the animal social behaviors such as birds flocking and fish *** improvements have been proposed within the framew...
详细信息
Particle swarm optimization (PSO) is a new stochastic population-based search methodology by simulating the animal social behaviors such as birds flocking and fish *** improvements have been proposed within the framework of this biological assumption. However,in this paper,the search pattern of PSO is used to model the branch growth process of natural *** provides a different poten- tial manner from artificial *** illustrate the effectiveness of this new model,apical dominance phenomenon is introduced to construct a ncvel variant by emphasizing the influence of the *** this improvement,the population is divided into three different kinds of buds associated with their ***,a mutation strategy is applied to enhance the ability escaping from a local ***- ulation results demonstrate good performance of the new method when solving high-dimensional multi-modal problems.
With the fast development of World Wide Web, the quantity of web information is increasing in an unprecedented pace, a great many of which are generated dynamically from background databases, and can't be indexed ...
详细信息
With the fast development of World Wide Web, the quantity of web information is increasing in an unprecedented pace, a great many of which are generated dynamically from background databases, and can't be indexed by traditional search engine, so we call them Deep Web. For the heterogeneous and dynamic features of Deep Web sources, classifying the Deep Web source by domain effectively is a significant precondition of Deep Web sources integration. In this paper, we consider the visible features of Deep Web and Maximum Entropy approach, and then on the basis of binary classification, we propose a new multivariate classification approach based on Maximum Entropy towards Deep Web sources. In addition, we propose a Feedback algorithm to improve the accuracy of classification. An experimental evaluation over real Web data shows that, our approach could provide an effective and general solution to the multivariate classification of Deep Web sources.
Question classification is an important step in a question answering system, the accuracy rate of question classification has great impact to a question answering system's following module. This paper proposed a q...
详细信息
A new method for load forecasting based on LS-SVM, PSO and wavelet transform is proposed. The wavelet transform is adopted to decompose the historical data, so the approximate part and several detail parts are obtaine...
详细信息
A new method for load forecasting based on LS-SVM, PSO and wavelet transform is proposed. The wavelet transform is adopted to decompose the historical data, so the approximate part and several detail parts are obtained. The results of wavelet transform are predicted by a separate LS-SVM predictor. PSO is employed to determine these parameters of SVM model. The novel forecast model integrates the advantage of WT, PSO and LS-SVM. Compared with other predictors, this forecast model has greater generality ability and higher accuracy.
A novel self-adaption strategy for the parameter epsiv in epsiv-MOEA is proposed in this paper based on the analyses of the relationship between the value of epsiv and the maximum number of non-dominated solutions. Th...
详细信息
A novel self-adaption strategy for the parameter epsiv in epsiv-MOEA is proposed in this paper based on the analyses of the relationship between the value of epsiv and the maximum number of non-dominated solutions. Then this novel strategy is applied in epsiv-MOEA and tested on 10 common benchmark functions. The experimental results demonstrate that even if without the good initial value for the parameter s, epsiv-MOEA with this self-adaption strategy (named Algorithm 1) is able to approximately obtain the expected number of non-dominated solutions, which are very close to and uniformly distributed on the Pareto-optimal front. Furthermore, the genetic drift phenomenon in Algorithm 1 is discussed Two cases of genetic drift are pointed out, and one case can be fixed up by a simple approach proposed in this paper.
The next generation of the Web, called Semantic Web, has to improve the Web with semantic page annotations to enable knowledge-level querying and searches. However, manual construction of these ontologies is a time co...
详细信息
ISBN:
(纸本)9781424430536;9780769531519
The next generation of the Web, called Semantic Web, has to improve the Web with semantic page annotations to enable knowledge-level querying and searches. However, manual construction of these ontologies is a time consuming and difficult task. In this paper, we describe an automatic extraction method that learns domain ontologies for semantic web from deep web. Our approach first learns a base ontology from deep web query interfaces, then grows the current ontology by probing the sources and discovering additional concepts and instances from the result pages. We have evaluated our approach in several real-world domains. Preliminary results indicate that the proposed extraction method discovers concepts and instances with high accuracy.
An increasing number of databases have become Web accessible through HTML form-based search interfaces, which is so-called deep Web. For full utilization of deep Web resources and improving Web intelligence, which is ...
详细信息
An increasing number of databases have become Web accessible through HTML form-based search interfaces, which is so-called deep Web. For full utilization of deep Web resources and improving Web intelligence, which is essential for many applications such as deep Web data collection and comparison shopping, they need to be extracted out and assigned meaningful labels. In this paper, we present a synchronous-annotation approach that introduce domain ontology as a global schema ordered by Web databases to the annotation process. We combine ontology, interface schema and result schema and adopt the strategy of query ontology instance to implement annotation. In order to verify the effectiveness of the method proposed in this paper, we test on a number of different areas of Web databases. The experimental results indicate that the proposed approach is more effective than existing approaches.
A lot of high quality and wealthy data are hidden in backend database and search engines can not index this page, which is called Deep Web. It is mostly accessible through query interfaces. SDWS, a semantic search eng...
详细信息
ISBN:
(纸本)9781424430536;9780769531519
A lot of high quality and wealthy data are hidden in backend database and search engines can not index this page, which is called Deep Web. It is mostly accessible through query interfaces. SDWS, a semantic search engine for Deep Web is presented. We are studying and implementing semantic Web technology to the each process of Deep Web information integrated, and expertise in Deep Web discovering, annotating query results and integrating information. The novel approach promise better access to Deep Web.
Traditional text classification model uses statistical methods to obtain features. But in the aspect of discrimination domain and non-domain text category, domain knowledge relations haven't been taken account of ...
详细信息
Traditional text classification model uses statistical methods to obtain features. But in the aspect of discrimination domain and non-domain text category, domain knowledge relations haven't been taken account of in these methods. A domain text classification model was presented in this paper. This model used the support vector machine learning algorithm, gained domain classification feature words through statistic and union domain words, structured domain classification feature space. With the help of domain knowledge relations, computed relevance between domain concepts, got domain classification feature weight. Finally domain text classification was realized. An experiment in the Yunnan tourism domain was carried on to confirm that domain knowledge relations have a good influence on the domain text classification. The classification accuracy rate has been increased 0.04 than improved TFIDF method.
暂无评论