this paper presents a method for the valuation of discourses from different linguistic perspectives: lexical, syntactic and semantic. We describe a platform discourse analysis tool (DAT) which integrates a range of la...
详细信息
this paper presents UDRST, an unlabeled discourse parsing system in the RST framework. UDRST consists of a segmentation model and a parsing model. the segmentation model exploits subtree features to rerank N-best outp...
详细信息
Service discovery plays an important role in service composition. In order to achieve better performance of service discovery, often, service classification should be in place to group available services into differen...
详细信息
the advent of standards i.e. IEEE 11073 for device connectivity, Health Level 7 (HL7) etc. provide an assimilating platform for medical devices and seamless data flow among modern Healthinformation Systems (HIS). How...
详细信息
Today, the explosive amount of information on the web has become the major challenge for web searchers. Despite the improvements of search engine technologies, searching for specific information is sometimes still an ...
详细信息
Today, the explosive amount of information on the web has become the major challenge for web searchers. Despite the improvements of search engine technologies, searching for specific information is sometimes still an uneasy task. there are two major problems: first, the relatedness of information is not handled by search engines, and second, HTML is a presentation-oriented language, not an information-oriented one. the former issue results in inconvenience. Result list from most modern search engines contain standalone web pages, however, multiple web pages can be needed to solve a web user's request. On the other hand, the latter issue results in difficulties for processingthe returned data. thinking of the web as a huge set of databases, a more efficient approach for accessing data than the current search engine is needed. In this paper, a mechanism and implementation based on the author's previous work, H2X and VWBE, to address these issues is proposed. the result is a better framework for realizing the concept of web of data.
web service discovery is becoming a challenging and time consuming task due to large number of web services available on the Internet. Organizing the web services into functionally similar clusters is one of a very ef...
web service discovery is becoming a challenging and time consuming task due to large number of web services available on the Internet. Organizing the web services into functionally similar clusters is one of a very efficient approach for reducing the search space. To cluster web services, take out the web services description languages documents and extract the features (e.g., service name) to measure the similarities. Complex terms are used as web service features in some contexts. Current approaches do not consider about the hidden semantic pattern exists within the complex terms. We present an approach to cluster the web services into functionally similar web service clusters that mine web Service Description language (WSDL) documents and generate ontologies by using complex terms for the measuring purpose of similarity. We use both logic based reasoning and edge base similarity measuring techniques for calculating the similarity using generated ontology. Experimental results show our clustering approach with ontology learning, has better performance comparing with approaches which are not considering about the latent pattern exists within the complex terms.
there are a lot of research results in large scale graph analysis on Hadoop. the performance of the graph analysis based on Hadoop is impacted by data partitioning. the effectiveness of data partitioning depends on ho...
详细信息
there are a lot of research results in large scale graph analysis on Hadoop. the performance of the graph analysis based on Hadoop is impacted by data partitioning. the effectiveness of data partitioning depends on how the data partitioning maintains data locality in each node of cluster, and this would be different from the problems faced with. One way of data partitioning known to be effective is partitioning data by domains. For instance, this technique could be very useful in partitioning data by areas analyzing web graphs. But this kind of improvement from the data partitioning is limited to specific problems. In this paper, we propose a data partitioning technique based on semi-clustering for analyzing web graphs with PageRank algorithm on Hadoop. With experiment, PageRank computation with our partitioning technique improves the performance, as the number of iterations increases. this method can be very effective in the case of large scale graph processing.
A PSE (Problem Solving Environment), called as WAPSE, is constructed based on cooperation of web modules, distributed on one or more servers. WAPSE helps users to concentrate on problem solving. A user interface as pa...
详细信息
ISBN:
(纸本)9781467308946
A PSE (Problem Solving Environment), called as WAPSE, is constructed based on cooperation of web modules, distributed on one or more servers. WAPSE helps users to concentrate on problem solving. A user interface as part of PSE of WAPSE is built on a web browser for an easy use of PSE. Simple processing combining multiple modules is already possible, because web service technologies have since progressed. Various web servers connected to the Internet can share each resource. However, it is difficult to build a web based agents' system in primitive web environments. It is troublesome to deploy a large numbers of web based agents to each computer all at once. the debugging of a web based agent and change of the processing computers are common problems at a development stage. therefore, we developed WAPSE so that a developer more easily described a web based agent. WAPSE is a middleware for PSE using cooperation of a simple web based agents' group handled on one or more web servers. though the architecture is simple, WAPSE behaves in a complicated way because it makes distributed cooperation between agents in async sequentially. One simple web based agent's behavior is predictable because a simple web based agent is carried out simply. However, behavior becomes complicated even at cooperation of two simple web based agents. Furthermore, behavior by the cooperation of many simple web based agents is complicated. WAPSE is extended to include mobile terminals as processing computers because recently various mobile terminals are popular. therefore, WAPSE behaves in a more complicated way.
Arabic language is very rich in grammatical declension of nouns and verbs. In classical Arabic, there are have classes of verb lexemes and these are declined in four ways. Modeling the morphological effect of such a r...
详细信息
Arabic language is very rich in grammatical declension of nouns and verbs. In classical Arabic, there are have classes of verb lexemes and these are declined in four ways. Modeling the morphological effect of such a rich declension system is a challenging problem and is essential for processing of Arabic language. In this paper, we use Head-Driven Phase Structure Grammar (HPSG) to analyze the declension system of Arabic verbs based on lexical types. We design lexical type hierarchy of Arabic verbs and show the mapping of lexical type to declension type from this type hierarchy. We also design construction rules to capture the morphological and syntactic effect of declension types dynamically. Finally, we have implemented the lexical type hierarchy and construction rules in the TRALE platform to validate the proposed formalism. this work effectively extends the capabilities of existing HPSG framework for supporting declension of verbs in Arabic.
the growth of web applications on Internet has led to the increase in cyber crime. the attacker may inject malicious code into text boxes of vulnerable web application such as guest book, feedback form, search box, et...
详细信息
ISBN:
(纸本)9781467302418
the growth of web applications on Internet has led to the increase in cyber crime. the attacker may inject malicious code into text boxes of vulnerable web application such as guest book, feedback form, search box, etc. which may be further executed by web server. the execution of system call and API on web server by attacker through PHP code injection may damage the file system or leaks configuration information of web server. PHP code injection attacks have become more extensive in nature due to the emergence of dynamic web paradigms. Dynamic features and functionalities of a web site are controlled through PHP language. Hence, the use of PHP language (which itself carries vulnerabilities) in dynamic web page results in higher possibilities of successful execution of code injection attacks. the aim of this paper is twofold. Firstly, to understand the web application vulnerabilities related to PHP code injection attack, two PHP code injection attack scenarios have been developed. Secondly, to accurate and fast incident determination from gathered evidences a tagging system based on domain dictionary has been developed. the proposed prototype system shall be helpful for law enforcement agency to effectively gather and analyze evidences subjected to PHP code injection attacks.
暂无评论