Extensible Markup Language (xml) is a simple, flexible text format derived from SGML, which is originally designed to support large-scale electronic publishing. Nowadays xml plays a fundamental role in the exchange of...
详细信息
Extensible Markup Language (xml) is a simple, flexible text format derived from SGML, which is originally designed to support large-scale electronic publishing. Nowadays xml plays a fundamental role in the exchange of a wide variety of data on the Web. As xml allows designers to create their own customized tags, enables the definition, transmission, validation, and interpretation of data between applications, devices and organizations, lots of works in soft computing employ xml to take control and responsibility for the information, such as fuzzy markup language, and accordingly there are lots of xml-based data or documents. However, most of mobile and interactive ubiquitous multimedia devices have restricted hardware such as CPU, memory, and display screen. So, it is essential to compress an xml document/element collection to a brief summary before it is delivered to the user according to his/her information need. Query-oriented xml text summarization aims to provide users a brief and readable substitution of the original retrieved documents/elements according to the user's query, which can relieve users' reading burden effectively. We propose a query-oriented xmlsummarization system QxmlSum, which extracts sentences and combines them as a summary based on three kinds of features: user's queries, the content of xml documents/elements, and the structure of xml documents/elements. Experiments on the IEEE-CS datasets used in Initiative for the Evaluation of xml Retrieval show that the query-oriented xml summary generated by QxmlSum is competitive.
暂无评论