This chapter gives a high-level overview of how query processing is carried out in SeCo. At the highest level of abstraction, queries are expressed in a conjunctive declarative query language over service interfaces, ...
详细信息
ISBN:
(纸本)9783642196676
This chapter gives a high-level overview of how query processing is carried out in SeCo. At the highest level of abstraction, queries are expressed in a conjunctive declarative query language over service interfaces, named SeCoQL, chosen to be a compact and readable formulation to serve both experts users and system developers. Queries are then expressed at a logical level in the form of acyclic invocation workflows, after a compile-time analysis that decides a cost-driven scheduling of service invocations. At a lower, physical level queries are then translated into executable specifications that distinguish between the data, flow and the control flow, support parallelism, account for stateless and stateful computation tasks, and support backward and forward control. The query engine is implemented as an interpreter of these physical plans. A workbench and testing environment is also available in the form of a tool, to monitor the processing of complex queries by inspecting all phases of their analysis and execution, at all levels of abstraction.
searchcomputing is a new paradigm for composing search services. While state-of-art search systems answer generic or domain-specific queries, searchcomputing enables answering questions via a constellation of dynami...
详细信息
ISBN:
(纸本)9783642123092
searchcomputing is a new paradigm for composing search services. While state-of-art search systems answer generic or domain-specific queries, searchcomputing enables answering questions via a constellation of dynamically selected, cooperating search services, which are correlated by means of join operations. The idea is simple, yet pervasive. new language and description paradigms are required for expressing queries and for connecting services. new user interfaces and protocols help capturing ranking preferences and enabling their refinement.
There are major trends to advance the functionality of search engines to a more expressive semantic level. This is enabled by the advent of knowledge-sharing communities such as Wikipedia and the progress in automatic...
详细信息
ISBN:
(纸本)9783642123092
There are major trends to advance the functionality of search engines to a more expressive semantic level. This is enabled by the advent of knowledge-sharing communities such as Wikipedia and the progress in automatically extracting entities and relationships from semistructured as well as natural-language Web sources. In addition, Semantic-Web-style ontologies, structured Deep-Web sources, and Social-Web networks and tagging communities can contribute towards a grand vision of turning the Web into a comprehensive knowledge base that can be efficiently searched with high precision. This vision and position paper discusses opportunities and challenges along this research avenue. The technical issues to be looked into include knowledge harvesting to construct large knowledge bases, searching for knowledge in terms of entities and relationships, and ranking the results of such queries.
searchcomputing aims at opening the Web to a new class of search applications, by offering enhanced expressive and computational power. The success of searchcomputing, as of any technical advance, will be measured b...
详细信息
ISBN:
(纸本)9783642123092
searchcomputing aims at opening the Web to a new class of search applications, by offering enhanced expressive and computational power. The success of searchcomputing, as of any technical advance, will be measured by its impact upon the search industry and market, and this in turn will be highly influenced by reactions of Web users and developers. It is too early to anticipate such reactions as the technology is still "under construction" but this chapter attempts a first identification of the possible future players in the development of searchcomputing applications, by grossly identifying the roles of "data source publishers" and of "application developers", and by discussing how classical advertising-based models may support the new applications. This chapter also describes the high-level design of the prototyping environment that is currently under development and how the design will support the deployment upon high performance architectures. Finally, we describe advertising as the prevalent business model of the search engines industry, and briefly discuss the options for the evolution of such model in the context of searchcomputing.
The Web is progressively becoming a multimedia content delivery platform. This trend poses severe challenges to the information retrieval theories, techniques and tools. This chapter defines the problem of multimedia ...
详细信息
ISBN:
(纸本)9783642123092
The Web is progressively becoming a multimedia content delivery platform. This trend poses severe challenges to the information retrieval theories, techniques and tools. This chapter defines the problem of multimedia information retrieval with its challenges and application areas, overviews its major technical issues, proposes a reference architecture unifying the aspects of content processing and querying, exemplifies a next-generation platform for multimedia search, and concludes by showing the close ties between multi-domain search investigated in searchcomputing and multimodal/multimedia search.
In this chapter we present the main trends in the search engine industry. Being such industry technology based, its dynamics can be assessed by applying theories such as (a) dominant design, (b) complementary assets, ...
详细信息
ISBN:
(纸本)9783642123092
In this chapter we present the main trends in the search engine industry. Being such industry technology based, its dynamics can be assessed by applying theories such as (a) dominant design, (b) complementary assets, (c) product and service architecture and (d) disruptive technologies. We dedicate the first section of this chapter to reviewing such literature and explaining how to apply it to identify trends in the search engine industry competition. As preliminary result we position the search engine industry among those that are probably entering in a new fluid phase. In this industry the Google architecture already emerged as dominant design, but after 2005 many new players entered the market (e.g. Cuil, Kosmix, Powerset, Wolfram Alpha, Bing) and most of them are not following the dominant design but are really trying to propose something radically new. Then, we present the data gathering tool we build to use analyze a sample of 26 search engines. In particular, we describe the dimensions, relevant to study the search engine industry, and the metrics for measuring the features of different search engines along those dimensions. We consider three types of metrics: (a) user based - what the user can perceive and act upon;(b) machinery related - what the search engine does internally;and (c) business model oriented - what makes the business profitable. Then we analyze the data using three methods: principal component analysis, two steps cluster analysis, and post hoc analysis on the business models categorization. We close the chapter discussing the results of our analysis.
searchcomputing has been proposed to support the integration of the results of search engines with other data and computational resources. A key feature of the resulting integration platform is direct support for mul...
详细信息
ISBN:
(纸本)9783642123092
searchcomputing has been proposed to support the integration of the results of search engines with other data and computational resources. A key feature of the resulting integration platform is direct support for multi-domain ordered data, reflecting the fact that search engines produce ranked outputs, which should be taken into account when the results of several requests are combined. In the life sciences, there are many different types of ranked data. For example, ranked data may represent many different phenomena, including physical ordering within a genome, algorithmically assigned scores that represent levels of sequence similarity, and experimentally measured values such as expression levels. This chapter explores the extent to which the searchcomputing functionalities designed for use with search engine results may be applicable for different forms of ranked data that are encountered when carrying out data integration in the life sciences. This is done by classifying different types of ranked data in the life sciences, providing examples of different types of ranking and ranking integration needs in the life sciences, identifying issues in the integration of such ranked data, and discussing techniques for drawing conclusions from diverse rankings.
Mashups, i.e., web applications that are developed by integrating data, application logic, and user interfaces sourced from the Web, represent one of the innovations that characterize Web 2.0. Novel content wrapping t...
详细信息
ISBN:
(纸本)9783642123092
Mashups, i.e., web applications that are developed by integrating data, application logic, and user interfaces sourced from the Web, represent one of the innovations that characterize Web 2.0. Novel content wrapping technologies, the availability of so-called web APIs (e.g., web services), and the increasing sophistication of mashup tools allow also the less skilled programmer (or even the average web user) to compose personal applications on the Web. In many cases, such applications also feature search capabilities, achieved by explicitly integrating search services, such as Google or Yahoo!, into the overall logic of the composite application. In this chapter, we first overview the state of the art in mashup development by looking at which technologies a mashup developer should master and which instruments exist that facilitate the overall development process. Then we specifically focus on our own mashup platform, mashArt, and discuss its approach to what we call universal integration, i.e., integration at the data, application, and user interface layer inside one and the same mashup environment. To better explain the novel ideas of the platform and its value in the context of searchcomputing, we discuss an example inspired by the idea of searchcomputing.
The proceedings contain 31 papers. The topics discussed include: the new frontier of web search technology: seven challenges;information exploration in searchcomputing;trends in search interaction;context and action ...
ISBN:
(纸本)9783642196676
The proceedings contain 31 papers. The topics discussed include: the new frontier of web search technology: seven challenges;information exploration in searchcomputing;trends in search interaction;context and action in search interfaces;visualization of multi-domain ranked data;semantic resource framework;automatic normalization and annotation for discovering semantic mappings;the rank join problem;efficient computation of searchcomputing queries;tools supporting searchcomputing application development;Bio-SeCo: integration and global ranking of biomedical search results;workflows for information integration in the life sciences;complex search, ranks, and biological discovery: a user's perspective;an experience in applying user centered design to searchcomputing;and analysis of business models for searchcomputing.
The proceedings contain 15 papers. The topics discussed include: next generation web search;search for knowledge;the search engine industry;from mashup technologies to universal integration: searchcomputing the imper...
ISBN:
(纸本)3642123090
The proceedings contain 15 papers. The topics discussed include: next generation web search;search for knowledge;the search engine industry;from mashup technologies to universal integration: searchcomputing the imperative way;web data extraction for service creation;multimedia and multimodal information retrieval;service marts;join methods and query optimization;rank-join algorithms for searchcomputing;Panta Rhei: flexible execution engine for searchcomputing queries;liquid queries and liquid results in searchcomputing;building searchcomputing applications;and searchcomputing and the life sciences.
暂无评论