The problem of matching schemas or ontologies consists of providing corresponding entities in two or more knowledge models that belong to a same domain but have been developed separately. Nowadays there are a lot of t...
详细信息
The problem of matching schemas or ontologies consists of providing corresponding entities in two or more knowledge models that belong to a same domain but have been developed separately. Nowadays there are a lot of techniques and tools for addressing this problem, however, the complex nature of the matching problem make existing solutions for real situations not fully satisfactory. The Google Similarity Distance has appeared recently. Its purpose is to mine knowledge from the Web using the Google search engine in order to semantically compare text expressions. Our work consists of developing a software application for validating results discovered by schema and ontolog2/ matching tools using the philosophy behind this distance. Moreover, we are interested in using not only Google, but other popular search engines with this similarity distance. The results reveal three main facts. Firstly, some web search engines can help us to validate semantic correspondences satisfactorily. Secondly there are significant differences among the web search engines. And thirdly the best results are obtained when using combinations of the web search engines that we have studied.
In the process of integrating legacy databases, one has to resolve inter-database conflicts at both the schema and instance levels. In this paper, we discuss relationship conflicts as a special type of conflicts to be...
详细信息
In the process of integrating legacy databases, one has to resolve inter-database conflicts at both the schema and instance levels. In this paper, we discuss relationship conflicts as a special type of conflicts to be resolved during the database integration. Relationships are properties that relate real world objects. So far, most inter-database relationship conflicts are addressed at the schema-level by various schema integration techniques. However, instance-level relationship conflicts are largely neglected. This paper therefore investigates the causes of instance-level relationship conflicts and proposes a taxonomy for classifying instance-level relationship conflicts. In addition, we develop a systematic process to resolve instance-level relationship conflicts and incorporate the resolution steps into the overall database integration process. Instance-level relationship conflict detection algorithms have also been developed to aid the resolution process. This research should facilitate database integration work for both multidatabase and data warehousing approaches. Most importantly, it should improve the data quality of the integrated databases. (C) 2000 Elsevier Science B.V. All rights reserved.
We constructed a web-based genome annotation platform, MarinegenomicsDB, to integrate genome data from various marine organisms including the pearl oyster Pinctada fucata and the coral Acropora digitifera. This newly ...
详细信息
We constructed a web-based genome annotation platform, MarinegenomicsDB, to integrate genome data from various marine organisms including the pearl oyster Pinctada fucata and the coral Acropora digitifera. This newly developed viewer application provides open access to published data and a user-friendly environment for community-based manual gene annotation. Development on a flexible framework enables easy expansion of the website on demand. To date, more than 2000 genes have been annotated using this system. In the future, the website will be expanded to host a wider variety of data, more species, and different types of genome-wide analyses. The website is available at the following URL: http://***.
What is neuroinformatics? What is the Human Brain Project? Why should you care? Supported by a consortium of US funding agencies, the Human Brain Project aims to bring to the analysis of brain function the same advant...
详细信息
What is neuroinformatics? What is the Human Brain Project? Why should you care? Supported by a consortium of US funding agencies, the Human Brain Project aims to bring to the analysis of brain function the same advantages of Internet-accessible databases and database tools that have been crucial to the development of molecular biology and the Human Genome Project. The much greater complexity of neural data, however, makes this a far more challenging task. As a pilot project in this new initiative, we review some of the progress that has been made and indicate some of the problems, challenges and opportunities that lie ahead.
This paper outlines the work carried out by the project team over the last three years to develop an in-house current information management system, focused on the specific need to gather information from across vario...
详细信息
This paper outlines the work carried out by the project team over the last three years to develop an in-house current information management system, focused on the specific need to gather information from across various departmental databases to fulfil the research excellence framework requirements for a specialist arts institution. The overall objective of the project was to support the university's successful submission to the REF2014 in November 2013. The system was used to collate relevant information from various institutional databases and transfer this to the Higher Education Funding Council for England (HEFCE) Submission System, thereby increasing institutional efficiency by reducing repetition of data entry and saving time in checking and organising information. (C) 2014 Published by Elsevier B.V
作者:
Harwood, CRMoszer, INewcastle Univ
Sch Med Dept Microbiol & Immunol Newcastle Upon Tyne NE2 4HH Tyne & Wear England Inst Pasteur
Unite Genet Genomes Bacteriens F-75724 Paris 15 France
Bacillus subtilis is a sporulating Gram-positive bacterium that lives primarily in the soil and associated water sources. The publication of the B. subtilis genome sequence and subsequent systematic functional analysi...
详细信息
Bacillus subtilis is a sporulating Gram-positive bacterium that lives primarily in the soil and associated water sources. The publication of the B. subtilis genome sequence and subsequent systematic functional analysis and gene regulation programmes, together with an extensive understanding of its biochemistry and physiology, makes this micro-organism a prime candidate in which to model regulatory networks in silico. In this paper we discuss combined molecular biological and bioinformatical approaches that are being developed to model this organism's responses to changes in its environment. Copyright (C) 2001 John Wiley Sons, Ltd.
The Press and Information Office of the Federal Government of Germany, the public relations department of the German Government, has been on the Web since 1996 as a complement to conventional publication methods. It a...
详细信息
The Press and Information Office of the Federal Government of Germany, the public relations department of the German Government, has been on the Web since 1996 as a complement to conventional publication methods. It acts as an entry point for the German Government referring to the Web pages and servers of the German Government itself, other German constitutional bodies, German embassies, Permanent Missions and information centers, and other German institutions. It started with existing information material, converted to HTML. Due to the national mandate and due to the information of public concern in this Web site, additional requirements had to be fulfilled: A highly automated information processing around the clock, a high security level implemented, and online ordering of booklets and actual information (Webcasting). The paper describes how we succeeded together with our co-operation partners to bring the German Government represented by the Press and Information Office into the Web. Naturally it covers only some parts of our two-years-project which ended in March, 1998. (C) 1998 Published by Elsevier Science B.V. All rights resented.
Role-based access control provides a very flexible set of mechanisms for managing the access control of a complex system with many users, objects and applications. The role graph model of Nyanchama and Osborn is one e...
详细信息
Role-based access control provides a very flexible set of mechanisms for managing the access control of a complex system with many users, objects and applications. The role graph model of Nyanchama and Osborn is one example of how role administration algorithms can be implemented. In our previous research, we have also shown how the access control information of existing systems can be extracted and represented as a role graph. In this paper, we extend this research by showing how, when two systems are being integrated, their role graphs can also be integrated. (C) 2002 Elsevier Science B.V. All rights reserved.
RDF has established in the last years as the language for describing, publishing and sharing biomedical resources. Following this trend, a great amount of RDF-based data sources, as well as ontologies, have appeared. ...
详细信息
ISBN:
(纸本)9781614992899;9781614992882
RDF has established in the last years as the language for describing, publishing and sharing biomedical resources. Following this trend, a great amount of RDF-based data sources, as well as ontologies, have appeared. Using a common language as RDF has provided a unified syntactic for sharing resources, but the semantics remain as the main cause of heterogeneity, hampering data integration and homogenization efforts. To overcome this issue, ontology alignment based solutions have been typically used. However, alignment information is usually codified using ad-hoc formats. In this paper, we present a general purpose ontology mapping format, totally independent from the homogenization approach to be applied. The format is accompanied with a Java API that offers mapping construction and parsing features, as well as some basic algorithms for applying it to data translation solutions.
Data on the Swedish population are stored in many registers located in different authorities. These data are crucial for epidemiological and other register-based research. We are developing an infrastructure to suppor...
详细信息
ISBN:
(纸本)9780769537634
Data on the Swedish population are stored in many registers located in different authorities. These data are crucial for epidemiological and other register-based research. We are developing an infrastructure to support research on the data from Swedish registers. In the infrastructure each organization or authority should keep its own control on all accesses to its data. The infrastructure should allow only authorized access to data. The effort for scientists to perform their analyses on the data should be insignificant. We propose to base the infrastructure on federated databases. This enables secure and only authorized data access, efficient and scalable data extractions, integration with statistical and other tools, and autonomy to data sources. We are investigating the feasibility of our approach by developing a pilot infrastructure for epidemiological research on cervical cancer. In this paper we identify challenges of applying federated databases for accessing sensitive and personal data by scientists.
暂无评论