Citation: M. Lange et al. (2014-07-25): IDPredictor: predict database links in biomedical database. DOI:10.5447/ipk/2012/4

Abstract: Knowledge found in biomedical databases, in particular in Web information systems, is a major bioinformatics resource. In general, this biological knowledge is represented worldwide in a network of databases. These data are spread among thousands of databases, which overlap in content but differ substantially with respect to content detail, interface, formats and data structure. To support a functional annotation of lab data, such as protein sequences, metabolites or DNA sequences, as well as a semi-automated data exploration in information retrieval environments, an integrated view of databases is essential. Search engines have the potential to assist in data retrieval from these structured sources, but fall short of providing a comprehensive knowledge excerpt from the interlinked databases. A prerequisite for supporting the concept of an integrated data view is to acquire insights into cross-references among database entities. But only a fraction of all possible cross-references are explicitly tagged in the particular biomedical information systems. In this work, we investigate to what extent an automated construction of an integrated data network is possible. We propose a method that predicts and extracts cross-references from multiple life science databases and their possible referenced data targets. We study the retrieval quality of our method and the relationship between manually crafted relevance ranking and relevance ranking based on cross-references, and report on first, promising results.

License: GNU GENERAL PUBLIC LICENSE Version 2, June 1991

DOI: 10.5447/ipk/2012/4

Content: 0 Directories 1 Files (6.6 MB)

Files
//IDPredictor: predict database links in biomedical database/IDPredictor.zip
Metadata
CONTRIBUTOR:
Matthias Lange, Hendrik Mehlhorn, Uwe Scholz, Falk Schreiber
CREATOR:
Matthias Lange, Hendrik Mehlhorn, Uwe Scholz, Falk Schreiber
PUBLISHER: e!DAL - Plant Genomics and Phenomics Research Data Repository (PGP), IPK Gatersleben, Seeland OT Gatersleben, Corrensstraße 3, D-06466, Germany
SIZE: 6.6 MB
SUBJECT: bioinformatics, neural network, Result evaluation
CHECKSUM: SHA-256 : 64386b5066bd45f5d8f0b96eebed3a60ce9af513887da455a886f4adcf01b28b
COVERAGE: none
DATE: Event: event
UPDATED: TimePoint: Fri Jul 25 09:27:12 CEST 2014
CREATED: TimePoint: Fri Jul 25 09:27:12 CEST 2014
FORMAT: application/zip
LANGUAGE: en
RELATION: none
SOURCE: none
Revision: 1 - CreationDate: Fri Jul 25 09:27:12 CEST 2014 - RevisionDate: Fri Jul 25 09:27:13 CEST 2014
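
The CHECKSUM field above can be used to confirm that a downloaded copy of the archive is intact. The following is a minimal sketch, not part of the published record: it assumes the archive was saved locally as IDPredictor.zip in the current directory, and the helper names sha256_of_file and matches_record are hypothetical, introduced only for illustration.

```python
import hashlib
from pathlib import Path

# SHA-256 value copied from the CHECKSUM field of this record.
EXPECTED_SHA256 = "64386b5066bd45f5d8f0b96eebed3a60ce9af513887da455a886f4adcf01b28b"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the expected value (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumption: the archive sits in the current working directory.
    archive = Path("IDPredictor.zip")
    if archive.exists():
        print("checksum OK" if matches_record(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch would suggest an incomplete or altered download, in which case fetching the archive again via the DOI is the safer course.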