Integration of Biological Sources: Exploring the Case of Protein Homology

Tjeerd W. Boerman, Maurice van Keulen, P.E. van der Vet, Edouard I. Severing

Research output: Book/Report › Report › Professional

Abstract

Data integration is a key issue in the domain of bioinformatics, which deals with huge amounts of heterogeneous biological data that grows and changes rapidly. This paper serves as an introduction to the field of bioinformatics and the biological concepts it deals with, and as an exploration of the integration problems a bioinformatics scientist faces. We examine ProGMap, an integrated protein homology system used by bioinformatics scientists at Wageningen University, and several use cases related to protein homology. A key issue we identify is the huge manual effort required to unify source databases into a single resource. Uncertain databases are able to contain several possible worlds, and it has been proposed that they can be used to significantly reduce initial integration efforts. We propose several directions for future work where uncertain databases can be applied to bioinformatics, with the goal of furthering the cause of bioinformatics integration.
Original language: Undefined
Place of Publication: Enschede
Publisher: Centre for Telematics and Information Technology (CTIT)
Number of pages: 17
Publication status: Published - Aug 2011

Publication series

Name: CTIT Technical Report Series
Publisher: University of Twente, Centre for Telematics and Information Technology
No.: TR-CTIT-11-18
ISSN (Print): 1381-3625

Keywords

  • IR-77936
  • METIS-278767
  • EWI-20450

Cite this

Boerman, T. W., van Keulen, M., van der Vet, P. E., & Severing, E. I. (2011). Integration of Biological Sources: Exploring the Case of Protein Homology. (CTIT Technical Report Series; No. TR-CTIT-11-18). Enschede: Centre for Telematics and Information Technology (CTIT).