License:  Creative Commons Attribution 4.0 International license (CC BY 4.0)
 Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/DagSemProc.08131.8
URN: urn:nbn:de:0030-drops-15133
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2008/1513/
  | Go to the corresponding Portal | 
 
Leser, Ulf ; 
Groth, Philip ; 
Weiss, Bertram ; 
Pohlenz, Hans-Dieter 
Mining Phenotypes for Protein Function Prediction
Abstract
Until very recently, phenotypes only very rarely were studied in a systematic manner. While ontologies for describing gene functions now have a 10 year long tradition, similar vocabularies for describing the phenotype of genes are only emerging now; similarly, the techniques for determining phenotypes on a large scale (especially RNAi) are available only for a few years, while genomic sequencing or gene expression studies are already established for a much longer time.
In this talk, we describe results from a study for exploiting phenotype descriptions for protein function prediction. We used the data from PhenomicsDB, a phenotype database integrated from several publicly available data sources. Due to the lack of standardization, phenotypes in PhenomicsDB can only be viewed as text (short statements, abstracts, singular terms, ...). We clustered these texts and analyzed the corresponding gene clusters in terms of their coherence in functional annotation and their interconnectedness by protein-protein-interactions. We also devised a method for using the close similarity in their phenotype descriptions to predict the function of proteins. We show that this methods yields a very good precision at acceptable coverage.
BibTeX - Entry
@InProceedings{leser_et_al:DagSemProc.08131.8,
  author =	{Leser, Ulf and Groth, Philip and Weiss, Bertram and Pohlenz, Hans-Dieter},
  title =	{{Mining Phenotypes for Protein Function Prediction}},
  booktitle =	{Ontologies and Text Mining for Life Sciences : Current Status and Future Perspectives},
  pages =	{1--1},
  series =	{Dagstuhl Seminar Proceedings (DagSemProc)},
  ISSN =	{1862-4405},
  year =	{2008},
  volume =	{8131},
  editor =	{Michael Ashburner and Ulf Leser and Dietrich Rebholz-Schuhmann},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2008/1513},
  URN =		{urn:nbn:de:0030-drops-15133},
  doi =		{10.4230/DagSemProc.08131.8},
  annote =	{Keywords: Data mining, funciton prediction, bioinformatics, phenotypes, text mining}
}
 
| Keywords: |  | Data mining, funciton prediction, bioinformatics, phenotypes, text mining | 
 
 
| Collection: |  | 08131 - Ontologies and Text Mining for Life Sciences : Current Status and Future Perspectives | 
 
 
| Issue Date: |  | 2008 | 
 
 
| Date of publication: |  | 03.06.2008 |