License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/DagSemProc.05471.3
URN: urn:nbn:de:0030-drops-5484
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2006/548/
Go to the corresponding Portal


Sturm, Marc ; Quinten, Sascha ; Huber, Christian G. ; Kohlbacher, Oliver

A machine learning approach for prediction of DNA and peptide HPLC retention times

pdf-format:
05471.SturmMarc.ExtAbstract.548.pdf (0.1 MB)


Abstract

High performance liquid chromatography (HPLC) has become one of the most efficient methods for
the separation of biomolecules. It is an important tool in DNA purification after synthesis as well as DNA quantification.
In both cases the separability of different oligonucleotides is essential. The prediction of oligonucleotide retention
times prior to the experiment may detect superimposed nucleotides and thereby help to avoid futile experiments.
In 2002 Gilar et al. proposed a simple mathematical model for the prediction of DNA retention times,
that reliably works at high temperatures only (at least 70°C).
To cover a wider temperature rang we incorporated DNA secondary structure information in addition to base composition and length.
We used support vector regression (SVR) for the model generation and retention time prediction.

A similar problem arises in shotgun proteomics. Here HPLC coupled to a mass spectrometer (MS) is used to analyze
complex peptide mixtures (thousands of peptides). Predicting peptide retention times can be used to validate
tandem-MS peptide identifications made by search engines like SEQUEST.
Recently several methods including multiple linear regression and artificial neural networks were proposed, but SVR has not been used so far.

BibTeX - Entry

@InProceedings{sturm_et_al:DagSemProc.05471.3,
  author =	{Sturm, Marc and Quinten, Sascha and Huber, Christian G. and Kohlbacher, Oliver},
  title =	{{A machine learning approach for prediction of DNA and peptide HPLC retention times}},
  booktitle =	{Computational Proteomics},
  pages =	{1--5},
  series =	{Dagstuhl Seminar Proceedings (DagSemProc)},
  ISSN =	{1862-4405},
  year =	{2006},
  volume =	{5471},
  editor =	{Christian G. Huber and Oliver Kohlbacher and Knut Reinert},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2006/548},
  URN =		{urn:nbn:de:0030-drops-5484},
  doi =		{10.4230/DagSemProc.05471.3},
  annote =	{Keywords: High performance liquid chromatography, mass spectrometry, retention time, prediction, peptide, DNA, support vector regression}
}

Keywords: High performance liquid chromatography, mass spectrometry, retention time, prediction, peptide, DNA, support vector regression
Collection: 05471 - Computational Proteomics
Issue Date: 2006
Date of publication: 18.09.2006


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI