License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.ICCSW.2017.5
URN: urn:nbn:de:0030-drops-84475
Go to the corresponding OASIcs Volume Portal

Stamford, John ; Kambhampati, Chandra

Discriminative and Generative Models for Clinical Risk Estimation: An Empirical Comparison

OASIcs-ICCSW-2017-5.pdf (1.0 MB)


Linear discriminative models, in the form of Logistic Regression, are a popular choice within the clinical domain in the development of risk models. Logistic regression is commonly used as it offers explanatory information in addition to its predictive capabilities. In some examples the coefficients from these models have been used to determine overly simplified clinical risk scores. Such models are constrained to modeling linear relationships between the variables and the class despite it known that this relationship is not always linear. This paper compares the conditions under which linear discriminative and linear generative models perform best. This is done through comparing logistic regression and naïve Bayes on real clinical data. The work shows that generative models perform best when the internal representation of the data is closer to the true distribution of the data and when there is a very small difference between the means of the classes. When looking at variables such as sodium it is shown that logistic regression can not model the observed risk as it is non-linear in its nature, whereas naïve Bayes gives a better estimation of risk. The work concludes that the risk estimations derived from discriminative models such as logistic regression need to be considered in the wider context of the true risk observed within the dataset.

BibTeX - Entry

  author =	{John Stamford and Chandra Kambhampati},
  title =	{{Discriminative and Generative Models for Clinical Risk Estimation: An Empirical Comparison}},
  booktitle =	{2017 Imperial College Computing Student Workshop (ICCSW 2017)},
  pages =	{5:1--5:9},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-059-0},
  ISSN =	{2190-6807},
  year =	{2018},
  volume =	{60},
  editor =	{Fergus Leahy and Juliana Franco},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{},
  URN =		{urn:nbn:de:0030-drops-84475},
  doi =		{10.4230/OASIcs.ICCSW.2017.5},
  annote =	{Keywords: Discriminative, Generative, Naive Bayes, Logistic Regression, Clinical Risk}

Keywords: Discriminative, Generative, Naïve Bayes, Logistic Regression, Clinical Risk
Collection: 2017 Imperial College Computing Student Workshop (ICCSW 2017)
Issue Date: 2018
Date of publication: 21.02.2018

DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI