Publication Details

Combination of Logistic Regression and Boosting to Predict Disease Outcome

ŠILHAVÁ, J.; SMRŽ, P. Combination of Logistic Regression and Boosting to Predict Disease Outcome. Applied Statistics 2010 International Conference. Ribno, Bled: 2010. p. 35-35. ISBN: 978-961-92487-4-4.
Czech title
Combination of Logistic Regression and Boosting to Predict Disease Outcome
Type
abstract
Language
English
Authors
Jana Šilhavá, Pavel Smrž
Keywords

combination of heterogeneous data, gene expression data, class prediction, boosting

Abstract

An important current challenge in bioinformatics is the incorporation of diverse data types. Different bioinformatic data can provide complementary information. Combining relevant data may lead to more accurate findings; for example, it can help in understanding complex diseases or yield a more accurate hybrid diagnostic or prognostic signature. We propose a prediction approach that combines logistic regression and boosting. Logistic regression is applied to low-dimensional data, while boosting is applied to high-dimensional data. The approach is extended to incorporate more than two data sources. It is validated on simulated data sets and then applied to real bioinformatic data sets with clinical variables, gene expression data and SNP data. We show that this kind of data combination can increase predictive performance.
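
The abstract does not state how the two models are coupled. The following is a minimal sketch of one plausible reading, under stated assumptions: a logistic regression is fitted to the low-dimensional (clinical) variables, and its linear predictor is then used as a fixed offset from which componentwise gradient boosting on the logistic loss adds high-dimensional (gene expression) features. All function names, parameter values and the toy data generator are hypothetical and are not taken from the publication. The extension to more than two data sources is not shown.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def log_loss(y, f):
    """Mean logistic loss for labels y in {0, 1} and logit scores f."""
    total = 0.0
    for yi, fi in zip(y, f):
        total += max(fi, 0.0) + math.log1p(math.exp(-abs(fi))) - yi * fi
    return total / len(y)


def fit_logistic(X, y, steps=300, lr=0.5):
    """Logistic regression by plain gradient descent; returns (intercept, weights)."""
    n, p = len(X), len(X[0])
    b, w = 0.0, [0.0] * p
    for _ in range(steps):
        resid = [y[i] - sigmoid(b + sum(w[j] * X[i][j] for j in range(p)))
                 for i in range(n)]
        b += lr * sum(resid) / n
        for j in range(p):
            w[j] += lr * sum(resid[i] * X[i][j] for i in range(n)) / n
    return b, w


def boost_with_offset(Z, y, offset, rounds=50, nu=0.1):
    """Componentwise boosting on logistic loss, starting from a fixed offset.

    Each round fits every single feature to the current residuals by least
    squares, keeps the best one, and takes a shrunken step (factor nu).
    """
    n, q = len(Z), len(Z[0])
    f = list(offset)          # assumption: the clinical model enters as an offset
    coef = [0.0] * q
    for _ in range(rounds):
        resid = [y[i] - sigmoid(f[i]) for i in range(n)]  # negative gradient
        best_j, best_beta, best_gain = 0, 0.0, -1.0
        for j in range(q):
            sxx = sum(Z[i][j] ** 2 for i in range(n))
            if sxx == 0.0:
                continue
            sxr = sum(Z[i][j] * resid[i] for i in range(n))
            gain = sxr * sxr / sxx
            if gain > best_gain:
                best_j, best_beta, best_gain = j, sxr / sxx, gain
        coef[best_j] += nu * best_beta
        for i in range(n):
            f[i] += nu * best_beta * Z[i][best_j]
    return coef


def make_toy_data(n=200, q=30, seed=0):
    """Hypothetical simulated data: 2 clinical variables and q 'genes'."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(2)] for _ in range(n)]
    Z = [[rng.gauss(0, 1) for _ in range(q)] for _ in range(n)]
    y = [1 if rng.random() < sigmoid(X[i][0] + 2.0 * Z[i][3]) else 0
         for i in range(n)]
    return X, Z, y


if __name__ == "__main__":
    X, Z, y = make_toy_data()
    b, w = fit_logistic(X, y)
    offset = [b + sum(wj * xj for wj, xj in zip(w, row)) for row in X]
    coef = boost_with_offset(Z, y, offset)
    combined = [o + sum(c * z for c, z in zip(coef, row))
                for o, row in zip(offset, Z)]
    print("clinical-only training loss:", log_loss(y, offset))
    print("combined training loss:", log_loss(y, combined))
```

The sketch compares training loss only, so it illustrates the mechanics of the combination rather than predictive performance. A real evaluation would use held-out data or cross-validation.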

Published
2010
Pages
35–35
Book
Applied Statistics 2010 International Conference
ISBN
978-961-92487-4-4
Place
Ribno, Bled
BibTeX
@misc{BUT61068,
  author="Jana {Šilhavá} and Pavel {Smrž}",
  title="Combination of Logistic Regression and Boosting to Predict Disease Outcome",
  booktitle="Applied Statistics 2010 International Conference",
  year="2010",
  pages="35--35",
  address="Ribno, Bled",
  isbn="978-961-92487-4-4",
  note="abstract"
}