La separación en regresión logística, una solución y aplicación

Translated title of the contribution: The problem of separation in logistic regression, a solution and an application

Juan C.M. Correa, Marisol C. Valencia

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Logistic regression is one of the most used statistical techniques for explaining the probabilistic behavior of a given phenomenon. Data separation is a frequent problem in this model, as successes appear separated from failures and make it impossible to find the maximum likelihood estimators. Objective: to present a revision and a solution to the problem, and to compare it with other solutions. Methodology: a simulation of the logistic model and an estimation of the parameters’ bias using the proposed classical and Bayesian solution with fictitious observations, as well as the Firth method. Results: the bias found is lower when the pair of fictitious observations are generated using the Bayesian method. An example about the age at which menarche occurs is presented. Discussion: an appropriate solution to the problem of separation is provided using a simulation in a simple logistic model. Conclusions: the generation of fictitious observations within the separation region is recommended, and the best solution method is based on Bayesian theory, which achieves convergence of the parameters of the logistic model.

Translated title of the contributionThe problem of separation in logistic regression, a solution and an application
Original languageSpanish
Pages (from-to)281-288
Number of pages8
JournalRevista Facultad Nacional de Salud Publica
Volume29
Issue number3
DOIs
Publication statusPublished - Sept 2011
Externally publishedYes

Bibliographical note

Publisher Copyright:
© Universidad de Antioquia. All Rights Reserved.

All Science Journal Classification (ASJC) codes

  • Epidemiology
  • Health Policy
  • Public Health, Environmental and Occupational Health
  • Health Information Management

Fingerprint

Dive into the research topics of 'The problem of separation in logistic regression, a solution and an application'. Together they form a unique fingerprint.

Cite this