Endocrinol Metab.  2016 Mar;31(1):38-44. 10.3803/EnM.2016.31.1.38.

How to Establish Clinical Prediction Models

Affiliations
  • 1Department of Internal Medicine, Yonsei University College of Medicine, Seoul, Korea. yholee@yuhs.ac
  • 2Division of Biostatistics, Department of Public Health Sciences, University of California Davis School of Medicine, Davis, CA, USA.
  • 3Department of Endocrinology and Metabolism, Ajou University School of Medicine, Suwon, Korea. djkim@ajou.ac.kr

Abstract

A clinical prediction model can be applied to several challenging clinical scenarios: screening high-risk individuals for asymptomatic disease, predicting future events such as disease or death, and assisting medical decision-making and health education. Despite the impact of clinical prediction models on practice, prediction modeling is a complex process requiring careful statistical analyses and sound clinical judgement. Although there is no definite consensus on the best methodology for model development and validation, a few recommendations and checklists have been proposed. In this review, we summarize five steps for developing and validating a clinical prediction model: preparation for establishing clinical prediction models; dataset selection; handling variables; model generation; and model evaluation and validation. We also review several studies that detail methods for developing clinical prediction models with comparable examples from real practice. After model development and vigorous validation in relevant settings, possibly with evaluation of utility/usability and fine-tuning, good models can be ready for the use in practice. We anticipate that this framework will revitalize the use of predictive or prognostic research in endocrinology, leading to active applications in real clinical practice.

Keyword

Clinical prediction model; Development; Validation; Clinical usefulness

MeSH Terms

Asymptomatic Diseases
Checklist
Consensus
Dataset
Endocrinology
Health Education
Mass Screening

Cited by  7 articles

Biomarker Score in Risk Prediction: Beyond Scientific Evidence and Statistical Performance
Heejung Bang
Diabetes Metab J. 2020;44(2):245-247.    doi: 10.4093/dmj.2020.0073.

Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort
Sang Youl Rhee, Ji Min Sung, Sunhee Kim, In-Jeong Cho, Sang-Eun Lee, Hyuk-Jae Chang
Diabetes Metab J. 2021;45(4):515-525.    doi: 10.4093/dmj.2020.0081.

Development and Validation of the Korean Diabetes Risk Score: A 10-Year National Cohort Study
Kyoung Hwa Ha, Yong-ho Lee, Sun Ok Song, Jae-woo Lee, Dong Wook Kim, Kyung-hee Cho, Dae Jung Kim
Diabetes Metab J. 2018;42(5):402-414.    doi: 10.4093/dmj.2018.0014.

Nonalcoholic Fatty Liver Disease in Diabetes. Part I: Epidemiology and Diagnosis
Yong-ho Lee, Yongin Cho, Byung-Wan Lee, Cheol-Young Park, Dae Ho Lee, Bong-Soo Cha, Eun-Jung Rhee
Diabetes Metab J. 2019;43(1):31-45.    doi: 10.4093/dmj.2019.0011.

Development of Clinical Data Mart of HMG-CoA Reductase Inhibitor for Varied Clinical Research
Hun-Sung Kim, Hyunah Kim, Yoo Jin Jeong, Tong Min Kim, So Jung Yang, Sun Jung Baik, Seung-Hwan Lee, Jae Hyoung Cho, In Young Choi, Kun-Ho Yoon
Endocrinol Metab. 2017;32(1):90-98.    doi: 10.3803/EnM.2017.32.1.90.

Development and Evaluation of Electronic Health Record Data-Driven Predictive Models for Pressure Ulcers
Seul Ki Park, Hyeoun-Ae Park, Hee Hwang
J Korean Acad Nurs. 2019;49(5):575-585.    doi: 10.4040/jkan.2019.49.5.575.

Performance of Diabetes and Kidney Disease Screening Scores in Contemporary United States and Korean Populations
Liela Meng, Keun-Sang Kwon, Dae Jung Kim, Yong-ho Lee, Jeehyoung Kim, Abhijit V. Kshirsagar, Heejung Bang
Diabetes Metab J. 2022;46(2):273-285.    doi: 10.4093/dmj.2021.0054.


Reference

1. Moons KG, Royston P, Vergouwe Y, Grobbee DE, Altman DG. Prognosis and prognostic research: what, why, and how? BMJ. 2009; 338:b375.
2. Hemingway H, Croft P, Perel P, Hayden JA, Abrams K, Timmis A, et al. Prognosis research strategy (PROGRESS) 1: a framework for researching clinical outcomes. BMJ. 2013; 346:e5595.
3. Sackett DL, Rosenberg WM, Gray JA, Haynes RB, Richardson WS. Evidence based medicine: what it is and what it isn't. BMJ. 1996; 312:71–72.
4. Greenland S. The need for reorientation toward cost-effective prediction: comments on 'Evaluating the added predictive ability of a new marker. From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929). Stat Med. 2008; 27:199–206.
5. Lim NK, Park SH, Choi SJ, Lee KS, Park HY. A risk score for predicting the incidence of type 2 diabetes in a middle-aged Korean cohort: the Korean genome and epidemiology study. Circ J. 2012; 76:1904–1910.
6. Griffin SJ, Little PS, Hales CN, Kinmonth AL, Wareham NJ. Diabetes risk score: towards earlier detection of type 2 diabetes in general practice. Diabetes Metab Res Rev. 2000; 16:164–171.
7. Goff DC Jr, Lloyd-Jones DM, Bennett G, Coady S, D'Agostino RB, Gibbons R, et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation. 2014; 129:25 Suppl 2. S49–S73.
8. Steyerberg EW, Vergouwe Y. Towards better clinical prediction models: seven steps for development and an ABCD for validation. Eur Heart J. 2014; 35:1925–1931.
9. Royston P, Moons KG, Altman DG, Vergouwe Y. Prognosis and prognostic research: developing a prognostic model. BMJ. 2009; 338:b604.
10. Altman DG, Vergouwe Y, Royston P, Moons KG. Prognosis and prognostic research: validating a prognostic model. BMJ. 2009; 338:b605.
11. Moons KG, Altman DG, Vergouwe Y, Royston P. Prognosis and prognostic research: application and impact of prognostic models in clinical practice. BMJ. 2009; 338:b606.
12. Laupacis A, Sekar N, Stiell IG. Clinical prediction rules. A review and suggested modifications of methodological standards. JAMA. 1997; 277:488–494.
13. Altman DG, Royston P. What do we mean by validating a prognostic model? Stat Med. 2000; 19:453–473.
14. Steyerberg EW. Clinical prediction models: a practical approach to development, validation, and updating. New York: Springer;2009.
15. Steyerberg EW, Moons KG, van der Windt DA, Hayden JA, Perel P, Schroter S, et al. Prognosis Research Strategy (PROGRESS) 3: prognostic model research. PLoS Med. 2013; 10:e1001381.
16. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med. 2015; 162:55–63.
17. Liu J, Hong Y, D'Agostino RB Sr, Wu Z, Wang W, Sun J, et al. Predictive value for the Chinese population of the Framingham CHD risk assessment tool compared with the Chinese Multi-Provincial Cohort Study. JAMA. 2004; 291:2591–2599.
18. Jee SH, Jang Y, Oh DJ, Oh BH, Lee SH, Park SW, et al. A coronary heart disease prediction model: the Korean Heart Study. BMJ Open. 2014; 4:e005025.
19. Lee YH, Bang H, Park YM, Bae JC, Lee BW, Kang ES, et al. Non-laboratory-based self-assessment screening score for non-alcoholic fatty liver disease: development, validation and comparison with other scores. PLoS One. 2014; 9:e107584.
20. Bang H, Edwards AM, Bomback AS, Ballantyne CM, Brillon D, Callahan MA, et al. Development and validation of a patient self-assessment score for diabetes risk. Ann Intern Med. 2009; 151:775–783.
21. Kotronen A, Peltonen M, Hakkarainen A, Sevastianova K, Bergholm R, Johansson LM, et al. Prediction of non-alcoholic fatty liver disease and liver fat using metabolic and genetic factors. Gastroenterology. 2009; 137:865–872.
22. Harrell FE Jr. Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis. New York: Springer;2001.
23. Sun GW, Shook TL, Kay GL. Inappropriate use of bivariable analysis to screen risk factors for use in multivariable analysis. J Clin Epidemiol. 1996; 49:907–916.
24. Royston P, Altman DG, Sauerbrei W. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med. 2006; 25:127–141.
25. Boersma E, Poldermans D, Bax JJ, Steyerberg EW, Thomson IR, Banga JD, et al. Predictors of cardiac events after major vascular surgery: role of clinical characteristics, dobutamine echocardiography, and beta-blocker therapy. JAMA. 2001; 285:1865–1873.
26. Sauerbrei W. The use of resampling methods to simplify regression models in medical statistics. J R Stat Soc Ser C Appl Stat. 1999; 48:313–329.
27. Shmueli G. To explain or to predict. Stat Sci. 2010; 289–310.
28. Heikes KE, Eddy DM, Arondekar B, Schlessinger L. Diabetes risk calculator: a simple tool for detecting undiagnosed diabetes and pre-diabetes. Diabetes Care. 2008; 31:1040–1045.
29. Breiman L, Friedman J, Stone CJ, Olshen RA. Classification and regression trees. Belmont: Wadsworth International Group;1984.
30. Breiman L. Statistical modeling: the two cultures (with comments and a rejoinder by the author). Statist Sci. 2001; 16:199–231.
31. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology. 2010; 21:128–138.
32. Meffert PJ, Baumeister SE, Lerch MM, Mayerle J, Kratzer W, Volzke H. Development, external validation, and comparative assessment of a new diagnostic score for hepatic steatosis. Am J Gastroenterol. 2014; 109:1404–1414.
33. Hilden J. Commentary: on NRI, IDI, and "good-looking" statistics with nothing underneath. Epidemiology. 2014; 25:265–267.
34. Lee YH, Bang H, Kim HC, Kim HM, Park SW, Kim DJ. A simple screening score for diabetes for the Korean population: development, validation, and comparison with other scores. Diabetes Care. 2012; 35:1723–1730.
35. Wyatt JC, Altman DG. Commentary: Prognostic models: clinically useful or quickly forgotten? BMJ. 1995; 311:1539.
Full Text Links
  • ENM
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2024 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr