Endocrinol Metab.  2024 Oct;39(5):711-721. 10.3803/EnM.2024.1986.

Insulin Resistance and Impaired Insulin Secretion Predict Incident Diabetes: A Statistical Matching Application to the Two Korean Nationwide, Population-Representative Cohorts

Affiliations
  • 1Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Korea
  • 2Medical Research Collaborating Center, Seoul National University Bundang Hospital, Seongnam, Korea
  • 3Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Korea
  • 4Big Data Center, Seoul National University Bundang Hospital, Seongnam, Korea
  • 5Department of Preventive Medicine, Seoul National University College of Medicine, Seoul, Korea

Abstract

Background
To evaluate whether insulin resistance and impaired insulin secretion are useful predictors of incident diabetes in Koreans using nationwide population-representative data to enhance data privacy.
Methods
This study analyzed the data of individuals without diabetes aged >40 years from the Korea National Health and Nutrition Examination Survey (KNHANES) 2007–2010 and 2015 and the National Health Insurance Service-National Health Screening Cohort (NHIS-HEALS). Owing to privacy concerns, these databases cannot be linked using direct identifiers. Therefore, we generated 10 synthetic datasets, followed by statistical matching with the NHIS-HEALS. Homeostasis model assessment of insulin resistance (HOMA-IR) and homeostasis model assessment of β-cell function (HOMA-β) were used as indicators of insulin resistance and insulin secretory function, respectively, and diabetes onset was captured in NHIS-HEALS.
Results
A median of 4,580 (range, 4,463 to 4,761) adults were included in the analyses after statistical matching of 10 synthetic KNHANES and NHIS-HEALS datasets. During a mean follow-up duration of 5.8 years, a median of 4.7% (range, 4.3% to 5.0%) of the participants developed diabetes. Compared to the reference low–HOMA-IR/high–HOMA-β group, the high–HOMA-IR/low– HOMA-β group had the highest risk of diabetes, followed by high–HOMA-IR/high–HOMA-β group and low–HOMA-IR/low– HOMA-β group (median adjusted hazard ratio [ranges]: 3.36 [1.86 to 6.05], 1.81 [1.01 to 3.22], and 1.68 [0.93 to 3.04], respectively).
Conclusion
Insulin resistance and impaired insulin secretion are robust predictors of diabetes in the Korean population. A retrospective cohort constructed by combining cross-sectional synthetic and longitudinal claims-based cohort data through statistical matching may be a reliable resource for studying the natural history of diabetes.

Keyword

Diabetes; Insulin resistance; Synthetic data; Statistical matching

Figure

  • Fig. 1. Flow chart of the study. This figure illustrates the matching process between the National Health Interview Survey (NHIS) data and the synthetic data generated from the Korea National Health and Nutrition Examination Survey (KNHANES). Initially, 10 synthetic datasets (m1, ..., m10) were created from the KNHANES data. Subsequently, each of these synthetic datasets was matched with the NHIS source data, resulting in the 10 matched datasets (M1, ..., M10). Specifically, a statistically matched dataset (M10) was incorporated by linking KNHANES synthetic data (m10) with the National Health Insurance Service-National Health Screening Cohort (NHIS-HEALS) dataset. In total, 24,617 individuals from the m10 dataset, which includes 2,512, 5,919, 6,542, and 4,421 participants in the years 2007, 2008, 2009, 2010, and 2015, respectively, and 514,866 individuals from NHIS-HEALS, including 186,980, 227,656, 223,551, 226,276, and 217,477 examinees in 2007, 2008, 2009, 2010, and 2015, respectively, were linked using statistical matching method. The participants in the same year of NHIS-HEALS and KNHANES were matched to ensure high-quality matching. As NHIS data includes tests that patients can undergo annually, duplication of the data may occur; thus, the previous year’s NHIS-KNHANES matched data were removed before the corresponding year’s matching. Using the same method above, M1–M9 were constructed by concatenating the KNHANES synthetic datasets m1–m9 and the NHIS-HEALS dataset, respectively—diabetes mellitus (DM) and fasting blood glucose (FBS). HOMA-IR, homeostasis model assessment of insulin resistance; HOMA-β, homeostasis model assessment of -βcell function. aExclusion criteria: (1) missing values, (2) previous DM history (questionnaire, FBS ≥126 mg/dL), (3) medical beneficiaries, (4) age <80 years; bExclusion criteria: (1) missing values, (2) previous DM history (questionnaire, FBS ≥126 mg/dL), (3) age <40 years.

  • Fig. 2. Kaplan-Meier curve of the risk for diabetes according to homeostasis model assessment of insulin resistance (HOMA-IR) and homeostasis model assessment of β-cell function (HOMA-β) statuses (representative matched dataset of Korea National Health and Nutrition Examination Survey synthetic dataset m10 and National Health Insurance Service-National Health Screening Cohort [NHIS-HEALS] dataset, M10).

  • Fig. 3. Forrest plot of the risk of diabetes according to the baseline homeostasis model assessment of insulin resistance (HOMA-IR) and homeostasis model assessment of β-cell function (HOMA-β) status (matched dataset of Korea National Health and Nutrition Examination Survey synthetic dataset m1–m10 and National Health Insurance Service-National Health Screening Cohort [NHIS-HEALS] dataset: M1–M10). Adjusted hazard ratios (HRs) of the low–HOMA-IR/low–HOMA-β group (orange), the high–HOMA-IR/high–HOMA-β group (green), and the high–HOMA-IR/low–HOMA-β group (blue) were calculated using the low–HOMA-IR/high–HOMA-β group as a reference.


Cited by  1 articles

Combining Nationwide Cohorts to Unveil the Predictive Role of Insulin Resistance and Impaired Insulin Secretion in Diabetes
Bukyung Kim
Endocrinol Metab. 2024;39(5):699-700.    doi: 10.3803/EnM.2024.2189.


Reference

1. Bae JH, Han KD, Ko SH, Yang YS, Choi JH, Choi KM, et al. Diabetes fact sheet in Korea 2021. Diabetes Metab J. 2022; 46:417–26.
Article
2. Defronzo RA. Banting lecture. From the triumvirate to the ominous octet: a new paradigm for the treatment of type 2 diabetes mellitus. Diabetes. 2009; 58:773–95.
3. Kodama K, Tojjar D, Yamada S, Toda K, Patel CJ, Butte AJ. Ethnic differences in the relationship between insulin sensitivity and insulin response: a systematic review and meta-analysis. Diabetes Care. 2013; 36:1789–96.
4. Tabak AG, Jokela M, Akbaraly TN, Brunner EJ, Kivimaki M, Witte DR. Trajectories of glycaemia, insulin sensitivity, and insulin secretion before diagnosis of type 2 diabetes: an analysis from the Whitehall II study. Lancet. 2009; 373:2215–21.
Article
5. Wang T, Lu J, Shi L, Chen G, Xu M, Xu Y, et al. Association of insulin resistance and β-cell dysfunction with incident diabetes among adults in China: a nationwide, populationbased, prospective cohort study. Lancet Diabetes Endocrinol. 2020; 8:115–24.
Article
6. Ohn JH, Kwak SH, Cho YM, Lim S, Jang HC, Park KS, et al. 10-Year trajectory of β-cell function and insulin sensitivity in the development of type 2 diabetes: a community-based prospective cohort study. Lancet Diabetes Endocrinol. 2016; 4:27–34.
Article
7. Morimoto A, Tatsumi Y, Deura K, Mizuno S, Ohno Y, Miyamatsu N, et al. Impact of impaired insulin secretion and insulin resistance on the incidence of type 2 diabetes mellitus in a Japanese population: the Saku study. Diabetologia. 2013; 56:1671–9.
Article
8. Park S, Kim K, Lee BK, Ahn J. A healthy diet rich in calcium and vitamin C is inversely associated with metabolic syndrome risk in Korean adults from the KNHANES 2013-2017. Nutrients. 2021; 13:1312.
Article
9. Park JH, Hong IY, Chung JW, Choi HS. Vitamin D status in South Korean population: seven-year trend from the KNHANES. Medicine (Baltimore). 2018; 97:e11032.
10. Yi DW, Khang AR, Lee HW, Son SM, Kang YH. Relative handgrip strength as a marker of metabolic syndrome: the Korea National Health and Nutrition Examination Survey (KNHANES) VI (2014-2015). Diabetes Metab Syndr Obes. 2018; 11:227–40.
11. Kim H, Lee M, Hwang H, Chung YJ, Cho HH, Yoon H, et al. The estimated prevalence and incidence of endometriosis with the Korean National Health Insurance Service-National Sample Cohort (NHIS-NSC): a national population-based study. J Epidemiol. 2021; 31:593–600.
Article
12. Ahn SV, Lee E, Park B, Jung JH, Park JE, Sheen SS, et al. Cancer development in patients with COPD: a retrospective analysis of the National Health Insurance Service-National Sample Cohort in Korea. BMC Pulm Med. 2020; 20:170.
Article
13. Son JW, Park CY, Kim S, Lee HK, Lee YS; Insulin Resistance as Primary Pathogenesis in Newly Diagnosed, Drug Naïve Type 2 Diabetes Patients in Korea (SURPRISE) Study Group. Changing clinical characteristics according to insulin resistance and insulin secretion in newly diagnosed type 2 diabetic patients in Korea. Diabetes Metab J. 2015; 39:387–94.
Article
14. Seong SC, Kim YY, Park SK, Khang YH, Kim HC, Park JH, et al. Cohort profile: the National Health Insurance Service-National Health Screening Cohort (NHIS-HEALS) in Korea. BMJ Open. 2017; 7:e016640.
Article
15. Matthews DR, Hosker JP, Rudenski AS, Naylor BA, Treacher DF, Turner RC. Homeostasis model assessment: insulin resistance and beta-cell function from fasting plasma glucose and insulin concentrations in man. Diabetologia. 1985; 28:412–9.
Article
16. Gonzalez JC, van Delden A, de Waal T. Assessment of the effect of constraints in a new multivariate mixed method for statistical matching. Computational Stat Data Anal. 2023; 177:107569.
17. Nowok B, Raab GM, Dibben C. Synthpop: bespoke creation of synthetic data in R. J Stat Softw. 2016; 74:1–26.
18. Yabe D, Seino Y. Type 2 diabetes via β-cell dysfunction in east Asian people. Lancet Diabetes Endocrinol. 2016; 4:2–3.
Article
19. Foraker RE, Yu SC, Gupta A, Michelson AP, Pineda Soto JA, Colvin R, et al. Spot the difference: comparing results of analyses from real patient data and synthetic derivatives. JAMIA Open. 2020; 3:557–66.
Article
20. Rankin D, Black M, Bond R, Wallace J, Mulvenna M, Epelde G. Reliability of supervised machine learning using synthetic data in health care: model to preserve privacy for data sharing. JMIR Med Inform. 2020; 8:e18910.
Article
21. Ping H, Stoyanovich J, Howe B. DataSynthesizer: privacy-preserving synthetic datasets. In : In: SSDBM ‘17 Proceedings of the 29th International Conference on Scientific and Statistical Database Management; 2017 Jun 27-29; Chicago, IL. Ney York, NY: Association for Computing Machinery;2017. p. 1–5.
22. Reiter JP. Using CART to generate partially synthetic public use microdata. J Off Stat. 2005; 21:441–62.
23. D’Orazio M, Di Zio M, Scanu M. Statistical matching: theory and practice. New York: John Wiley & Sons;2006.
24. D’Alberto R, Raggi M. Integrating rather than collecting: statistical matching in the data flood era. Stat Pap. 2024; 65:2135–63.
Article
25. De Waal AG. Statistical matching: experimental results and future research questions [Internet]. Den Haag: CBS;2015. [cited 2024 Jun 21]. Available from: https://pure.uvt.nl/ws/portalfiles/portal/48726611/MTO_d_Waal_statistical_matching_2015.pdf.
26. ElSayed NA, Aleppo G, Aroda VR, Bannuru RR, Brown FM, Bruemmer D, et al. 2. Classification and diagnosis of diabetes: standards of care in diabetes-2023. Diabetes Care. 2023; 46(Suppl 1):S19–40.
27. Park SY, Gautier JF, Chon S. Assessment of insulin secretion and insulin resistance in human. Diabetes Metab J. 2021; 45:641–54.
Article
28. Lee MJ, Bae JH, Khang AR, Yi D, Yun MS, Kang YH. Triglyceride-glucose index predicts type 2 diabetes mellitus more effectively than oral glucose tolerance test-derived insulin sensitivity and secretion markers. Diabetes Res Clin Pract. 2024; 210:111640.
Article
Full Text Links
  • ENM
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2025 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr