Kidney Res Clin Pract. 2017 Mar;36(1):3-11. doi: 10.23876/j.krcp.2017.36.1.3.

Medical big data: promise and challenges

Affiliations
  • Department of Biomedical Engineering, Seoul National University College of Medicine, Seoul, Korea.

Abstract

The concept of big data, commonly characterized by volume, variety, velocity, and veracity, goes far beyond the type of data and includes aspects of data analysis, such as hypothesis generation rather than hypothesis testing. Big data analysis focuses on the temporal stability of an association rather than on a causal relationship, and assumptions about the underlying probability distribution are frequently not required. Medical big data, as material to be analyzed, has various features that are distinct not only from big data in other disciplines but also from the data of traditional clinical epidemiology. Big data technology has many areas of application in healthcare, such as predictive modeling and clinical decision support, disease or safety surveillance, public health, and research. Big data analytics frequently exploits analytic methods developed in data mining, including classification, clustering, and regression. Medical big data analyses are complicated by many technical issues, such as missing values, the curse of dimensionality, and bias control, and they share the inherent limitations of observational studies, namely the inability to test causality because of residual confounding and reverse causation. Recently, propensity score analysis and instrumental variable analysis have been introduced to overcome these limitations, and they have accomplished a great deal. Many challenges, such as the absence of evidence of practical benefits of big data, methodological issues including legal and ethical ones, and clinical integration and utility issues, must be overcome to realize the promise of medical big data as the fuel of a continuous learning healthcare system that will improve patient outcomes and reduce waste in areas including nephrology.

Keywords

Big data; Epidemiology; Data mining; Healthcare; Statistics

MeSH Terms

Bias (Epidemiology)
Classification
Data Mining
Decision Support Systems, Clinical
Delivery of Health Care
Epidemiology
Ethics
Humans
Learning
Nephrology
Propensity Score
Public Health Surveillance
Statistics as Topic