期刊论文详细信息
Frontiers in Medicine
Big Data, Data Science, and Causal Inference: A Primer for Clinicians
article
Yoshihiko Raita1  Carlos A. Camargo, Jr.2  Liming Liang1  Kohei Hasegawa1 
[1] Department of Emergency Medicine, Harvard Medical School, Massachusetts General Hospital, United States;Division of Rheumatology, Department of Medicine, Harvard Medical School, Massachusetts General Hospital, United States;Department of Epidemiology, Harvard T.H. Chan School of Public Health, United States;Department of Biostatistics, Harvard T.H. Chan School of Public Health, United States
关键词: big data;    data science;    causal inference;    the ladder of causation;    machine learning;   
DOI  :  10.3389/fmed.2021.678047
学科分类:社会科学、人文和艺术(综合)
来源: Frontiers
PDF
【 摘 要 】

Clinicians handle a growing amount of clinical, biometric, and biomarker data. In this “big data” era, there is an emerging faith that the answer to all clinical and scientific questions reside in “big data” and that data will transform medicine into precision medicine. However, data by themselves are useless. It is the algorithms encoding causal reasoning and domain (e.g., clinical and biological) knowledge that prove transformative. The recent introduction of (health) data science presents an opportunity to re-think this data-centric view. For example, while precision medicine seeks to provide the right prevention and treatment strategy to the right patients at the right time, its realization cannot be achieved by algorithms that operate exclusively in data-driven prediction modes, as do most machine learning algorithms. Better understanding of data science and its tasks is vital to interpret findings and translate new discoveries into clinical practice. In this review, we first discuss the principles and major tasks of data science by organizing it into three defining tasks: (1) association and prediction, (2) intervention, and (3) counterfactual causal inference. Second, we review commonly-used data science tools with examples in the medical literature. Lastly, we outline current challenges and future directions in the fields of medicine, elaborating on how data science can enhance clinical effectiveness and inform medical practice. As machine learning algorithms become ubiquitous tools to handle quantitatively “big data,” their integration with causal reasoning and domain knowledge is instrumental to qualitatively transform medicine, which will, in turn, improve health outcomes of patients.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202108180001427ZK.pdf 1300KB PDF download
  文献评价指标  
  下载次数:12次 浏览次数:0次