期刊论文详细信息
BMC Medical Research Methodology
Semantic and cognitive tools to aid statistical science: replace confidence and significance by compatibility and surprise
Sander Greenland1  Zad Rafi2 
[1]Department of Epidemiology and Department of Statistics, University of California, Los Angeles, CA, USA
[2]Department of Population Health, NYU Langone Medical Center, 227 East 30th Street, 10016, New York, NY, USA
关键词: Confidence intervals;    Cognitive science;    Bias;    Data interpretation;    Evidence;    Hypothesis tests;    Information;    P;    Statistical significance;    Models, statistical;   
DOI  :  10.1186/s12874-020-01105-9
来源: Springer
PDF
【 摘 要 】
BackgroundResearchers often misinterpret and misrepresent statistical outputs. This abuse has led to a large literature on modification or replacement of testing thresholds and P-values with confidence intervals, Bayes factors, and other devices. Because the core problems appear cognitive rather than statistical, we review some simple methods to aid researchers in interpreting statistical outputs. These methods emphasize logical and information concepts over probability, and thus may be more robust to common misinterpretations than are traditional descriptions.MethodsWe use the Shannon transform of the P-value p, also known as the binary surprisal or S-value s = −log2(p), to provide a measure of the information supplied by the testing procedure, and to help calibrate intuitions against simple physical experiments like coin tossing. We also use tables or graphs of test statistics for alternative hypotheses, and interval estimates for different percentile levels, to thwart fallacies arising from arbitrary dichotomies. Finally, we reinterpret P-values and interval estimates in unconditional terms, which describe compatibility of data with the entire set of analysis assumptions. We illustrate these methods with a reanalysis of data from an existing record-based cohort study.ConclusionsIn line with other recent recommendations, we advise that teaching materials and research reports discuss P-values as measures of compatibility rather than significance, compute P-values for alternative hypotheses whenever they are computed for null hypotheses, and interpret interval estimates as showing values of high compatibility with data, rather than regions of confidence.Our recommendations emphasize cognitive devices for displaying the compatibility of the observed data with various hypotheses of interest, rather than focusing on single hypothesis tests or interval estimates. We believe these simple reforms are well worth the minor effort they require.
【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202104264599951ZK.pdf 1678KB PDF download
  文献评价指标  
  下载次数:4次 浏览次数:10次