期刊论文详细信息
BMC Medical Research Methodology
The impact of grey zones on the accuracy of agreement measures for ordinal tables
Quoc Duyet Tran1  Haydar Demirhan2  Anil Dolgun2 
[1] An Giang University, VNU-HCM;Mathematical Sciences, School of Science, RMIT University;
关键词: Grading variation;    Inter-rater agreement;    Kappa coefficient;    Monte Carlo simulation;    Reliability;    Weight;   
DOI  :  10.1186/s12874-021-01248-3
来源: DOAJ
【 摘 要 】

Abstract Background In an inter-rater agreement study, if two raters tend to rate considering different aspects of the subject of interest or have different experience levels, a grey zone occurs among the levels of a square contingency table showing the inter-rater agreement. These grey zones distort the degree of agreement between raters and negatively impact the decisions based on the inter-rater agreement tables. In this sense, it is important to know how the existence of a grey zone impacts the inter-rater agreement coefficients to choose the most reliable agreement coefficient against the grey zones to reach out with more reliable decisions. Methods In this article, we propose two approaches to create grey zones in simulations setting and conduct an extensive Monte Carlo simulation study to figure out the impact of having grey zones on the weighted inter-rater agreement measures for ordinal tables over a comprehensive simulation space. Results The weighted inter-rater agreement coefficients are not reliable against the existence of grey zones. Increasing sample size and the number of categories in the agreement table decreases the accuracy of weighted inter-rater agreement measures when there is a grey zone. When the degree of agreement between the raters is high, the agreement measures are not significantly impacted by the existence of grey zones. However, if there is a medium to low degree of inter-rater agreement, all the weighted coefficients are more or less impacted. Conclusions It is observed in this study that the existence of grey zones has a significant negative impact on the accuracy of agreement measures especially for a low degree of true agreement and high sample and tables sizes. In general, Gwet’s AC2 and Brennan-Prediger’s κ with quadratic or ordinal weights are reliable against the grey zones.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:5次