Data in Brief | |
A dataset of chest X-ray reports annotated with Spatial Role Labeling annotations | |
Kirk Roberts1  Surabhi Datta2  | |
[1] Corresponding author.;School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, USA; | |
关键词: Spatial relations; Spatial Role Labeling; Radiology report; Chest radiology; Natural language processing; Information extraction; | |
DOI : | |
来源: DOAJ |
【 摘 要 】
In this paper, we present a dataset consisting of 2000 chest X-ray reports (available as part of the Open-i image search platform) annotated with spatial information. The annotation is based on Spatial Role Labeling. The information includes annotating a radiographic finding, its associated anatomical location, any potential diagnosis described in connection to the spatial relation (between finding and location), and any hedging phrase used to describe the certainty level of a finding/diagnosis. All these annotations are identified with reference to a spatial expression (or Spatial Indicator) that triggers a spatial relation in a sentence. The spatial roles used to encode the spatial information are Trajector, Landmark, Diagnosis, and Hedge. In total, there are 1962 Spatial Indicators (mainly prepositions). There are 2293 Trajectors, 2167 Landmarks, 455 Diagnosis, and 388 Hedges in the dataset. This annotated dataset can be used for developing automatic approaches targeted toward spatial information extraction from radiology reports which then can be applied to numerous clinical applications. We utilize this dataset to develop deep learning-based methods for automatically extracting the Spatial Indicators as well as the associated spatial roles [1].
【 授权许可】
Unknown