BMC Bioinformatics | |
Advances in monolingual and crosslingual automatic disability annotation in Spanish | |
Research | |
Edgar Andres1  Koldo Gojenola1  Aitziber Atutxa1  Iakes Goenaga2  | |
[1]HiTZ: Basque Center for Language Technology, University of the Basque Country UPV/EHU, Bilbao, Spain | |
[2]HiTZ: Basque Center for Language Technology, University of the Basque Country UPV/EHU, Donostia, Spain | |
关键词: Artificial intelligence; Neural networks; Named entity recognition; Disability annotation; Embeddings; Crosslingual learning; | |
DOI : 10.1186/s12859-023-05372-3 | |
received in 2022-12-21, accepted in 2023-05-31, 发布年份 2023 | |
来源: Springer | |
![]() |
【 摘 要 】
BackgroundUnlike diseases, automatic recognition of disabilities has not received the same attention in the area of medical NLP. Progress in this direction is hampered by obstacles like the lack of annotated corpus. Neural architectures learn to translate sequences from spontaneous representations into their corresponding standard representations given a set of samples. The aim of this paper is to present the last advances in monolingual (Spanish) and crosslingual (from English to Spanish and vice versa) automatic disability annotation. The task consists of identifying disability mentions in medical texts written in Spanish within a collection of abstracts from journal papers related to the biomedical domain.ResultsIn order to carry out the task, we have combined deep learning models that use different embedding granularities for sequence to sequence tagging with a simple acronym and abbreviation detection module to boost the coverage.ConclusionsOur monolingual experiments demonstrate that a good combination of different word embedding representations provide better results than single representations, significantly outperforming the state of the art in disability annotation in Spanish. Additionally, we have experimented crosslingual transfer (zero-shot) for disability annotation between English and Spanish with interesting results that might help overcoming the data scarcity bottleneck, specially significant for the disabilities.【 授权许可】
CC BY
© The Author(s) 2023
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202309072180412ZK.pdf | 1645KB | ![]() |
|
Fig. 2 | 278KB | Image | ![]() |
41116_2023_37_Article_IEq112.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq128.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq140.gif | 1KB | Image | ![]() |
Fig. 2 | 232KB | Image | ![]() |
41116_2023_37_Article_IEq150.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq152.gif | 1KB | Image | ![]() |
Fig. 2 | 137KB | Image | ![]() |
41116_2023_37_Article_IEq174.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq188.gif | 1KB | Image | ![]() |
Fig. 1 | 380KB | Image | ![]() |
MediaObjects/12888_2023_4904_MOESM1_ESM.docx | 29KB | Other | ![]() |
Fig. 1 | 488KB | Image | ![]() |
Fig. 1 | 939KB | Image | ![]() |
41116_2023_37_Article_IEq193.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq194.gif | 1KB | Image | ![]() |
Fig. 1 | 227KB | Image | ![]() |
41116_2023_37_Article_IEq196.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq197.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq213.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq226.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq228.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq229.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq246.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq271.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq289.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq290.gif | 1KB | Image | ![]() |
41116_2023_37_Article_IEq291.gif | 1KB | Image | ![]() |
MediaObjects/12888_2023_4925_MOESM1_ESM.docx | 16KB | Other | ![]() |
Fig. 1 | 1438KB | Image | ![]() |
41116_2023_37_Article_IEq177.gif | 1KB | Image | ![]() |
Fig. 7 | 1698KB | Image | ![]() |
Fig. 1 | 68KB | Image | ![]() |
42004_2023_919_Article_IEq130.gif | 1KB | Image | ![]() |
13011_2023_540_Article_IEq1.gif | 1KB | Image | ![]() |
40517_2023_259_Article_IEq2.gif | 1KB | Image | ![]() |
MediaObjects/12888_2023_4789_MOESM1_ESM.docx | 116KB | Other | ![]() |
Fig. 2 | 123KB | Image | ![]() |
13011_2023_540_Article_IEq19.gif | 1KB | Image | ![]() |
40517_2023_259_Article_IEq5.gif | 1KB | Image | ![]() |
40517_2023_259_Article_IEq6.gif | 1KB | Image | ![]() |
40517_2023_259_Article_IEq7.gif | 1KB | Image | ![]() |
40517_2023_259_Article_IEq8.gif | 1KB | Image | ![]() |
MediaObjects/12888_2023_4850_MOESM3_ESM.xlsx | 22KB | Other | ![]() |
MediaObjects/12888_2023_4850_MOESM4_ESM.docx | 29KB | Other | ![]() |
40517_2023_259_Article_IEq11.gif | 1KB | Image | ![]() |
Fig. 2 | 129KB | Image | ![]() |
40517_2023_259_Article_IEq13.gif | 1KB | Image | ![]() |
42004_2023_919_Article_IEq168.gif | 1KB | Image | ![]() |
Fig. 1 | 487KB | Image | ![]() |
Fig. 4 | 1153KB | Image | ![]() |
42004_2023_919_Article_IEq171.gif | 1KB | Image | ![]() |
Fig. 5 | 332KB | Image | ![]() |
Fig. 8 | 728KB | Image | ![]() |
Fig. 5 | 1164KB | Image | ![]() |
MediaObjects/41408_2023_867_MOESM1_ESM.docx | 286KB | Other | ![]() |
40517_2023_259_Article_IEq32.gif | 1KB | Image | ![]() |
Fig. 2 | 297KB | Image | ![]() |
MediaObjects/12888_2023_4850_MOESM7_ESM.docx | 15KB | Other | ![]() |
40517_2023_259_Article_IEq35.gif | 1KB | Image | ![]() |
40517_2023_259_Article_IEq36.gif | 1KB | Image | ![]() |
【 图 表 】
40517_2023_259_Article_IEq36.gif
40517_2023_259_Article_IEq35.gif
Fig. 2
40517_2023_259_Article_IEq32.gif
Fig. 5
Fig. 8
Fig. 5
42004_2023_919_Article_IEq171.gif
Fig. 4
Fig. 1
42004_2023_919_Article_IEq168.gif
40517_2023_259_Article_IEq13.gif
Fig. 2
40517_2023_259_Article_IEq11.gif
40517_2023_259_Article_IEq8.gif
40517_2023_259_Article_IEq7.gif
40517_2023_259_Article_IEq6.gif
40517_2023_259_Article_IEq5.gif
13011_2023_540_Article_IEq19.gif
Fig. 2
40517_2023_259_Article_IEq2.gif
13011_2023_540_Article_IEq1.gif
42004_2023_919_Article_IEq130.gif
Fig. 1
Fig. 7
41116_2023_37_Article_IEq177.gif
Fig. 1
41116_2023_37_Article_IEq291.gif
41116_2023_37_Article_IEq290.gif
41116_2023_37_Article_IEq289.gif
41116_2023_37_Article_IEq271.gif
41116_2023_37_Article_IEq246.gif
41116_2023_37_Article_IEq229.gif
41116_2023_37_Article_IEq228.gif
41116_2023_37_Article_IEq226.gif
41116_2023_37_Article_IEq213.gif
41116_2023_37_Article_IEq197.gif
41116_2023_37_Article_IEq196.gif
Fig. 1
41116_2023_37_Article_IEq194.gif
41116_2023_37_Article_IEq193.gif
Fig. 1
Fig. 1
Fig. 1
41116_2023_37_Article_IEq188.gif
41116_2023_37_Article_IEq174.gif
Fig. 2
41116_2023_37_Article_IEq152.gif
41116_2023_37_Article_IEq150.gif
Fig. 2
41116_2023_37_Article_IEq140.gif
41116_2023_37_Article_IEq128.gif
41116_2023_37_Article_IEq112.gif
Fig. 2
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
- [35]
- [36]
- [37]
- [38]
- [39]
- [40]
- [41]
- [42]
- [43]
- [44]
- [45]
- [46]
- [47]
- [48]
- [49]
- [50]
- [51]
- [52]