期刊论文详细信息
Brazilian Computer Society. Journal
ALICAT: a customized approach to item selection process in computerized adaptive testing
article
Victor M. G. Jatobá1  Jorge S. Farias2  Valdinei Freire1  André S. Ruela1  Karina V. Delgado1 
[1] Department of Information Systems, School of Arts, Sciences and Humanities, University of São Paulo;Exact Science and Earth Department, University of Bahia State (UNEB)
关键词: Computerized adaptive testing;    Item response theory;    Fisher information;    Item selection rule;    Item selection method;   
DOI  :  10.1186/s13173-020-00098-z
来源: Springer U K
PDF
【 摘 要 】

Computerized adaptive testing (CAT) based on item response theory allows more accurate assessments with fewer questions than the classic paper and pencil (P&P) test. Nonetheless, the CAT construction involves some key questions that, when done properly, can further improve the accuracy and efficiency in estimating the examinees’ abilities. One of the main questions is in regard to choosing the item selection rule (ISR). The classic CAT makes exclusive use of one ISR. However, these rules have differences depending on the examinees’ ability level and on the CAT stage. Thus, the objective of this work is to reduce the dichotomous test size which is inserted in a classic CAT with no significant loss of accuracy in the estimation of the examinee’s ability level. For this purpose, we analyze the ISR performance and then build a personalized item selection process in CAT considering the use of more than one rule. The case study in Mathematics and its Technologies test of the ENEM 2012 shows that the Kullback-Leibler information with a posterior distribution (KLP) has better performance in the examinees’ ability estimation when compared with Fisher information (F), Kullback-Leibler information (KL), maximum likelihood weighted information (MLWI), and maximum posterior weighted information (MPWI) rules. Previous results in the literature show that CAT using KLP was able to reduce this test size by 46.6% from the full size of 45 items with no significant loss of accuracy in estimating the examinees’ ability level. In this work, we observe that the F and the MLWI rules performed better on early CAT stages to estimate examinees’ proficiency level with extreme negative and positive values, respectively. With this information, we were able to reduce the same test by 53.3% using the personalized item selection process, called ALICAT, which includes the best rules working together.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO202106300002969ZK.pdf 2061KB PDF download
  文献评价指标  
  下载次数:6次 浏览次数:8次