Italian Information Retrieval Workshop 2012.
Error-Correcting Output Codes for Multi-Label Text Categorization
Library, Information and Archival Science; Computer Science
Giuliano Armano1 ; Camelia Chira2 ; Nima Hatami1
Others : http://ceur-ws.org/Vol-835/paper4.pdf PID : 41388
Source: CEUR
【 Abstract 】
When a sample can belong to more than one label from a set of available classes, the classification problem (known as multi-label classification) becomes more complicated. Text data, now widely available on the World Wide Web, is an obvious example of such a task. This paper presents a new method for multi-label text categorization, created by modifying the Error-Correcting Output Coding (ECOC) technique. Using a set of complementary binary classifiers, ECOC has proven to be efficient for multi-class problems. The proposed method, called ML-ECOC, is a first attempt to extend the ECOC algorithm to handle multi-label tasks. Experimental results on the Reuters benchmarks (RCV1-v2) demonstrate the potential of the proposed method for multi-label text categorization.