Annals of Emerging Technologies in Computing
The Theory of Probabilistic Hierarchical Learning for Classification
article
Ursani, Ahsan Ahmad [1]; Ursani, Ziauddin [2]
[1] Mehran University of Engineering and Technology; University of Bristol
Keywords: Classification; Hierarchical learning; Probabilistic learning; Set partitioning; Supervised learning
DOI: 10.33166/AETiC.2023.01.005
Subject classification: Electronic and Electrical Engineering
Source: International Association for Educators and Researchers (IAER)
【 Abstract 】
Giving computers the ability to classify has remained at the core of the field of artificial intelligence. Its applications now reach nearly every walk of life, including healthcare, education, defence, economics, linguistics, sociology, literature, transportation, agriculture, and industry. To our understanding, most of the problems we face can be formulated as classification problems. Therefore, any novel contribution in this area has great potential for real-world application. This paper proposes a novel way of learning from classification datasets, i.e., hierarchical learning through set partitioning. The theory of probabilistic hierarchical learning for classification has evolved through several works, with its scope widening in each. The theory demonstrates that the classification of any dataset can be learnt by generating a hierarchy of learnt models, each capable of classifying a disjoint subset of the training set. The basic assertion behind the theory is that accurate classification of complex datasets can be achieved through the hierarchical application of low-complexity models. In this paper, the theory is redefined and revised based on four mathematical principles, namely the principle of successive bifurcation, the principle of two-tier discrimination, the principle of class membership, and the principle of selective data normalization. The algorithmic implementation of each principle is also discussed. The scope of the approach is further widened to include ten popular real-world datasets in its test base. The approach not only produces accurate models of these datasets but also achieves above 95% accuracy on average in terms of generalising ability, which is competitive with the contemporary literature.
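The central idea of the abstract, a hierarchy of low-complexity models in which each model classifies a disjoint subset of the training set, can be illustrated with a minimal sketch. This is not the authors' algorithm and does not implement the four principles named above. It assumes, purely for illustration, that each level is a single-feature threshold rule, and that a level "claims" the largest group of remaining training samples it can label without error. The names `fit_stump`, `fit_hierarchy`, and `predict` are hypothetical.

```python
from collections import Counter


def fit_stump(X, y):
    """Return the threshold rule covering the most samples of a single class.

    Illustrative assumption: a "low-complexity model" is one threshold on one
    feature. Returns (feature, threshold, side, label, covered_indices), or
    None when no rule covers a single-class group.
    """
    best = None
    for f in range(len(X[0])):
        for t in sorted({row[f] for row in X}):
            for side in ("le", "gt"):
                idx = [i for i, row in enumerate(X)
                       if (row[f] <= t) == (side == "le")]
                if not idx:
                    continue
                if len({y[i] for i in idx}) == 1 and (
                        best is None or len(idx) > len(best[4])):
                    best = (f, t, side, y[idx[0]], idx)
    return best


def fit_hierarchy(X, y):
    """Build a list of rules; each rule handles a disjoint subset of (X, y)."""
    X, y = list(X), list(y)
    levels = []
    while X:
        stump = fit_stump(X, y)
        if stump is None:
            # No error-free rule exists (e.g. identical points with different
            # labels): fall back to the majority class and stop.
            levels.append((None, None, None, Counter(y).most_common(1)[0][0]))
            break
        f, t, side, label, idx = stump
        levels.append((f, t, side, label))
        covered = set(idx)
        # Partition: samples claimed by this level are removed from the pool.
        X = [row for i, row in enumerate(X) if i not in covered]
        y = [lab for i, lab in enumerate(y) if i not in covered]
    return levels


def predict(levels, row):
    """Walk the hierarchy top-down; the first matching rule decides."""
    for f, t, side, label in levels:
        if f is None or (row[f] <= t) == (side == "le"):
            return label
    return levels[-1][3]  # outside every region: reuse the last level's label


# Toy data that no single threshold can separate.
X_demo = [[0], [1], [2], [3], [4], [5]]
y_demo = ["a", "a", "b", "b", "a", "a"]
demo_levels = fit_hierarchy(X_demo, y_demo)
demo_predictions = [predict(demo_levels, row) for row in X_demo]
```

On this toy set, no single threshold separates the two classes, but a short chain of threshold rules reproduces every training label, which is the intuition behind applying simple models hierarchically. The sketch says nothing about generalisation to unseen data and assumes a non-empty training set.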
【 License 】
CC BY