Thesis details
Solving a mixed-integer programming formulation of a classification model with misclassification limits
Brooks, J. Paul; Industrial and Systems Engineering
University: Georgia Institute of Technology
Department: Industrial and Systems Engineering
Keywords: Machine learning; Mathematical programming; Classification; Integer programming; Discriminant analysis
Others: https://smartech.gatech.edu/bitstream/1853/7473/1/brooks_j_paul_200512_phd.pdf
United States | English
Source: SMARTech Repository
【 Abstract 】

Classification, the development of rules for the allocation of observations to one or more groups, is a fundamental problem in machine learning and has been applied to many problems in medicine and business. We consider aspects of a classification model developed by Gallagher, Lee, and Patterson that is based on a result by Anderson. The model seeks to maximize the probability of correct G-group classification, subject to limits on misclassification probabilities. The mixed-integer programming formulation of the model is an empirical method for estimating the parameters of an optimal classification rule, which are identified as coefficients of linear functions by Anderson. The model is shown to be a consistent method for estimating the parameters of the optimal solution to the problem of maximizing the probability of correct classification subject to limits on inter-group misclassification probabilities. A polynomial time algorithm is described for two-group instances. The method is NP-complete for a general number of groups, and an approximation is formulated as a mixed-integer program (MIP). The MIP is difficult to solve due to the formulation of constraints wherein certain variables are equal to the maximum of a set of linear functions. These constraints are conducive to an ill-conditioned coefficient matrix. Methods for generating edges of the conflict graph and conflict hypergraphs are discussed. The conflict graph is employed for finding cuts in a branch-and-bound framework. This technique and others lead to improvement in solution time over industry-standard software on instances generated by real-world data. The classification accuracy of the model in relation to standard classification methods on real-world and simulated data is also noted.

【 Preview 】
Attachments
Files | Size | Format | View
Solving a mixed-integer programming formulation of a classification model with misclassification limits | 1102 KB | PDF | download