×

On the scalability of ordered multi-class ROC analysis. (English) Zbl 1452.62473

Summary: Receiver operating characteristics (ROC) analysis provides a way to select possibly optimal models for discriminating two kinds of objects without the need of specifying the cost or class distribution. It is nowadays established as a standard analysis tool in different domains, including medical decision making, pattern recognition and machine learning. Recently, an extension to the ordered multi-class case has been proposed, in which the concept of a ROC curve is generalized to an \(r\)-dimensional surface for \(r\) ordered categories, and the volume under this ROC surface (VUS) measures the overall power of a model to classify objects of the various categories. However, the computation of this criterion as well as the \(U\)-statistics estimators of its variance and covariance for two models is believed to be complex. New algorithms to compute VUS and its (co)variance estimator are presented. In particular, the volume under the ROC surface can be found very efficiently with a simple dynamic program dominated by a single sorting operation on the data set. For the variance and covariance, the respective estimators are reformulated as a series of recurrent functions over layered data graphs and subsequently these functions are rapidly evaluated with a dynamic program. Simulation experiments confirm that the presented algorithms scale well with respect to the size of the data set and the number of categories. For example, the volume under the ROC surface could be rapidly computed on very large data sets of more than 500 000 instances, while a naive implementation spent much more time on data sets of size less than 1000.

MSC:

62H30 Classification and discrimination; cluster analysis (statistical aspects)
62-08 Computational methods for problems pertaining to statistics
68T05 Learning and adaptive systems in artificial intelligence

Software:

HandTill2001
Full Text: DOI

References:

[1] Agarwal, S.; Graepel, T.; Herbrich, R.; Har-Peled, S.; Roth, D., Generalization bounds for the area under the ROC curve, Journal of Machine Learning Research, 6, 393-425 (2005) · Zbl 1222.68129
[2] Agresti, A., Categorical Data Analysis (2002), John Wiley and Sons · Zbl 1018.62002
[3] Chu, W.; Ghahramani, Z., Gaussian processes for ordinal regression, Journal of Machine Learning Research, 6, 1019-1041 (2005) · Zbl 1222.68170
[4] Chu, W., Keerthi, S., 2005, New approaches to support vector ordinal regression, In: Proceedings of the International Conference on Machine Learning, Bonn, Germany, pp. 321-328; Chu, W., Keerthi, S., 2005, New approaches to support vector ordinal regression, In: Proceedings of the International Conference on Machine Learning, Bonn, Germany, pp. 321-328
[5] Cortes, C.; Mohri, M., AUC optimization versus error rate minimization, (Advances in Neural Information Processing Systems, Vancouver, Canada, vol. 16 (2003), The MIT Press)
[6] Cortes, C.; Mohri, M., Confidence intervals for the area under the ROC curve, (Advances in Neural Information Processing Systems, NIPS 2004, Vancouver, Canada, vol. 17 (2004), MIT Press), 305-312
[7] Crammer, K., Singer, Y., 2001. Pranking with ranking, In: Proceedings of the Conference on Neural Information Processing Systems, Vancouver, Canada, pp. 641-647; Crammer, K., Singer, Y., 2001. Pranking with ranking, In: Proceedings of the Conference on Neural Information Processing Systems, Vancouver, Canada, pp. 641-647
[8] Dreiseitl, S.; Ohno-Machado, L.; Binder, M., Comparing three-class diagnostic tests by three-way ROC analysis, Medical Decision Making, 20, 323-331 (2000)
[9] Fawcett, T., An introduction to ROC analysis, Pattern Recognition Letters, 27, 8, 861-874 (2006)
[10] Ferri, C., Hernandez-Orallo, J., Salido, M.A., 2003. Volume under ROC surface for multi-class problems, In: Proceedings of the European Conference on Machine Learning, Dubrovnik, Croatia, pp. 108-120; Ferri, C., Hernandez-Orallo, J., Salido, M.A., 2003. Volume under ROC surface for multi-class problems, In: Proceedings of the European Conference on Machine Learning, Dubrovnik, Croatia, pp. 108-120 · Zbl 1257.68125
[11] Fieldsend, J.; Everson, M., Multi-class ROC analysis from a multi-objective optimization perspective, Pattern Recognition Letters, 27, 918-927 (2006)
[12] Hand, D.; Till, R., A simple generalization of the area under the ROC curve for multiple class problems, Machine Learning, 45, 171-186 (2001) · Zbl 1007.68180
[13] Hanley, J.; McNeil, B., The meaning and use of the area under a receiver operating characteristics ROC curve, Radiology, 143, 29-36 (1982)
[14] Hanley, J.; McNeil, B., A method of comparing receiver operating characteristics curves derived from the same class, Radiology, 148, 839-843 (1983)
[15] Hastie, T.; Tibshirani, R.; Friedman, J., The Elements of Statistical Learning (2001), Springer · Zbl 0973.62007
[16] Herschtal, A., Raskutti, B., 2005. Optimizing area under the ROC curve using gradient descent. In: Proceedings of the International Conference on Machine Learning, Bonn, Germany, pp. 49-57; Herschtal, A., Raskutti, B., 2005. Optimizing area under the ROC curve using gradient descent. In: Proceedings of the International Conference on Machine Learning, Bonn, Germany, pp. 49-57
[17] Kramer, S.; Widmer, G.; Pfahringer, B.; Degroeve, M., Prediction of ordinal classes using regression trees, Fundamenta Informaticae, 24, 1-15 (2000)
[18] Nakas, C.; Yiannoutsos, C., Ordered multiple-class ROC analysis with continuous measurements, Statistics in Medicine, 22, 3437-3449 (2004)
[19] Provost, F.; Fawcett, T., Robust classification for inprecise environments, Machine Learning, 42, 203-231 (2001) · Zbl 0969.68126
[20] Shashua, A.; Levin, A., Ranking with large margin principle: Two approaches, (Proceedings of the International Conference on Neural Information Processing Systems, Vancouver, Canada (2003), The MIT Press), 937-944
[21] Waegeman, W., De Baets, B., Boullart, L., 2006. A comparison of different performance measures for ordinal regression, In: Proceedings of the Third International Workshop on ROC Analysis, part of the International Conference on Machine Learning, Pittsburgh, PA, USA, pp. 63-69; Waegeman, W., De Baets, B., Boullart, L., 2006. A comparison of different performance measures for ordinal regression, In: Proceedings of the Third International Workshop on ROC Analysis, part of the International Conference on Machine Learning, Pittsburgh, PA, USA, pp. 63-69
[22] Waegeman, W.; De Baets, B.; Boullart, L., ROC analysis in ordinal regression learning, Pattern Recognition Letters, 29, 1-9 (2008)
[23] Yan, L., Dodier, R., Mozer, M., Wolniewiecz, R., 2003. Optimizing classifier performance via an approximation to the Wilcoxon-Mann-Whitney statistic, In: Proceedings of the International Conference on Machine Learning, Washington DC, USA, pp. 848-855; Yan, L., Dodier, R., Mozer, M., Wolniewiecz, R., 2003. Optimizing classifier performance via an approximation to the Wilcoxon-Mann-Whitney statistic, In: Proceedings of the International Conference on Machine Learning, Washington DC, USA, pp. 848-855
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.