×

Automatic learning of cost functions for graph edit distance. (English) Zbl 1142.68492

Summary: Graph matching and graph edit distance have become important tools in structural pattern recognition. The graph edit distance concept allows us to measure the structural similarity of attributed graphs in an error-tolerant way. The key idea is to model graph variations by structural distortion operations. As one of its main constraints, however, the edit distance requires the adequate definition of edit cost functions, which eventually determine which graphs are considered similar. In the past, these cost functions were usually defined in a manual fashion, which is highly prone to errors. The present paper proposes a method to automatically learn cost functions from a labeled sample set of graphs. To this end, we formulate the graph edit process in a stochastic context and perform a maximum likelihood parameter estimation of the distribution of edit operations. The underlying distortion model is learned using an Expectation Maximization algorithm. From this model we finally derive the desired cost functions. In a series of experiments we demonstrate the learning effect of the proposed method and provide a performance comparison to other models.

MSC:

68T10 Pattern recognition, speech recognition
68R10 Graph theory (including graph drawing) in computer science
Full Text: DOI

References:

[1] Special Section on Graph algorithms and computer vision, IEEE Transactions on Pattern Analysis and Machine Intelligence 23 (10) (2001) 1040-1151.; Special Section on Graph algorithms and computer vision, IEEE Transactions on Pattern Analysis and Machine Intelligence 23 (10) (2001) 1040-1151.
[2] Special Issue on graph based representations, Pattern Recognition Letters 24 (8) (2003) 1033-1122.; Special Issue on graph based representations, Pattern Recognition Letters 24 (8) (2003) 1033-1122. · Zbl 1028.00533
[3] Special Issue on Graph matching in pattern recognition and computer vision, International Journal of Pattern Recognition and Artificial Intelligence 18 (3) (2004) 261-517.; Special Issue on Graph matching in pattern recognition and computer vision, International Journal of Pattern Recognition and Artificial Intelligence 18 (3) (2004) 261-517.
[4] Bunke, H.; Allermann, G., Inexact graph matching for structural pattern recognition, Pattern Recognition Letters, 1, 245-253 (1983) · Zbl 0509.68100
[5] Bunke, H.; Shearer, K., A graph distance metric based on the maximal common subgraph, Pattern Recognition Letters, 19, 3, 255-259 (1998) · Zbl 0905.68128
[6] Christmas, W. J.; Kittler, J.; Petrou, M., Structural matching in computer vision using probabilistic relaxation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 17, 8, 749-764 (1995)
[7] Conte, D.; Foggia, P.; Sansone, C.; Vento, M., Thirty years of graph matching in pattern recognition, International Journal of Pattern Recognition and Artificial Intelligence, 18, 3, 265-298 (2004)
[8] Cox, T.; Cox, M., Multidimensional Scaling (1994), Chapman and Hall · Zbl 0853.62047
[9] Cross, A. D.J.; Hancock, E. R., Graph matching with a dual-step EM algorithm, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 11, 1236-1253 (1998)
[10] Dempster, A. P.; Laird, N. M.; Rubin, D. B., Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, 39, 1, 1-38 (1977) · Zbl 0364.62022
[11] Fernandez, M.-L.; Valiente, G., A graph distance metric combining maximum common subgraph and minimum common supergraph, Pattern Recognition Letters, 22, 6-7, 753-758 (2001) · Zbl 1010.68889
[12] Finch, A. M.; Wilson, R. C.; Hancock, E. R., Symbolic graph matching with the EM algorithm, Pattern Recognition, 31, 11, 1777-1790 (1998)
[13] Hubert, L.; Schultz, J., Quadratic assignment as a general data analysis strategy, British Journal of Mathematical and Statistical Psychology, 29, 190-241 (1976) · Zbl 0356.92027
[14] Luo, B.; Hancock, E., Structural graph matching using the EM algorithm and singular value decomposition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 23, 10, 1120-1136 (2001)
[15] Maltoni, D.; Maio, D.; Jain, A. K.; Prabhakar, S., Handbook of Fingerprint Recognition (2003), Springer · Zbl 1027.68114
[16] Messmer, B. T.; Bunke, H., A new algorithm for error-tolerant subgraph isomorphism detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 5, 493-504 (1998)
[17] Neuhaus, M.; Bunke, H., A graph matching based approach to fingerprint classification using directional variance, (Proceedings of the 5th International Conference on Audio- and Video-Based Biometric Person Authentication. Proceedings of the 5th International Conference on Audio- and Video-Based Biometric Person Authentication, LNCS, vol. 3546 (2005), Springer), 191-200
[18] Neuhaus, M.; Bunke, H., Self-organizing maps for learning the edit costs in graph matching, IEEE Transactions on Systems, Man, and Cybernetics (Part B), 35, 3, 503-514 (2005)
[19] Redner, R. A.; Walker, H. F., Mixture densities, maximum likelihood and the EM algorithm, Society for Industrial and Applied Mathematics (SIAM) Review, 26, 2, 195-239 (1984) · Zbl 0536.62021
[20] Ristad, E.; Yianilos, P., Learning string edit distance, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 5, 522-532 (1998)
[21] Sanfeliu, A.; Fu, K. S., A distance measure between attributed relational graphs for pattern recognition, IEEE Transactions on Systems, Man, and Cybernetics (Part B), 13, 3, 353-363 (1983) · Zbl 0511.68060
[22] Vlassis, N.; Likas, A., A greedy EM algorithm for Gaussian mixture learning, Neural Processing Letters, 15, 1, 77-87 (2002) · Zbl 1008.68734
[23] Wagner, R. A.; Fischer, M. J., The string-to-string correction problem, Journal of the Association for Computer Machinery, 21, 1, 168-173 (1974) · Zbl 0278.68032
[24] Wallis, W. D.; Shoubridge, P.; Kraetzl, M.; Ray, D., Graph distances using graph union, Pattern Recognition Letters, 22, 6, 701-704 (2001) · Zbl 1010.68896
[25] C.I. Watson, C.L. Wilson, NIST Special Database 4, Fingerprint Database, March 1992.; C.I. Watson, C.L. Wilson, NIST Special Database 4, Fingerprint Database, March 1992.
[26] Wilson, R. C.; Hancock, E., Structural matching by discrete relaxation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 19, 6, 634-648 (1997)
[27] Wu, C. F.J., On the convergence properties of the EM algorithm, The Annals of Statistics, 11, 1, 95-103 (1983) · Zbl 0517.62035
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.