Abstract
The kernel method of machine learning is to transform data from data space to reproducing kernel Hilbert space (RKHS) and then perform machine learning in RKHS, while kernel learning is to select the best RKHS for specific applications and given learning samples. Since RKHS can be generated from kernel functions, kernel learning is to learn kernel functions. At present, the dilemma of kernel learning is that there are few kinds of kernel functions available for learning. The first contribution of this paper is to propose a new framework of kernel functions, in which the given learning samples can be embedded. Moreover, the framework contains a learnable part which can be optimized for specific applications. Symmetric positive definite (SPD) matrix data are more and more common in machine learning. However, SPD data space does not constitute a linear space and dictionary learning involves a lot of linear operations. Therefore, dictionary learning cannot be performed directly on SPD data space. The second contribution of this paper is to apply the proposed framework of kernel functions to dictionary learning of SPD data, in which SPD data are first transformed to the RKHS produced by the proposed framework, and then, both dictionary and the learnable part of the framework are learned simultaneously in RKHS. The experimental results on 4 landmark datasets show that the proposed algorithm performs better than 6 other algorithms published recently in top academic journals.
Similar content being viewed by others
References
Turk M, Pentland AP (1991) Face recognition using eigenfaces. In: IEEE conference on computer vision and pattern recognition, pp 586–591
Belhumeur PN, Hespanha JP, Kriengman DJ (1997) Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720
Wan M, Li M, Yang G et al (2014) Feature extraction using two-dimensional maximum embedding difference. In: Information sciences, pp 55–69
Wan M, Yang G, Gai S et al (2017) Two-dimensional discriminant locality preserving projections (2DDLPP) and its application to feature extraction via fuzzy set. In: Multimed tools appl, vol 76, pp 355–371. https://doi.org/10.1007/s11042-015-3057-8
Liu Z, Lai Z, Ou W et al (2020) Structured optimal graph based sparse feature extraction for semi-supervised learning. Sig Process 170:107456
Pennec X, Fillard P, Ayache N (2006) A Riemannian framework for tensor computing. Int J Comput Vis 66(1):41–66
Said S, Bombrun L, Berthoumieu Y, Manton JH (2017) Riemannian Gaussian distributions on the space of symmetric positive definite matrices. IEEE Trans Inform Theory 63(4):2153–2170
Wang R, Guo H, Davis LS et al (2012) Covariance discriminative learning: a natural and efficient approach to image set classification. In: IEEE conference on computer vision and pattern recognition, pp 2496–2503
Guo K, Ishwar P, Konrad J (2013) Action recognition from video using feature covariance matrices. IEEE Trans Image Process 22(6):2479–2494
Tuzel O, Porikli F, Meer P (2008) Pedestrian detection via classification on Riemannian manifolds. IEEE Trans Pattern Anal Mach Intell 30(10):1713–1727. https://doi.org/10.1109/TPAMI.2008.75
Zhang T, Ghanem B, Liu S et al (2013) Robust visual tracking via structured multi-task sparse learning. Int J Comput Vis 101(2):367–383
Yang M, Zhang L (2010) Gabor feature based sparse representation for face recognition with Gabor occlusion dictionary. In: ECCV, pp 448–461
Wan M et al (2016) Local graph embedding based on maximum margin criterion via fuzzy set. In: Fuzzy sets and systems, vol 318.JUL.1, pp 120–131
Liu Z, Wang J, Liu G et al (2013) Discriminative low-rank preserving projection for dimensionality reduction. In: Applied soft computing, vol 85, pp 105768, 2019. J. S. Turner, Log-Euclidean kernels for sparse representation and dictionary learning, ICCV, pp 1601–1608
Elad M (2010) Sparse and redundant representations–from theory to applications in signal and image processing. Springer Publishing Company, Berlin
Shawetaylor J, Cristianini N (2004) Kernel methods for pattern analysis. Cambridge University Press, New York
Harandi M, Sanderson C, Hartley R, Lovell B (2012) Sparse coding and dictionary learning for symmetric positive definite matrices: a kernel approach. In: ECCV, pp 216–229
Jayasumana S, Hartley R, Salzmann M, Li H, Harandi M (2013) Kernel methods on the Riemannian manifold of symmetric positive definite matrices. In: CVPR, pp 73–80
Vemulapalli R, Pillai JK, Chellappa R (2013) Kernel learning for extrinsic classification of manifold features. In: IEEE conference on computer vision and pattern recognition (CVPR)
Randen T, Husoy JH (1999) Filtering for texture classification: a comparative study. IEEE Trans Pattern Anal Mach Intell 21(4):291–310
Goh A, Vidal R (2008) Clustering and dimensionality reduction on Riemannian manifolds. In: IEEE conference on computer vision and pattern recognition, pp 1–7
Harandi M, Sanderson C, Wiliem A et al (2012) Kernel analysis over Riemannian manifolds for visual recognition of actions, pedestrians and textures
Jayasumana S, Hartley R, Salzmann M et al (2015) Kernel methods on riemannian manifolds with Gaussian RBF Kernels. In: IEEE transactions on pattern analysis & machine intelligence, vol 37, no 12, pp 1–1
Wu Y, Jia Y, Li P, Zhang J, Yuan J (2015) Manifold kernel sparse representation of symmetric positive-definite matrices and its applications. IEEE Trans Image Process 24(11):3729–3741
Harandi M, Hartley R, Lovell B et al (2014) Sparse coding on symmetric positive definite manifolds using bregman divergences. IEEE Trans Neural Netw Learn Syst 27(6):1294–1306
Sivalingam R, Boley D, Morellas V et al (2014) Tensor sparse coding for positive definite matrices. IEEE Trans Pattern Anal Mach Intell 36(3):592–605
Sivalingam R, Boley D, Morellas V et al (2015) Tensor dictionary learning for positive definite matrices. IEEE Trans Image Process 24(11):4592
Cherian A, Sra S (2016) Riemannian dictionary learning and sparse coding for positive definite matrices. IEEE Trans Neural Netw Learn Syst 28(11):2859–2871
Zhang X, Li W, Hu W et al (2013) Block covariance based L1 tracker with a subtle template dictionary. Pattern Recogn 46(7):1750–1761
Guo K, Ishwar P, Konrad J (2010) Action recognition using sparse representation on covariance manifolds of optical flow. In: IEEE international conference on advanced video and signal based surveillance, pp 188–195
Yuan C, Hu W, Li X et al (2010) Human action recognition under log-euclidean riemannian metric. In: Asian conference on computer vision, pp 343–353
Li P, Wang Q, Zuo W et al (2013) Log-Euclidean kernels for sparse representation and dictionary learning. In: ICCV, pp 1601–1608
Das A, Nair MS, Peter SD (2019) Sparse representation over learned dictionaries on the Riemannian manifold for automated grading of nuclear pleomorphism in breast cancer. IEEE Trans Image Process 28(3):1248–1260. https://doi.org/10.1109/TIP.2018.2877337
Li D, Chen L, Wang F (2016) Semantic and neighborhood preserving dictionary learning for symmetric positive-definite matrices. In: IEEE 13th international conference on signal processing (ICSP), Chengdu, pp 654–658. https://doi.org/10.1109/ICSP.2016.7877913
[Online]. Available: http://spams-devel.gforge.inria.fr/
[Online]. Available: http://cvxr.com/cvx/
Absil PA, Baker CG, Gallivan KA (2007) Trust-region methods on Riemannian manifolds. Found Comput Math 7(3):303–330
Absil PA, Mahony R, Sepulchre R (2008) Optimization algorithms on matrix manifolds. Princeton University Press, Princeton
[Online]. Available: https://www.manopt.org/
Tosato D, Spera M et al (2013) Characterizing humans on Riemannian manifolds. IEEE Trans Pattern Anal Mach Intell 35(8):1972–1984
Sra S, Cherian A (2011) Generalized dictionary learning for symmetric positive definite matrices with application to nearest neighbor retrieval. In: Proceedings of European conference on machine learning, pp 318–332
Phillips PJ, Moon H, Rizvi SA et al (2000) The FERET evaluation methodology for face-recognition algorithms. IEEE Trans Pattern Anal Mach Intell 22(10):1090–1104
Guo K, Ishwar P, Konrad J (2013) Action recognition from video using feature covariance matrices. IEEE Trans Image Process 22(6):2479–2494
Wright J, Yang A, Ganesh A, Sastry S, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 32(2):210–227
Yang M, Zhang L (2010) Gabor feature based sparse representation for face recognition with Gabor occlusion dictionary. In: ECCV, pp 448–461
Schwartz WR, Davis LS (2009) Learning discriminative appearance based models using partial least squares. In: Brazilian Symposium on Computer Graphics and Image Processing. Brazil, Oct, Rio de Janeiro, pp 322–329
Huy PA, Andrzej C (2011) Extended Hamiltonian learning on Riemannian manifolds: theoretical aspects. IEEE Trans Neural Netw 22(5):687–700
Funding
Funding was provided by National Natural Science Foundation of China (Grant No. 61773022).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Feng, W., Ma, Z., Zhuang, R. et al. The framework of learnable kernel function and its application to dictionary learning of SPD data. Pattern Anal Applic 24, 723–739 (2021). https://doi.org/10.1007/s10044-020-00941-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-020-00941-1