A novel multi-modal biometric authentication system with enhanced feature extraction using advanced mapped real transform

  • Authors

    • Lakshmi S Panicker School of Engineering, Cochin University of Science and Technology
    • R. Gopikakumari School of Engineering, CUSAT
    2025-01-07
    https://doi.org/10.14419/tn22a349
  • Audio-Visual Biometrics; Feature Level Fusion; SMRT.
  • Abstract

    In contexts where security is a top priority, biometric authentication, which is the act of identifying a person using physiological or behavioral modalities—acquires critical importance. Facial image and speech are the two most popular biometrics for personal verification. This paper introduces a new audio-visual biometric recognition system for authenticating humans, using the sequency-mapped real transform for feature extraction. The performance of this new system is compared with previously published results of other systems that only use audio, video, or both. The proposed system demonstrates better performance metrics by utilizing the complementary strengths of both modalities and by employing a new transform that cuts down on computational complexity.

  • References

    1. N. a. M. M.-W. a. C. J.-T. a. N. L. a. M.-W. M. a. J.-T. C. Li, "DNN-Driven Mixture of PLDA for Robust Speaker Verification," IEEE/ACM Trans. Audio, Speech and Lang. Proc., vol. 25, no. 6, p. 1371–1383, June 2017. https://doi.org/10.1109/TASLP.2017.2692304.
    2. C. S. X. Z. Y. Y. H. T. S. Fumin Shen, "Face image classification by pooling raw features," Pattern Recognition, vol. 54, pp. 94-103, 2016. https://doi.org/10.1016/j.patcog.2016.01.010.
    3. A. M. K. Javad Khodadoust, "Fingerprint indexing based on minutiae pairs and convex core point," Pattern Recognition, vol. 67, pp. 110-126, 2017. https://doi.org/10.1016/j.patcog.2017.01.022.
    4. L. N. Alessandra Lumini, "Overview of the combination of biometric matchers," Information Fusion,, vol. 33, pp. 71-85, 2017. https://doi.org/10.1016/j.inffus.2016.05.003.
    5. F. M. H. B. Jean-Philippe Thiran, Multimodal Signal Processing Theory and Applications for Human–Computer Interaction, Academic Press, 2010.
    6. K. M. R. G. B. Manju, "Prostate Disease Diagnosis from CT Images Using GA Optimized SMRT Based Texture Features,," in Procedia Computer Science, 2015.
    7. T. T. a. R. G. P. P. Mini, "Feature Vector Selection of Fusion of MFCC and SMRT Coefficients for SVM Classifier Based Speech Recognition System," in 2018 8th International Symposium on Embedded Computing and System Design (ISED), Cochin, India, 2018.
    8. R. S. A. R. Maneet Singh, "A comprehensive overview of biometric fusion," Information Fusion, vol. 52, pp. 187-205, 2019. https://doi.org/10.1016/j.inffus.2018.12.003.
    9. T. a. K. M. a. K. M. S. H. a. B. R. Mahboob, "Speaker identification using gmm with mfcc," International Journal of Computer Science Issues (IJCSI), vol. 12, 2015.
    10. S. L. H. Y. J.-H. K. G. K. S. Park, "Partially Occluded Facial Image Retrieval Based on a Similarity Measurement," Mathematical Prob-lems in Engineering, 2015. https://doi.org/10.1155/2015/217568.
    11. W. W. a. J. Chen, "Occlusion robust face recognition based on mask learning," in 2017 IEEE International Conference on Image Pro-cessing (ICIP), Beijing, China, 2017.
    12. C. Sanderson, "Biometric Person Recognition: Face, Speech, and Fusion," in VDM Verlag, 2008.
    13. R. Gopikakumari, Investigations on the development of an ANN model and visual manipulation approach for 2D DFT computation in image processing [PhD Thesis, Cochin University of Science and Technology, Kochi, India]. http://dyuthi.cusat.ac.in/purl/, 1998.
    14. R. C. &. G. R. Roy, "A new transform for 2-D signal representation (MRT) and some of its properties.," in 2004 International Confer-ence on Signal Processing and Communications, SPCOM, 2, 363–367, 2004.
    15. R. C. Roy, Development of a New Transform : Mrt [PhD Thesis, Cochin University of Science and Technology, Kochi, India], 2009.
    16. V. Bhadran, Development & Implementation of Visual Approach and Parallel Distributed Architecture for 2-D DFT & UMRT compu-tation [PhD Thesis, Cochin University of Science and Technology, Kochi, India], 2009.
    17. V. L. &. G. R. Jaya, " Sequency-based mapped real transform: properties and applications.," Signal, Image and Video Processing, vol. 11, no. 8, p. 1551–1558., 2017. https://doi.org/10.1007/s11760-017-1119-2.
    18. B. a. J. V. a. M. K. a. G. R. Manju, "8 × 8 SMRT Based Texture Descriptors," Lecture Notes on Software Engineering, vol. 3, pp. 295-298, 2015. https://doi.org/10.7763/LNSE.2015.V3.207.
    19. T. T. R. G. P.P. Mini, "EEG based direct speech BCI system using a fusion of SMRT and MFCC/LPCC features with ANN classifier," Biomedical Signal Processing and Control, vol. 68, p. 102625, 2021. https://doi.org/10.1016/j.bspc.2021.102625.
    20. P. J. M. Viola, "Robust Real-Time Face Detection," International Journal of Computer Vision, vol. 57, p. 137–154, 2004. https://doi.org/10.1023/B:VISI.0000013087.49260.fb.
    21. M. P. a. G. M. Foody, "Feature Selection for Classification of Hyperspectral Data by SVM," IEEE Transactions on Geoscience and Remote Sensing, vol. 48, no. 5, pp. 2297-2307, 2010. https://doi.org/10.1109/TGRS.2009.2039484.
    22. L. Breiman, "Random Forests," Machine Learning , vol. 45, p. 5–32, 2001. https://doi.org/10.1023/A:1010933404324.
    23. R. A. K. G. Raghavendra R, "Multimodal person verification system using face and speech," in Procedia Computer Science, 2010. https://doi.org/10.1016/j.procs.2010.11.023.
    24. D. a. H. K. J. a. N. S. S. SHAH, "ROBUST MULTIMODAL PERSON RECOGNITION USING LOW-COMPLEXITY AUDIO-VISUAL FEATURE FUSION APPROACHES," International Journal of Semantic Computing, vol. 4, pp. 155-179, 2010. https://doi.org/10.1142/S1793351X10000985.
    25. M. W. Girija Chetty, "Robust face-voice based speaker identity verification using multilevel fusion," Image and Vision Computing, vol. 26, pp. Pages 1249-1260, 2008. https://doi.org/10.1016/j.imavis.2008.02.009.
    26. B. Y. S. Palanivel, "Multimodal person authentication using speech, face and visual speech," Computer Vision and Image Understand-ing, vol. 109, pp. 44-55, 2008. https://doi.org/10.1016/j.cviu.2006.11.013.
    27. H.Kasban, "A Robust Multimodal Bio metric Authentication Scheme with Voice and Face Recognition," Arab Journal of Nuclear Sci-ences and Applications, vol. 50, no. 3, pp. 120-130, 2017.
  • Downloads

  • How to Cite

    Panicker, L. S., & Gopikakumari, R. (2025). A novel multi-modal biometric authentication system with enhanced feature extraction using advanced mapped real transform. International Journal of Engineering & Technology, 14(1), 1-6. https://doi.org/10.14419/tn22a349