Lips Reading Spoken Arabic Word Based on The Geometric Shape Features of The Lip

Authors

  • Prof. Khalil I. Alsaif  Tech. Department of Computer Engineering, Hadba Collage University Iraq
  • Nagham Salim Allella  New and Renewable Energy Department, Collage of Science, Mosul University, Iraq

DOI:

https://doi.org/10.32628/IJSRST2310164

Keywords:

Lips Reading, Geometrical Feature, Physiological Lips Feature, Landmark

Abstract

This research aims to determine how to guess individual Arabic words based on visual cues such as lip movement and form extraction. A three-step process was developed to accomplish this goal, the first of which is the identification of the face. Second, the lip region is targeted, and finally, the movement of the speaking lips is analyzed to determine what is being said. This research proposes an alternative approach that uses biometric engineering features extracted from the changing shapes of the lips to make the best guess at the word being spoken. This method uses a set of 20 landmarks around the mouth's outer and inner edges, the high point of an open mouth, and the point where the upper and lower lips meet. The proposed study took a different approach by looking at the shape of the lips and how the points on the upper and lower lips move, as well as figuring out what was being said (the outer and the inner).

References

  1. Oghbaie, M., Sabaghi, A., Hashemifard, K., & Akbari, M. (2021). ADVANCES AND CHALLENGES IN DEEP LIP READING. Computer Vision and Image Understanding(manuscript number:CVIU-21-732).
  2. Sheng, C., Kuang, G., Bai, L., Hou, C., Guo, Y., Xu, X., Pietikäinen, M., & Liu, L. (2022). Deep Learning for Visual Speech Analysis: A Survey. http://arxiv.org/abs/2205.10839
  3. Wang, C. (2020). Multi-grained spatio-temporal modeling for lip-reading. In 30th British Machine Vision Conference 2019, BMVC 2019.
  4. Wrobel, K., Doroz, R., Porwik, P., Naruniec, J., & Kowalski, M. (2017). Using a Probabilistic Neural Network for lip-based biometric verification. Engineering Applications of Artificial Intelligence.
  5. Çetingül, H. E., Yemez, Y., Erzin, E., & Tekalp, A. M. (2006). Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading. IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006.
  6. G¨ocke, R., & Asthana, A. (2008). A Comparative Study of 2D and 3D Lip Tracking Methods for AV ASR. AVISA.
  7. Eting¨ul, H. E. C., Yemez, Y., Erzin, E., & Tekalp, A. M. (2005). ROBUST LIP-MOTION FEATURES FOR SPEAKER IDENTIFICATION. IEEE.ICASSP. doi: 10.1109/ICASSP.2005.1415162 · Source: IEEE Xplore
  8. Kapkar, P. P., & Bharkad, S. D. (2019). Lip feature extraction and movement recognition methods: A review. In International Journal of Scientific and Technology Research (Vol. 8, Issue 8, pp. 50–55).
  9. Jia, X., & Sun, Y. (2013). A Kind of Visual Speech Feature with the Geometric and Local Inner Texture Description (p. 877~889). TELKOMNIKA, Vol. 11, No. 2.
  10. Brahme, A., & Bhadade, U. (2016). Lip Detection and Lip Geometric Feature Extraction using Constrained Local Model for Spoken Language Identification using Visual Speech Recognition. In Indian Journal of Science and Technology (Vol. 9, Issue 32). https://doi.org/10.17485/ijst/2016/v9i32/98737
  11. Li, R., Tian, J., & Chua, M. C. H. (2018). Facial expression classification using salient pattern driven integrated geometric and textual features. Springer Science+Business Media, LLC, part of Springer Nature 2018 Abstract. https://doi.org/10.1007/s11042-018-6133-z

Downloads

Published

2023-02-28

Issue

Section

Research Articles

How to Cite

[1]
Prof. Khalil I. Alsaif, Nagham Salim Allella "Lips Reading Spoken Arabic Word Based on The Geometric Shape Features of The Lip" International Journal of Scientific Research in Science and Technology(IJSRST), Online ISSN : 2395-602X, Print ISSN : 2395-6011,Volume 10, Issue 1, pp.624-634, January-February-2023. Available at doi : https://doi.org/10.32628/IJSRST2310164