A Voice Based Assistant Using Google Dialogflow and Machine Learning

Dr. Jaydeep Patil; Atharva Shewale; Ekta Bhushan; Alister Fernandes; Rucha Khartadkar

doi:10.32628/IJSRST218311

Authors

Dr. Jaydeep Patil, Information Technology, AISSMS’s Institute of Information Technology, Pune, Maharashtra, India
Atharva Shewale, Information Technology, AISSMS’s Institute of Information Technology, Pune, Maharashtra, India
Ekta Bhushan, Information Technology, AISSMS’s Institute of Information Technology, Pune, Maharashtra, India
Alister Fernandes, Information Technology, AISSMS’s Institute of Information Technology, Pune, Maharashtra, India
Rucha Khartadkar, Information Technology, AISSMS’s Institute of Information Technology, Pune, Maharashtra, India

Keywords:

Artificial Intelligence, Natural Language Understanding, IBM Watson, Google Dialogflow, Speech Recognition.

Abstract

The Virtual Personal Assistant (VPA) is one of the most successful applications of Artificial Intelligence, giving people a new way to have their work done by a machine. This paper gives a brief survey of the methodologies and concepts used in building a VPA and in using it within different software applications. Speech recognition systems, also known as Automatic Speech Recognition (ASR), play an important role in virtual assistants by letting the user hold a conversation with the system. In this project, we are building a Virtual Personal Assistant, ERAA, which will include the important features needed to assist with a user's needs. Keeping the user experience in mind, we will make it as appealing as other VPAs. Various Natural Language Understanding (NLU) platforms, such as IBM Watson and Google Dialogflow, were studied for this purpose. In our project, we use Google Dialogflow as the NLU platform for the implementation of the software application. The user interface of the application is designed with the Flutter software platform. All the models used for this VPA will be designed to work as efficiently as possible. Some of the common features available in most VPAs will be added. We will implement ERAA as a smartphone application and, as future scope, aim to implement it in the desktop environment. This paper describes the methodologies used to develop the application and presents the outcomes obtained for the features developed within it. It shows how the available natural language understanding platforms can reduce the burden on the user and thereby help in developing a robust software application.
