Data Mining, Spidering and Analysis with Python

Authors

  • Tejas Kamble  Department of Computer Engineering, Zeal College of Engineering and Research, Pune, Maharashtra, India
  • Srujan Garde  Department of Computer Engineering, Zeal College of Engineering and Research, Pune, Maharashtra, India
  • Samruddhi Kadam   Department of Computer Engineering, Zeal College of Engineering and Research, Pune, Maharashtra, India

Keywords:

Data mining, Data spidering, Data wrangling, Data Analysis, Data Manipulation

Abstract

Data mining and analytics have played an important role in knowledge discovery and decision making/supports in the process industry over the past several decades. As a computational engine to data mining and analytics, machine learning serves as basic tools for information extraction, data pattern recognition and predictions. From the perspective of machine learning, this paper provides a review on existing data mining and analytics applications in the process industry over the past several decades. The state of the art of data mining and analytics are reviewed through eight unsupervised learning and ten supervised learning algorithms, as well as the application status of semi-supervised learning algorithms. Several perspectives are highlighted and discussed for future researches on data mining and analytics in the process industry.

References

  1. G. S. L. M. J. A. Berry, Mastering Data Mining, New York: Wiley, 2000.
  2. H. M. a. P. S. D. Hand, Principles of Data Mining., Cambridge, : MA:MIT Press., 2001.
  3. J. F. R. O. a. C. S. L. Breiman, Classification and Regression Trees., Wadsworth, 1984.
  4. G. S. L. M. J. A. Berry, Data Mining Techniques, New York: Wiley, 1997.

Downloads

Published

2022-03-30

Issue

Section

Research Articles

How to Cite

[1]
Tejas Kamble, Srujan Garde, Samruddhi Kadam "Data Mining, Spidering and Analysis with Python" International Journal of Scientific Research in Science and Technology(IJSRST), Online ISSN : 2395-602X, Print ISSN : 2395-6011,Volume 9, Issue 2, pp.440-447, March-April-2022.