EVOLUTION OF DATA SCIENCE METHODOLOGIES FROM STATISTICAL APPROACHES TO MACHINE LEARNING PARADIGMS

Authors

  • Snehal Narendrakumar India Author

Keywords:

Data Science, Traditional Statistics, Machine Learning, Big Data, Predictive Analytics, Computational Advancements, Neural Networks, Interpretability, Ethical Concerns, Healthcare Analytics, Financial Modelling

Abstract

This paper explores the evolution of data science methodologies, from traditional statistical approaches to modern machine learning paradigms. It begins by defining data science and outlining its scope, highlighting the increasing importance of data-driven decision-making in various sectors. The historical context of data science is reviewed, with a focus on the early use of statistical methods for data analysis. Key limitations of these traditional approaches, including their reliance on linear models and interpretability issues, are discussed. The paper then examines the transition to machine learning, emphasizing the impact of computational advancements and the rise of big data on the development of more sophisticated algorithms. Key differences between traditional statistics and machine learning models are outlined, with particular attention given to the strengths and weaknesses of each methodology. Finally, the paper presents the drivers of this methodological shift and considers applications in diverse domains, such as healthcare and finance. Challenges, including the interpretability of machine learning models and ethical concerns in data science, are also addressed. The paper concludes with a discussion of future directions for data science methodologies.

 

References

Fisher, R. A. (1925). Statistical Methods for Research Workers. Oliver & Boyd.

Stigler, S. M. (1986). The History of Statistics: The Measurement of Uncertainty Before 1900. Harvard University Press.

Box, G. E. P., & Draper, N. R. (1987). Empirical Model-Building and Response Surfaces. Wiley.

Nie, N. H., Hull, C. H., Jenkins, J. G., Steinbrenner, K., & Bent, D. H. (1975). SPSS: Statistical Package for the Social Sciences. McGraw-Hill.

Breiman, L. (2001). Statistical Modeling: The Two Cultures. Statistical Science, 16(3), 199-231.

Bayyapu, S. (2021). Bridging the gap: Overcoming data, technological, and human roadblocks to AI-driven healthcare transformation. Journal of Management (JOM), 8(1), 7-14.

Rosenblatt, F. (1958). The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain. Psychological Review, 65(6), 386-408.

Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning Representations by Back-Propagating Errors. Nature, 323, 533-536.

Cortes, C., & Vapnik, V. (1995). Support-Vector Networks. Machine Learning, 20, 273-297.

Bayyapu, S. (2022). Optimizing IT sourcing in healthcare: Balancing control, cost, and innovation. International Journal of Computer Applications, 3(1), 14-20.

Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5-32.

Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.

Domingos, P. (2012). A Few Useful Things to Know About Machine Learning. Communications of the ACM, 55(10), 78-87.

Chen, M., Mao, S., & Liu, Y. (2014). Big Data: A Survey. Mobile Networks and Applications, 19(2), 171-209.

Bayyapu, S. (2020). Blockchain healthcare: Redefining data ownership and trust in the medical ecosystem. International Journal of Advanced Research in Engineering and Technology (IJARET), 11(11), 2748-2755.

Tukey, J. W. (1977). Exploratory Data Analysis. Addison-Wesley.

Box, G. E. P., & Cox, D. R. (1964). An Analysis of Transformations. Journal of the Royal Statistical Society: Series B (Methodological), 26(2), 211-243.

Friedman, J. H. (1991). Multivariate Adaptive Regression Splines. The Annals of Statistics, 19(1), 1-67.

Downloads

Published

2024-04-21

How to Cite

EVOLUTION OF DATA SCIENCE METHODOLOGIES FROM STATISTICAL APPROACHES TO MACHINE LEARNING PARADIGMS. (2024). INTERNATIONAL JOURNAL OF COMPUTER SCIENCE REVIEW (IJCSR), 1(1), 1-11. https://lib-index.com/index.php/IJCSR/article/view/IJCSR_01_01_001