FROM RAW DATA TO ACTIONABLE INSIGHTS: A HOLISTIC SURVEY OF DATA SCIENCE PROCESSES

Authors

  • Nivedhaa N. Author

Keywords:

Data Science, Raw Data, Actionable Insights, Decision-Making, Survey, Holistic Processes, Data Collection, Ethics, Data Cleaning, Preprocessing, EDA, Pattern Discovery, Machine Learning, Practical Application

Abstract

Data science is integral in converting raw data into actionable insights for informed decision-making across domains. This survey explores the holistic data science processes, from collection to deriving insights. It introduces fundamental concepts, highlights ethical considerations in data collection, and covers data cleaning and preprocessing. The paper extensively explores Exploratory Data Analysis (EDA), emphasizing pattern discovery and anomaly detection. Model development and machine learning are central, emphasizing practical applications for extracting patterns. Detailed coverage of interpreting results and drawing insights demonstrates actionable information derivation. Case studies illustrate how insights inform decision-making. The survey addresses current data science challenges, speculating on future trends and advancements. A valuable resource for practitioners and researchers, it provides a comprehensive understanding of the journey from raw data to actionable insights in data science.

   

References

Chen, H., & Wang, X. (2020). Data Quality Improvement Techniques for Data Science Applications. International Journal of Data Mining and Knowledge Management, 14(2), 3-12.

Gupta, M., & Sharma, A. (2017). Data Normalization Techniques in Data Science: A Comprehensive Review. International Journal of Computer Science and Information Technology, 9(3), 1941-1945.

Jones, M., Brown, A., & Kumar, R. (2019). Ethical Considerations in Data Acquisition for Data Science Projects. Journal of Big Data, 8(1), 1-12.

Smith, J. (2018). Data Source Diversity in Data Science: A Critical Analysis. International Journal of Information Systems and Technology, 7(2), 189-202.

Tukey, J. W. (1977). Exploratory Data Analysis. Addison-Wesley.

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer.

Mitchell, T. M. (2018). Machine Learning (2nd ed.). McGraw-Hill.

Friedman, J. H., Hastie, T., & Tibshirani, R. (2001). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer.

Shmueli, G. (2010). To Explain Is to Predict: Model Interpretation in the Information Age. John Wiley & Sons.

Dhar, V. (2013). Algorithmic Bias in Data Science: A Call for Action. Big Data, 1(1), 3-4.

Provost, F., & Fawcett, T. (2013). Data Science for Business: Forecasting, Risk Assessment, and Optimization. O'Reilly Media.

Davenport, T. H., & Patil, D. (2012). Data Scientist: The Sexiest Job of the 21st Century. Harvard Business Review, 90(10), 70-82.

Manyika, M., Chui, M., & Brown, M. (2011). Big Data: The Next Frontier for Innovation. McKinsey Global Institute.

S.B. Vinay, "Data Scientist Competencies and Skill Assessment: A Comprehensive Framework," International Journal of Data Scientist (IJDST), vol. 1, issue 1, pp. 1-14, 2024.

S. B. Vinay, "AI and machine learning integration with AWS SageMaker: current trends and future prospects", International Journal of Artificial Intelligence Tools (IJAIT), vol. 1, issue 1, pp. 1-24, 2024.

S. B. Vinay, Application of Artificial Intelligence (AI) In E-Publishing Industry in India, International Journal of Computer Engineering and Technology (IJCET) 14(1), 2023, pp. 7-12 doi: https://doi.org/10.17605/OSF.IO/4D5M7

S. B. Vinay, Application of Artificial Intelligence (AI) In School Teaching and Learning Process- Review and Analysis, International Journal of Information Technology and Management Information Systems (IJITMIS), 14(1), 2023, pp. 1-5 doi: https://doi.org/10.17605/OSF.IO/AERNV

S. B. Vinay& S. Balasubramanian, A Comparative Study of Convolutional Neural Networks and Cybernetic Approaches on CIFAR-10 Dataset, International Journal of Machine Learning and Cybernetics (IJMLC), 1(1), 2023, pp. 1-12 doi: https://doi.org/0.17605/OSF.IO/QY32B

Kim, Y., & Lee, J. (2016). Exploratory Data Analysis for Data Science Beginners: A Practical Guide. Springer.

Rohit Khankhoje, "Bridging the Gap: Selenium and RPA for Unparalleled Automation," International Journal on Cybernetics & Informatics (IJCI), vol. 13, no. 1, pp. 23-29, 2024. DOI: 10.5121/ijci.2024.130103.

Downloads

Published

2024-01-20

How to Cite

FROM RAW DATA TO ACTIONABLE INSIGHTS: A HOLISTIC SURVEY OF DATA SCIENCE PROCESSES. (2024). INTERNATIONAL JOURNAL OF DATA SCIENCE (IJDS), 1(1), 1-16. https://lib-index.com/index.php/IJDS/article/view/IJDS_01_01_001