PROMOTING QUALITY DATA AND CLEANSING TECHNIQUES IN DATA ANALYTICS BASED ON SMART BUSINESS INTELLIGENCE TECHNOLOGY

Authors

  • VINAYAK PILLAI Data Analytics and AI, Denken Solutions (University of Texas Arlington Alumni), Dallas Fort Worth Metroplex, Texas, United States of America Author

Keywords:

DATA Process, Data Extraction, IT Organization, Transformation, Data Warehouse, Distributed Data

Abstract

In advancement of recent era, business organization is developing drastically as there is an increase number of Information Technology (IT) as they gives wide impact on the national and international development as they all managing the vast and complex number of data. In order to analyses the performance of data, those vast data has to be processed and analysed. In managing the data, Extract, Transform and Load (DATA) process is applied and stored in the data warehouse as repository in order to take effective distributed based decision as it faces the problem of time consuming process. As the data warehouse takes the data source in the distributed manner as it is difficult to integrate those data. To overcome the above challenges, the Modified DATA based Data warehouse is proposed as it is applied with modified multi-dimensional based bottom up approach as it carried out various activities to deeply analysis the data based on content profiling. In the process of cleaning, confirming and delivery of data depends on the various data sources. Then algorithm makes three various process of data extraction as it provides data cleaning and conforming to create the conform steps based on the source data analysis using data hierarchy structure. Until the data sources gets integrated based on the various distributed database, DATA steps are performed. In the analysis, modified DATA is applied on any real time organization as it takes various source table to compare the actual and expected results. Then various metadata testing is performed on various documentation to makes the process of transformation effective.

References

Loshin, David. Business Intelligence: The Savvy Manager’s Guide. 2nd Editio, Elsevier Science, 2012, https://www.google.co.id/books/edi-tion/Business_Intelligence/L7SLNlS1 ao8C?hl=id&gbpv=1&dq=business+intelligence+is&printsec =frontcover.

Qalam, Yance Ibnu. “Hubungan Data Warehouse Dengan Business Intel-ligence Dan DATA.” Kepo.Co, 2020, https://kepo.co/hubungan-data-ware-house-dengan-business-intelligence-dan-Data/.

Wijaya, Wayan M. Teknologi Big Data: Sistem Canggih Di Balik Google, Ya-Hoo!, Facebook, IBM (Teori Hingga Tutorial). Nilacakra, 2019, https://www.google.co.id/books /edition/Teknologi_Big_Data/VeNDwAAQBAJ?hl=id&gbpv=1&dq=big+data+ada-lah&printsec=frontcover.

Nordeen, Alex. Learn Data Warehousing in 24 Hours. Guru99, 2020, https://www.google.co.id/books/edition/Learn_Data_Warehousing_in_24_Hours/wgf9DwAAQBAJ ?hl=id&gbpv=0.

Prasetia, I. Putu Widia, and I. Nyoman Hary Kurniawan. “Implementasi DATA (Extract, Transform, Load) Pada Data Warehouse Penjualan Menggunakan Tools Pentaho.” TIERS Information Technology Journal, vol. 2, no. 1, 2021, pp. 39–47, doi:10.38043/tiers.v2i1.2844.

Udayana, Gede Acintia, I Made Yoga Mahendra, I Kadek Anom Suka-wirasa, Gde Deva Dimastawan Saputra, and Ida Bagus Made Mahendra. “Implementasi Data Warehouse Dan Penerapannya Pada PHI-Minimart Dengan Menggunakan Tools Pentaho Dan Power BI.” JELIKU (Jurnal Elektronik Ilmu Komputer Udayana), vol. 10, no. 1, 2021, p. 163, doi:10.24843/jlk.2021.v10.i01.p19.

Marbun, Ivan Rivaldo, and Ramos Somya. “Perancangan Data Ware-house Untuk Data Transaksi Penjualan Menggunakan Schema Snowflake Studi Kasus : Online Market Dataset.” Universitas Kristen Satya Wacana, vol. 5, no. 1, 2021, pp. 87–91.

Saeed K Rahimi, F.S Haug. Distributed Database Management System–A Practical Approach. New Jersey: John Wiley & Sons Inc. 2010:1.

M.T Ozsu, P. Valduriez. Principles of Distributed Database–Third Edition. New York: Springer. 2011:3.

T.Connolly, C. Begg. Database System. A Practical Approach to Design, Implementation and Management. Fourth Edition. Essex: Pearson Education. 2005: 695.

Kimball, Ralph., Ross, Margy. The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, Third Edition. Indianapolis: John Wiley & Sons Inc. 2013:38.

Igor Mekterovic, Ljiljana Brkic, and Mirta Baranovic. Improving the DATA process of higher education information system data warehouse. Proceedings of the 9th WSEAS International Conference on Applied Informatics and Communications (AIC'09). Moscow.2009: 265-270.

Vishal Gour, S.S. Sarangdevot, G.S. Tanwar, A. Sharma. Improve Performance of Extract, Transform and Load (DATA) in Data Warehouse. Int. Journal on Comp. Sci. and Eng. 2010; 2(3):786-789

Abid Ahmad, Muhammad Zubair. Using Distributed Database Technology to simplify the DATA Component of Data Warehouse. Proceedings of WSEAS International Conference on Applied Computer Science (ACS'10), Iwate. 2010; 61-65.

Tute E, Steiner J. Modeling of DATA-Processes and Processed Information in Clinical Data Warehousing. Stud Health Technol Inform. 2018; 248 204-211. PMID: 29726438.

Sonali Vyas & Pragya Vaishnav. A comparative study of various DATA process and their testing techniques in data warehouse, Journal of Statistics and Management Systems. 2017; 20(4): 753-763.

C. Adamson. Mastering Data Warehouse Aggregates. Solutions for

Star Schema Performance. Indianapolis: Wiley Publishing Inc. 2006:20.

W.D Back, N. Goodman,J Hyde. Mondrian in Action. Open Source Business Analytics. New York: Manning Publications Co. 2014:195.

A. Meadows, A.S. Pulvirenti, M.C. Roldan. Pentaho Data Integration Cookbook. Birmingham: Packt Publishing, 2013:11.

Rahm, E., H. H. Do, Data cleaning: Problems and current approaches. IEEE Data Eng. Bull. 2000; 23(4): 313.

R. Bouman, J.V. Dongen. Pentaho Solutions: Business Intelligence and Data Warehousing with Pentaho and MySQL. Indianapolis: Wiley Publishing, Inc. 2009:160.

Downloads

Published

2024-10-22

How to Cite

VINAYAK PILLAI. (2024). PROMOTING QUALITY DATA AND CLEANSING TECHNIQUES IN DATA ANALYTICS BASED ON SMART BUSINESS INTELLIGENCE TECHNOLOGY. INTERNATIONAL JOURNAL OF ENGINEERING AND TECHNOLOGY RESEARCH (IJETR), 9(2), 519-531. https://lib-index.com/index.php/IJETR/article/view/IJETR_09_02_045