Data Scientist Question:

Explain me why data cleaning plays a vital role in analysis?

Tweet Share WhatsApp

Answer:

Cleaning data from multiple sources to transform it into a format that data analysts or data scientists can work with is a cumbersome process because - as the number of data sources increases, the time take to clean the data increases exponentially due to the number of sources and the volume of data generated in these sources. It might take up to 80% of the time for just cleaning data making it a critical part of analysis task.

Download Data Scientist PDF Read All 55 Data Scientist Questions
Previous QuestionNext Question
Do you know why is naive Bayes so ‘naive’ ?Do you know what is the goal of A/B Testing?