Data Scientist Question:
Explain why data cleaning plays a vital role in analysis.
Answer:
Data from multiple sources has to be cleaned and transformed into a format that data analysts or data scientists can work with, and this is a cumbersome process: as the number of data sources increases, the time taken to clean the data grows sharply, driven by both the number of sources and the volume of data each one generates. Cleaning alone might take up to 80% of the total time, which makes it a critical part of any analysis task.
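To make the cost concrete, here is a minimal sketch of what cleaning two sources can involve. Everything in it is an assumption made up for illustration: the records, the field names (`id`, `name`, `signup`, `spend`), the two date formats, and the helper functions. It uses only the Python standard library and shows three typical chores: normalizing text, parsing inconsistent dates and numbers, and merging duplicate records.

```python
from datetime import datetime

# Hypothetical records from two sources that follow different conventions.
source_a = [
    {"id": "1", "name": " Alice ", "signup": "2023-01-05", "spend": "100.50"},
    {"id": "2", "name": "BOB", "signup": "2023-02-10", "spend": ""},
]
source_b = [
    {"id": 2, "name": "bob", "signup": "10/02/2023", "spend": "75"},
    {"id": 3, "name": "Carol", "signup": "15/03/2023", "spend": "n/a"},
]

# Assumed date formats: ISO in source A, day/month/year in source B.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y")


def parse_date(text):
    """Try each known format; return None if none of them matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    return None


def parse_amount(text):
    """Convert to float; treat blanks and placeholders such as 'n/a' as missing."""
    try:
        return float(text)
    except (TypeError, ValueError):
        return None


def clean_record(raw):
    """Bring one raw record into a single consistent schema."""
    return {
        "id": int(raw["id"]),
        "name": raw["name"].strip().title(),
        "signup": parse_date(raw["signup"]),
        "spend": parse_amount(raw["spend"]),
    }


def merge_sources(*sources):
    """Clean every record and merge duplicates that share the same id."""
    merged = {}
    for source in sources:
        for raw in source:
            record = clean_record(raw)
            existing = merged.get(record["id"])
            if existing is None:
                merged[record["id"]] = record
            else:
                # Keep the first value seen; fill only the gaps from later sources.
                for key, value in record.items():
                    if existing[key] is None:
                        existing[key] = value
    return [merged[key] for key in sorted(merged)]


cleaned = merge_sources(source_a, source_b)
for row in cleaned:
    print(row)
```

Each additional source tends to bring its own formats and placeholder values, so rules like `DATE_FORMATS` and `parse_amount` keep growing, which is one reason the cleaning effort rises with the number of sources.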