Lead Data Scientist Question:

What is TF-IDF vectorization?

Lead Data Scientist Interview Question

Answer:

tf-idf, short for term frequency-inverse document frequency, is a numerical statistic intended to reflect how important a word is to a document in a collection or corpus. It is often used as a weighting factor in information retrieval and text mining. The tf-idf value increases in proportion to the number of times a word appears in the document, but is offset by the number of documents in the corpus that contain the word, which adjusts for the fact that some words appear more frequently in general.
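
To make the "vectorization" part concrete: each document becomes a vector with one tf-idf weight per vocabulary term. The sketch below is only one common variant, assuming whitespace tokenization, tf as the term's share of the document's tokens, and idf as log(N / df), where N is the number of documents and df is the number of documents containing the term. The function name tfidf_vectors is hypothetical, and libraries such as scikit-learn apply smoothing and normalization, so their numbers will differ.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, vectors) using tf = count / doc length, idf = log(N / df)."""
    # Assumption: lowercase + whitespace split is enough tokenization for a sketch.
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({term for tokens in tokenized for term in tokens})
    n_docs = len(tokenized)

    # Document frequency: how many documents contain each term at least once.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    idf = {term: math.log(n_docs / df[term]) for term in vocab}

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        if total == 0:
            # An empty document gets an all-zero vector.
            vectors.append([0.0] * len(vocab))
            continue
        vectors.append([(counts[term] / total) * idf[term] for term in vocab])
    return vocab, vectors


docs = ["the cat sat", "the dog sat", "the cat ran"]
vocab, vectors = tfidf_vectors(docs)
for doc, vec in zip(docs, vectors):
    print(doc, [round(w, 3) for w in vec])
```

In this toy corpus "the" appears in every document, so its idf is log(1) = 0 and its weight vanishes, while "dog" and "ran" appear in only one document each and receive the largest weights.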
