Bag of Words and TF–IDF

In the previous chapters we talked about text as data and about the fact that a computer cannot read words "like a human". For it, text is a set of symbols, numbers, and statistics. In this section we will cover two basic, yet still extremely useful approaches to representing text as numbers: Bag of Words and TF–IDF.