When creating a bag of words, each word becomes a column in a sparse matrix. This can lead to a huge matrix. Python libraries can filter the most relevant words and ignore the rest to improve performance.
Machine Learning A-Z
Thoughts and flashcards derived from Udemy’s course Machine Learning A-Z.