
In this blog post I walk through, at a high level, how to build an NLP data pipeline from scratch using the NLTK and Gensim packages.

It touches briefly on how to gather and process text, but focuses mainly on one use case, categorizing texts, which it covers with detailed code and explanation.