Several classifiers are trained on forum posts labelled as either neutral or insulting.
- unicode cleanup
- lemmatization
- stopwords
- TF-IDF
- POS tagging
- Naive Bayes
- SVM
- Random forests
- F1
- CLF
- nltk
- sklearn
- pandas
- numpy
The notebook is fully documented; the conclusion and some extra thoughts are included there.
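
For orientation, here is a minimal sketch of how the pieces listed above could fit together with sklearn. It is not the notebook's code: the toy posts, labels, and the `evaluate` helper are hypothetical, and lemmatization and POS tagging are left out to keep the example self-contained (sklearn's built-in accent stripping and English stopword list stand in for the cleanup steps).

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import f1_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy data, not the project's dataset: 1 = insulting, 0 = neutral.
train_texts = [
    "you are a complete idiot",
    "what a stupid moron",
    "shut up you pathetic loser",
    "nobody likes you, idiot",
    "thanks for sharing this article",
    "I agree with the previous comment",
    "does anyone know when the update ships",
    "great explanation, very helpful",
]
train_labels = [1, 1, 1, 1, 0, 0, 0, 0]
test_texts = ["you stupid loser", "thanks, very helpful comment"]
test_labels = [1, 0]

# The three classifier families named in the list above.
classifiers = {
    "naive_bayes": MultinomialNB(),
    "svm": LinearSVC(),
    "random_forest": RandomForestClassifier(n_estimators=50, random_state=0),
}


def evaluate(clf):
    """Fit a TF-IDF + classifier pipeline and return F1 on the test posts."""
    pipeline = make_pipeline(
        # Strip accents and drop English stopwords before TF-IDF weighting.
        TfidfVectorizer(strip_accents="unicode", stop_words="english"),
        clf,
    )
    pipeline.fit(train_texts, train_labels)
    predictions = pipeline.predict(test_texts)
    return f1_score(test_labels, predictions, zero_division=0)


scores = {name: evaluate(clf) for name, clf in classifiers.items()}
for name, score in scores.items():
    print(f"{name}: F1 = {score:.2f}")
```

Wrapping the vectorizer and classifier in one pipeline means the TF-IDF vocabulary is learned from the training posts only, which avoids leaking test data into the features.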