Using NLTK tool kit to classify text using predefined libraries
install and import below libraries
|
rohitgopidi |
Reading the training dataset from a CSV, this can also be done from any file format or from any source
|
rohitgopidi |
Once the train data is read, you can tokenize and stem if you prefer. This step can be skipped as tokenization can be done in the next steps while calculating TFIDF
|
rohitgopidi |
Append the tokenized content , can be skipped if not using tokenizing in the previous step
|
rohitgopidi
|
Calculating count vectorizer to find the importance of the text in the document
|
rohitgopidi |
Using Naive_Bayes library to train and predict
|
rohitgopidi |
Test your training model by submitting your new sentence
|
rohitgopidi |
No comments:
Post a Comment