SentiAnalyzer is a “one-stop” solution for analysis of of consumer reviews which includes natural language processing (NLP) of consumer sentiments. The dataset that SentiAnalyzer can work with for now is short text and a binary quantification, e.g. whether consumer hits the like button. The NLP of the sentiment is analyzed through the options of algorithms to compile the words and the machinge learning training model of choice.
Major steps of processing your textual dataset:
Balancing the dataset
Streamlining the text to get a sense of the major keywords
Train different classification algorithms(e.g., SVM,NB,RF,KNN,GBM) and choose best parameters for each one to get the highest possible classification accuracy for an specefic dataset
Choose the best trained classification algorithm for the specific dataset according to different measures (e.g., FScore, Recall, Precision, Accuracy)
Visualize the output of the confusion matrix, that is, the accuracy of the training model in predicting the sentiment of the consumer review