As per Kaggle notebook
Extract, Process and Predict the sentiment of Twitter tweets referencing South Africa's top 5 banks.
Multiple models were created and tested to predict sentiment of tweets as either postive, neutral or negative
Note:
- Multiple datasets were used as training of the model, with Sentiment 140 being the main contributor of tweets (1.6 million)
- Proof of concept version was completed using a pretrained model (Textblob)
- Sarcasm - "thanks FNB, now I cant open my account cause its frozen"
- Comparison of entities - "Capitec is the worst, you should use Standard Bank"
- Training data on non-South African tweets - Jargon and lingo is different
- Language usage - multiple languages are used in South Africa
Reference notebooks: