Sentiment data cleansing #27

jaARke · 2023-10-04T18:28:54Z

Currently, sentiment data cleansing is very rudimentary. To make our results more reliable, we should try to add functions that vet our retrieved data according to the following principles:

No duplicate news headlines
News headlines should be in English
News headlines should talk explicitly and exclusively about the stock being queried

The cleansing should occur in utils/sentiment/headlines.py
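
As a starting point for discussion, here is a minimal sketch of how the three principles above might be applied in sequence. It uses only the standard library. All function names, parameters, and the `other_names` list are hypothetical and do not reflect the current contents of `utils/sentiment/headlines.py`. The English check is a crude heuristic (share of ASCII letters), so it rejects non-Latin scripts but would let other Latin-script languages through; a dedicated language-detection library would likely be needed for anything stricter.

```python
import re


def _normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    return " ".join(re.sub(r"[^\w\s]", " ", text.lower()).split())


def drop_duplicates(headlines):
    """Keep the first occurrence of each headline, ignoring case and punctuation."""
    seen = set()
    unique = []
    for headline in headlines:
        key = _normalize(headline)
        if key and key not in seen:
            seen.add(key)
            unique.append(headline)
    return unique


def looks_english(headline: str, min_ascii_ratio: float = 0.9) -> bool:
    """Crude heuristic (assumption): most letters are ASCII."""
    letters = [c for c in headline if c.isalpha()]
    if not letters:
        return False
    ascii_letters = sum(1 for c in letters if c.isascii())
    return ascii_letters / len(letters) >= min_ascii_ratio


def _contains(text: str, term: str) -> bool:
    """Whole-word match of a normalized term inside normalized text."""
    term = _normalize(term)
    return bool(term) and re.search(rf"\b{re.escape(term)}\b", text) is not None


def mentions_only(headline: str, ticker: str, company: str, other_names=()) -> bool:
    """True if the headline names the queried stock and none of the other names."""
    text = _normalize(headline)
    if not (_contains(text, ticker) or _contains(text, company)):
        return False
    return not any(_contains(text, name) for name in other_names)


def clean_headlines(headlines, ticker, company, other_names=()):
    """Apply de-duplication, the language check, and the relevance check in order."""
    return [
        h
        for h in drop_duplicates(headlines)
        if looks_english(h) and mentions_only(h, ticker, company, other_names)
    ]


if __name__ == "__main__":
    sample = [
        "Apple shares surge after earnings",
        "apple shares surge after earnings!",
        "Apple and Microsoft both rally",
        "Tesla deliveries beat estimates",
    ]
    print(clean_headlines(sample, "AAPL", "Apple", ["Microsoft", "MSFT", "Tesla", "TSLA"]))
```

The "exclusively" requirement is approximated here by rejecting any headline that also names a company or ticker from a caller-supplied list; where that list would come from is an open question.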

jaARke added the enhancement, good first issue, help wanted, and qol labels on Oct 4, 2023.
