I have taken a website link https://kathmandupost.com/ in order to find the topic of this newspaper article.
- NLTK
- BeautifulSoup
MODULES USED:
- urllib
- html5lib
- Cleaning the data
Where we remove the stop words like (a,an,the,to,for,etc)
- Tokenization
NOTE: From the above task we identified that the web page speaks about Prime-minister Oli is Speaking about MCC