I used the website https://kathmandupost.com/ as the source and tried to identify the main topic of the newspaper article on that page.

Libraries used in the task:

  1. NLTK
  2. BeautifulSoup

Modules used in the task:

  1. urllib
  2. html5lib
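
As a rough illustration of how these libraries and modules might fit together, the sketch below downloads a page with urllib and extracts its visible text with BeautifulSoup using the html5lib parser. The function names `fetch_html` and `extract_text`, the User-Agent header, and the timeout are assumptions made for this example; they are not taken from the original code.

```python
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup


def fetch_html(url, timeout=10):
    """Download a page and return its HTML as a string (hypothetical helper)."""
    # Assumption: a browser-like User-Agent is sent because some sites
    # reject requests that do not provide one.
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def extract_text(html, parser="html5lib"):
    """Return the visible text of an HTML document (hypothetical helper)."""
    soup = BeautifulSoup(html, parser)
    # Drop script and style elements, which carry no article text.
    for tag in soup(["script", "style"]):
        tag.decompose()
    # Join the remaining text fragments with single spaces.
    return soup.get_text(separator=" ", strip=True)


if __name__ == "__main__":
    page = fetch_html("https://kathmandupost.com/")
    print(extract_text(page)[:500])
```

The parser is left as a parameter so that Python's built-in `"html.parser"` can be used where html5lib is not installed.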

The NLP steps used are:

  1. Cleaning the data, where we remove stop words such as a, an, the, to, for, etc.
  2. Tokenization, where the text is split into individual words
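
The two steps above can be sketched in plain Python as follows. This is a simplified stand-in, not the original implementation: a regular expression takes the place of NLTK's `word_tokenize`, and a small hand-written set takes the place of NLTK's `stopwords.words("english")`. The text is tokenized first so that stop words can be filtered token by token. Counting word frequencies to surface the topic is an assumption about how the topic was found, and the names `tokenize`, `remove_stop_words`, and `top_words` are hypothetical.

```python
import re
from collections import Counter

# Small illustrative stop-word set (assumption); NLTK's
# stopwords.words("english") provides a much fuller list.
STOP_WORDS = {"a", "an", "the", "to", "for", "of", "in", "on", "is", "and", "about"}


def tokenize(text):
    """Split text into lowercase word tokens (regex stand-in for NLTK)."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stop_words(tokens, stop_words=STOP_WORDS):
    """Keep only the tokens that are not stop words."""
    return [token for token in tokens if token not in stop_words]


def top_words(text, n=5):
    """Return the n most frequent non-stop-word tokens with their counts."""
    tokens = remove_stop_words(tokenize(text))
    return Counter(tokens).most_common(n)
```

Calling `top_words` on the text extracted from the page gives a short list of the most frequent remaining words, which can then be read as a hint at the article's topic.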

NOTE: From the above task we identified that the web page is mainly about Prime Minister Oli speaking about the MCC.