Example web-scraping Python script. Scrapes data from Wikipedia.

bridgesign/WikiScrap

Wikipedia Data Scraping

A small web-scraping project in Python for ES112: Computing. It is probably quite naive, so feel free to contribute.

The first script, incidents.py, finds the aviation incidents that occurred in a given year. The second, fatality.py, takes two inputs. The first input, n, sets how many of the most fatal incidents to report. The second input, y, selects the incidents that occurred in the last y years.
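
The selection that fatality.py performs can be sketched roughly as follows. This is a minimal illustration, not the script's actual code: the function name `most_fatal`, the `(year, fatalities, description)` record layout, the reading that n and y are combined in one query, and the sample values are all assumptions made for the example.

```python
from datetime import date


def most_fatal(incidents, n, y, current_year=None):
    """Return the n most fatal incidents from the last y years.

    incidents is a list of (year, fatalities, description) tuples.
    This layout is hypothetical; the real scripts may store data differently.
    """
    if current_year is None:
        current_year = date.today().year
    # "Last y years" is taken here to mean the current year and the y - 1 before it.
    recent = [rec for rec in incidents if current_year - rec[0] < y]
    # Sort by fatality count, highest first, and keep the top n.
    return sorted(recent, key=lambda rec: rec[1], reverse=True)[:n]


# Made-up placeholder records, not real Wikipedia data.
sample = [
    (2001, 120, "incident A"),
    (2015, 40, "incident B"),
    (2018, 210, "incident C"),
    (2019, 5, "incident D"),
]

top = most_fatal(sample, n=2, y=5, current_year=2020)
print(top)
```

Passing `current_year` explicitly keeps the result reproducible; leaving it out uses today's date.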

The data needed by the second script takes about an hour to download, so a copy is provided in temp.txt. Delete that file to re-download the data. The download is deliberately kept slow so as not to overload the Wikipedia servers.
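
The cache-then-download behaviour described above might look something like the sketch below. It is only an illustration under stated assumptions: the function `load_data`, the injectable `fetch` callable, the one-second delay, and the JSON cache layout are all hypothetical, and the real temp.txt may use a different format.

```python
import json
import os
import time


def load_data(urls, fetch, cache_path="temp.txt", delay_seconds=1.0):
    """Return {url: page_text}, reusing the cache file when it exists.

    fetch is any callable that takes a URL and returns its text.
    The JSON cache layout is an assumption made for this sketch.
    """
    if os.path.exists(cache_path):
        # Cache hit: skip the slow download entirely.
        with open(cache_path, encoding="utf-8") as fh:
            return json.load(fh)

    data = {}
    for i, url in enumerate(urls):
        if i > 0:
            # Pause between requests so the server is not hammered.
            time.sleep(delay_seconds)
        data[url] = fetch(url)

    # Save the result so the next run can skip the download.
    with open(cache_path, "w", encoding="utf-8") as fh:
        json.dump(data, fh)
    return data
```

With the requests library, `fetch` could be something like `lambda url: requests.get(url, timeout=30).text`. Deleting the cache file forces a fresh download on the next call.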

Requirements:

Python 3 with the requests and BeautifulSoup4 libraries installed.

Wikipedia data: https://en.wikipedia.org/wiki/List_of_accidents_and_incidents_involving_commercial_aircraft and other links on that page.
