Python project with Jupyter notebook
First, the search result pages will be scraped for some basic info. This info is rendered directly in the HTML, so it can be extracted with Beautiful Soup alone.
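A minimal sketch of this step, assuming hypothetical URL and CSS class names (the real selectors depend on the target site's markup):

```python
import requests
from bs4 import BeautifulSoup

SEARCH_URL = "https://example.com/jobs?q=python&page=1"  # placeholder URL

response = requests.get(SEARCH_URL, timeout=10)
response.raise_for_status()
soup = BeautifulSoup(response.text, "html.parser")

jobs = []
# "job-card", "job-title", etc. are placeholder class names.
for card in soup.find_all("div", class_="job-card"):
    jobs.append({
        "title": card.find("h2", class_="job-title").get_text(strip=True),
        "company": card.find("a", class_="company").get_text(strip=True),
        "link": card.find("a", class_="job-link")["href"],
        "location": card.find("span", class_="location").get_text(strip=True),
        "contract": card.find("span", class_="contract-type").get_text(strip=True),
        "skills": [s.get_text(strip=True) for s in card.find_all("a", class_="skill")],
    })
```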
Then, for each search result, the link extracted in the previous step is followed and additional data (like the salary) is scraped from inside the post page. This data can't be extracted with Beautiful Soup alone because it is filled in by JavaScript, so Selenium will be used to drive an automated browser, load the page, and scrape the data once it has rendered.
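A sketch of this per-post step, using an explicit wait so the JavaScript-rendered element has time to appear; the "salary" class name is a placeholder:

```python
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

# Example input; in the project this is the list built in the previous step.
jobs = [{"link": "https://example.com/jobs/123"}]

driver = webdriver.Firefox()  # needs GeckoDriver, see the note below
try:
    for job in jobs:
        driver.get(job["link"])
        # Wait until the JavaScript-filled salary element is present
        # ("salary" is a placeholder class name).
        salary_el = WebDriverWait(driver, timeout=10).until(
            EC.presence_of_element_located((By.CLASS_NAME, "salary"))
        )
        job["salary"] = salary_el.text
finally:
    driver.quit()
```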
Finally, all the extracted data will be grouped into a table and saved to a local CSV file.
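One way to do this final step, assuming pandas is available (the file name `jobs.csv` is arbitrary):

```python
import pandas as pd

# 'jobs' is the list of dicts built in the two steps above.
df = pd.DataFrame(jobs)
df.to_csv("jobs.csv", index=False)
```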
- Extracted info from the search result pages:
- Job title
- Company
- Post link
- Job location
- Contract type
- Skills required
- Extracted info from inside each post page:
- Salary
Note: To run Selenium with Firefox, the GeckoDriver executable must be installed and reachable (e.g. on the PATH), as in the sketch below.
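If GeckoDriver is not on the PATH, Selenium can be pointed at the executable directly (the path below is an example):

```python
from selenium import webdriver
from selenium.webdriver.firefox.service import Service

# Use a local GeckoDriver executable instead of relying on the PATH.
driver = webdriver.Firefox(service=Service(executable_path="./geckodriver.exe"))
```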