Skip to content

raghav1saboo/Instagram-automated-web-scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Instagram-automated-web-scraping

web scraped instagram by automation using selenium

Table of Contents

Introduction

The Instagram Automated Web Scraping project is designed to automatically collect data from Instagram, such as user profiles, posts, comments, and other relevant information. This tool leverages web scraping techniques to extract data efficiently from the platform.

Objective

The primary objectives of this project are:

  • Create a robust and automated web scraping tool for Instagram.
  • Gather user-specific data without violating Instagram's terms of service.
  • Provide an easy-to-use solution for data analysts and researchers.

Features

  • Scraping Instagram profiles and retrieving basic user information.
  • Extracting user posts, captions, and associated metadata.
  • Collecting comments from posts.
  • Saving the scraped data in a structured format for further analysis.

Technologies Used

  • Python 3
  • Selenium
  • BeautifulSoup
  • Requests
  • Pandas

(Feel free to add or modify the list of technologies based on the tools and libraries used in your project)

Installation

To run the Instagram Automated Web Scraping tool locally, follow these steps:

  1. Clone the repository to your local machine using the following command:

    git clone https://github.com/raghav1saboo/Instagram-automated-web-scraping.git
    
  2. Navigate to the project directory:

    cd Instagram-automated-web-scraping
    
  3. Install the required dependencies using the following command:

    pip install -r requirements.txt
    
  4. Download and install the appropriate web driver for your browser (e.g., Chrome or Firefox).

Usage

  1. Update the config.py file with your Instagram login credentials.

  2. Customize the scrape.py script to define your desired scraping parameters, such as usernames, hashtags, or specific posts.

  3. Run the following command to start the scraping process:

    python scrape.py
    
  4. The scraped data will be saved in CSV or JSON format, depending on your configuration.

Scraping Guidelines

Please ensure that you follow Instagram's scraping guidelines while using this tool. Instagram has specific rules and rate limits to prevent abuse of their platform. Avoid scraping a large amount of data in a short period, as this may lead to temporary or permanent IP bans.

Contributing

Contributions to this project are welcome. If you find any issues or have improvements to suggest, please feel free to open an issue or create a pull request.

License

This project is licensed under the MIT License.

Contact

If you have any questions or need further assistance, you can reach me via email at [email protected].

(Replace '[email protected]' with your actual email address)


You can copy and paste this template into a new file named "README.md" in the root of your "Instagram-automated-web-scraping" repository. Customize the content to match the specifics of your project, such as the objective, features, technologies used, scraping guidelines, and contact details. Additionally, consider adding examples or screenshots of the scraped data and instructions on how to interpret the results for users who are new to web scraping.

About

web scraped instagram by automation using selenium

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors