A Telegram bot that grants access to one user at a time, queues additional users, and fetches Instagram profile snapshots through Selenium + ChromeDriver.
## Features

- Single-user access with a managed waiting queue.
- Guided conversation to collect Instagram usernames and a refresh interval.
- Selenium-powered scraping helpers with pluggable Chrome profile support.
- Optional APScheduler job for unattended crawls that persist JSON snapshots.
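The single-user lock behind the waiting queue can be sketched like this (a minimal illustration with hypothetical class and method names, not the bot's actual implementation):

```python
from collections import deque

class AccessQueue:
    """Grants the bot to one user at a time; everyone else waits in FIFO order."""

    def __init__(self):
        self.active = None      # user id currently holding the lock
        self.waiting = deque()  # user ids waiting for access

    def request(self, user_id):
        """Return 0 if the user holds the lock, else their 1-based queue position."""
        if self.active is None or self.active == user_id:
            self.active = user_id
            return 0
        if user_id not in self.waiting:
            self.waiting.append(user_id)
        return list(self.waiting).index(user_id) + 1

    def release(self, user_id):
        """Free the slot and promote the next waiting user, if any."""
        if self.active == user_id:
            self.active = self.waiting.popleft() if self.waiting else None
        return self.active
```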
## Requirements

- Python 3.10 or newer.
- Google Chrome and a matching ChromeDriver on your `PATH`.
- Dependencies listed in `requirements.txt` (`pip install -r requirements.txt`).
## Quick start

```bat
python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt
set TELEGRAM_BOT_TOKEN=your-telegram-token

rem Optional Selenium tweaks
set INSTA_CHROME_PROFILE_DIR=C:\path\to\chrome-profile
set INSTA_CHROME_PROFILE_NAME=Profile 1
set INSTA_CHROME_HEADLESS=1

python -m bot.main
```

Open Telegram, talk to your bot, and send `/start`.
## Configuration

- **Telegram token** – set `TELEGRAM_BOT_TOKEN` in your environment. The bot raises a clear error if it is missing.
- **Chrome profile** – customise Selenium via:
  - `INSTA_CHROME_PROFILE_DIR` – folder that contains the Chrome profile to reuse.
  - `INSTA_CHROME_PROFILE_NAME` – optional profile directory name inside the user data folder.
  - `INSTA_CHROME_HEADLESS` – set to `1`/`true`/`on` to launch Chrome in headless mode.
- Without these variables the bot creates and uses a local `chrome_profile/` folder (ignored by git).
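One plausible way these variables translate into Chrome launch flags is sketched below; the resulting flags would be passed to a Selenium `ChromeOptions` object via `add_argument`. This is an assumption about the wiring, not the bot's exact code:

```python
def chrome_flags_from_env(env: dict) -> list:
    """Map the INSTA_CHROME_* variables onto Chrome command-line flags."""
    flags = []
    # Fall back to the local chrome_profile/ folder when no directory is set.
    flags.append(f"--user-data-dir={env.get('INSTA_CHROME_PROFILE_DIR', 'chrome_profile')}")
    if env.get("INSTA_CHROME_PROFILE_NAME"):
        flags.append(f"--profile-directory={env['INSTA_CHROME_PROFILE_NAME']}")
    # Any of 1/true/on (case-insensitive) enables headless mode.
    if env.get("INSTA_CHROME_HEADLESS", "").lower() in ("1", "true", "on"):
        flags.append("--headless=new")
    return flags
```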
## Commands

- `/start` – request access and see your position in the queue.
- `/start_tracking` – supply usernames and an hourly interval to fetch analytics.
- `/change` – update the usernames being tracked while you hold the lock.
- `/cancel_tracking` – release your slot for the next user.
- `/end` – clear your session and leave the queue entirely.
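Under the hood, command handling boils down to a dispatch table mapping each command to a handler. A framework-free sketch (the handler name and reply text are hypothetical, not the bot's real responses):

```python
def handle_start(user_id, state):
    """Hypothetical /start handler: enqueue the user and report their position."""
    state.setdefault("queue", []).append(user_id)
    return f"You are #{len(state['queue'])} in the queue."

COMMANDS = {"/start": handle_start}

def dispatch(text, user_id, state):
    """Route an incoming message to the matching command handler."""
    handler = COMMANDS.get(text.split()[0])
    return handler(user_id, state) if handler else "Unknown command."
```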
Responses include follower counts, total posts, and placeholders for post metrics (likes, comments, hashtags). Replace the placeholder logic in `utils/instagram_crawler.py` to collect real engagement numbers.
## Scheduled crawls

`bot/scheduler.py` exposes `create_scheduler()`, which schedules `scheduled_crawl()` every six hours by default. Execute the module directly to run a background scheduler, or import it and integrate it into your own service.
## Data

Sample output lives at `bot/data/sample_output.example.json`. Real crawls write JSON to `bot/data/`; the folder is git-ignored so production data stays local.
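Persisting one snapshot to `bot/data/` might look like the sketch below; the timestamped filename pattern is an assumption, not the crawler's actual naming scheme:

```python
import json
import time
from pathlib import Path

def save_snapshot(username: str, snapshot: dict, data_dir: str = "bot/data") -> Path:
    """Write a profile snapshot to a timestamped JSON file under data_dir."""
    out_dir = Path(data_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    path = out_dir / f"{username}-{stamp}.json"
    path.write_text(json.dumps(snapshot, indent=2, ensure_ascii=False))
    return path
```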
## Housekeeping

- `.gitignore` excludes Chrome/Selenium artefacts, caches, and compiled files.
- Runtime directories generated by Chrome (profiles, caches, Crashpad, etc.) have been removed from the repository root.
## Next steps

- Replace placeholder scraping with real metrics (likes, comments, hashtags, captions).
- Add resilience against Instagram throttling (retries, exponential backoff, proxy support).
- Extend analytics in `utils/data_processing.py` and add automated tests.
- Secure long-running deployments (process supervision, persistent storage, structured logging).
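For the throttling item above, retry with exponential backoff and jitter can be sketched as (a generic pattern, not code from this repository):

```python
import random
import time

def fetch_with_backoff(fetch, max_retries: int = 4, base_delay: float = 1.0):
    """Call fetch(), retrying with exponential backoff plus jitter on failure."""
    for attempt in range(max_retries):
        try:
            return fetch()
        except Exception:
            if attempt == max_retries - 1:
                raise  # give up after the last attempt
            # Wait 1s, 2s, 4s, ... plus up to 1s of random jitter.
            time.sleep(base_delay * 2 ** attempt + random.random())
```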