Fault-Tolerant Webhook Processor

A resilient and observable webhook processing system built with Node.js, Express, and Redis. This project demonstrates a production-grade architecture for ingesting, queuing, and safely processing jobs from third-party services, ensuring no data is ever lost.

The Problem

Standard webhook endpoints are fragile. If your server is down for maintenance, experiences a temporary crash, or receives a malformed payload, incoming webhooks can be lost forever. This leads to lost data, failed payments, and a catastrophic loss of user trust. This project is engineered to solve that problem.

The Solution

This system decouples the initial ingestion of a webhook from its final processing using a Redis-backed job queue. This architecture provides security, reliability, and observability.

Secure Ingestion API: A lightweight Express server acts as the entry point. It instantly validates every webhook's cryptographic signature to prevent spoofing attacks, places the job onto a queue, and responds immediately. This makes it incredibly fast and secure.
Resilient Background Worker: A separate Node.js process listens to the job queue. It processes one job at a time, ensuring that a single failure doesn't impact the entire system.
Dead-Letter Queue (DLQ): If a job contains a permanent error (e.g., malformed data), it is automatically moved to a DLQ after failing. This isolates "poison pill" messages and prevents them from clogging the main queue.
Management & Recovery UI: A simple web dashboard and an interactive CLI provide visibility into the DLQ, allowing an operator to inspect failed jobs and manually trigger a retry after a fix, demonstrating a full operational-support lifecycle.

✨ Key Features

Guaranteed Ingestion: Never lose a webhook, even during server downtime.
Zero-Trust Security: Validates webhook signatures using HMAC-SHA256 on the raw request body to prevent forgery and timing attacks.
High Resilience: Uses a Dead-Letter Queue (DLQ) to isolate and handle failing jobs gracefully.
Asynchronous & Fast: Decoupled architecture ensures the API is always responsive, no matter how long processing takes.
Atomic Operations: Uses Redis transactions (MULTI/EXEC) to prevent race conditions when retrying jobs.
Full Observability: Includes an interactive CLI and a web dashboard to view, manage, and retry failed jobs.

🛠️ Tech Stack

Backend: Node.js, Express.js
Queuing / Caching: ioredis
Dashboard: EJS (Embedded JavaScript Templates)
Testing: Node.js built-in test runner
Tooling: Winston (for logging), yargs (for CLI), dotenv, cross-env

🚀 Getting Started

Follow these instructions to get the project running on your local machine.

Prerequisites

Node.js (v18 or later recommended)
An active Redis instance
ngrok (for exposing your local server to GitHub for testing)

Installation & Setup

Clone the repository:

git clone https://github.com/TheBigWealth89/webhook-test.git
cd webhook-test

Install dependencies:
```
npm install
```
Set up environment variables:
- Create a .env file in the root of the project.
- You can use .env.example as a template for required variables.
- Add your GitHub webhook WEBHOOK_SECRET and other necessary configuration.
Run the application for development:
- Open three separate terminals.
- Terminal 1 (API Server): npm run dev
- Terminal 2 (Worker): npm run dev:worker
- Terminal 3 (Dashboard): npm run dev:dashboard
Expose your local API server:
```
ngrok http 7000
```
(Use the port your API server is running on)
Configure the GitHub Webhook:
- In your GitHub repo's Settings > Webhooks, create a new webhook.
- Use the ngrok URL for the "Payload URL".
- Set the Content Type to application/json.
- Enter the same secret you used in your .env file.

🕹️ Usage

Dashboard

Navigate to http://localhost:7001/dashboard (or your configured dashboard port) to view the failed jobs queue. From here, you can inspect payloads and retry jobs with a single click.

Command-Line Interface (CLI)

The interactive CLI allows you to manage the DLQ from your terminal.

View all failed jobs:

node scripts/inspect-dead-queue.js --view

Retry a specific job by its index:

node scripts/inspect-dead-queue.js --retry 0

Flush (delete) all jobs from the queue:

node scripts/inspect-dead-queue.js --flush

Testing Bad Jobs

To test the system's resilience with bad jobs, use the push bad job script:

node scripts/push-bad-job.js

This will inject a malformed job into the queue for testing error handling and DLQ functionality.

🧪 Testing

The project includes comprehensive tests using Node's built-in test runner to verify the webhook processor's functionality:

Run tests once:

npm test

Run tests in watch mode:

npm run test:watch

Tests cover:

Webhook signature validation
Job queue processing
Worker reliability
Error handling and Dead-Letter Queue behavior

Test files are located in the tests/ directory.

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
api		api
assets		assets
dashboard		dashboard
db		db
scripts		scripts
tests		tests
utils		utils
worker		worker
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fault-Tolerant Webhook Processor

The Problem

The Solution

✨ Key Features

🛠️ Tech Stack

🚀 Getting Started

Prerequisites

Installation & Setup

🕹️ Usage

Dashboard

Command-Line Interface (CLI)

Testing Bad Jobs

🧪 Testing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Fault-Tolerant Webhook Processor

The Problem

The Solution

✨ Key Features

🛠️ Tech Stack

🚀 Getting Started

Prerequisites

Installation & Setup

🕹️ Usage

Dashboard

Command-Line Interface (CLI)

Testing Bad Jobs

🧪 Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages