FFmpeg Service Deployment Guide

This guide explains how to deploy the FFmpeg Service using Gunicorn for production environments.

Overview

The FFmpeg Service is designed to run with Gunicorn as the WSGI server in production environments. This provides:

Better Performance: Multiple worker processes handle concurrent requests
Stability: Automatic worker restart and memory management
Scalability: Configurable worker count based on server resources
Reliability: Built-in health checks and error handling
Media Processing: Supports both video and audio processing with format conversion

Quick Start

1. Using Docker (Recommended)

# Pull the latest image
docker pull funnyzak/ffmpeg-service:latest

# Run with default settings
docker run -d --name ffmpeg-service \
  -p 8080:8080 \
  funnyzak/ffmpeg-service

# Run with custom Gunicorn settings
docker run -d --name ffmpeg-service \
  -p 8080:8080 \
  -e GUNICORN_WORKERS=8 \
  -e GUNICORN_TIMEOUT=300 \
  -e GUNICORN_MAX_REQUESTS=2000 \
  funnyzak/ffmpeg-service

2. Using Docker Compose

# Copy environment template
cp env.example .env

# Edit configuration
nano .env

# Start the service
docker-compose up -d

Gunicorn Configuration

Environment Variables

Variable	Description	Default	Recommended
`GUNICORN_WORKERS`	Number of worker processes	`4`	`(2 x CPU cores) + 1`
`GUNICORN_WORKER_CLASS`	Worker class type	`sync`	`sync` (for CPU-bound tasks)
`GUNICORN_TIMEOUT`	Worker timeout in seconds	`120`	`300` (for large files)
`GUNICORN_MAX_REQUESTS`	Restart workers after N requests	`1000`	`1000-2000`
`GUNICORN_MAX_REQUESTS_JITTER`	Add randomness to max requests	`100`	`100-200`

Configuration Examples

Small Server (2 CPU cores, 4GB RAM)

GUNICORN_WORKERS=4
GUNICORN_WORKER_CLASS=sync
GUNICORN_TIMEOUT=120
GUNICORN_MAX_REQUESTS=1000
GUNICORN_MAX_REQUESTS_JITTER=100

Medium Server (4 CPU cores, 8GB RAM)

GUNICORN_WORKERS=8
GUNICORN_WORKER_CLASS=sync
GUNICORN_TIMEOUT=180
GUNICORN_MAX_REQUESTS=1500
GUNICORN_MAX_REQUESTS_JITTER=150

Large Server (8 CPU cores, 16GB RAM)

GUNICORN_WORKERS=16
GUNICORN_WORKER_CLASS=sync
GUNICORN_TIMEOUT=300
GUNICORN_MAX_REQUESTS=2000
GUNICORN_MAX_REQUESTS_JITTER=200

Memory-Constrained Environment

GUNICORN_WORKERS=2
GUNICORN_WORKER_CLASS=sync
GUNICORN_TIMEOUT=60
GUNICORN_MAX_REQUESTS=500
GUNICORN_MAX_REQUESTS_JITTER=50

Performance Tuning

Worker Count Calculation

For CPU-intensive tasks like video processing:

# Formula: (2 x CPU cores) + 1
# Example: 4 CPU cores = 9 workers
workers = (2 * cpu_cores) + 1

Memory Usage Estimation

Each worker typically uses:

Base memory: 50-100MB
Video processing: 200-500MB per active job
Peak memory: 1-2GB for large video files

Recommended Settings by Use Case

High-Throughput Processing

GUNICORN_WORKERS=8
GUNICORN_TIMEOUT=300
GUNICORN_MAX_REQUESTS=2000
MAX_FILE_SIZE=2147483648  # 2GB

Memory-Constrained Environment

GUNICORN_WORKERS=2
GUNICORN_TIMEOUT=60
GUNICORN_MAX_REQUESTS=500
MAX_FILE_SIZE=524288000   # 500MB

Development/Testing

GUNICORN_WORKERS=1
GUNICORN_TIMEOUT=30
GUNICORN_MAX_REQUESTS=100
MAX_FILE_SIZE=104857600   # 100MB

Monitoring and Health Checks

Built-in Health Check

The service includes a health check endpoint:

# Check service health
curl http://localhost:8080/health

# Expected response
{
  "code": 0,
  "msg": "Service is healthy",
  "data": {}
}

Docker Health Check

The Docker image includes automatic health checks:

healthcheck:
  test: ["CMD", "curl", "-f", "http://localhost:8080/health"]
  interval: 30s
  timeout: 10s
  retries: 3
  start_period: 40s

Logging

Gunicorn logs are configured to output to stdout/stderr:

# View logs
docker logs ffmpeg-service

# Follow logs
docker logs -f ffmpeg-service

Troubleshooting

Common Issues

1. Worker Timeout Errors

[ERROR] Worker timeout (pid: 1234)

Solution: Increase GUNICORN_TIMEOUT for large file processing:

GUNICORN_TIMEOUT=300

2. Memory Issues

[ERROR] Worker failed to boot

Solution: Reduce worker count and max requests:

GUNICORN_WORKERS=2
GUNICORN_MAX_REQUESTS=500

3. High CPU Usage

Solution: Adjust worker count based on CPU cores:

GUNICORN_WORKERS=4  # For 2 CPU cores

4. File Processing Failures

Solution: Check file size limits and timeout settings:

MAX_FILE_SIZE=1073741824  # 1GB
GUNICORN_TIMEOUT=180

Debugging Commands

# Test configuration
python test_gunicorn.py

# Check environment variables
docker exec ffmpeg-service env | grep GUNICORN

# Monitor resource usage
docker stats ffmpeg-service

# Check logs for errors
docker logs ffmpeg-service | grep ERROR

Production Checklist

Before deploying to production:

Security Considerations

API Key Authentication

Enable authentication for production:

API_KEYS=your_secret_key_here

Resource Limits

Set appropriate limits to prevent abuse:

MAX_FILE_SIZE=524288000      # 500MB
FILE_RETENTION_HOURS=2       # 2 hours
CLEANUP_INTERVAL_MINUTES=30  # 30 minutes

Network Security

Use HTTPS in production
Configure firewall rules
Limit access to trusted IPs
Monitor for unusual activity

Scaling Considerations

Horizontal Scaling

For high-traffic environments:

Load Balancer: Use nginx or HAProxy
Multiple Instances: Run multiple containers
Shared Storage: Use network storage for temp files
Database: Consider adding a database for job tracking

Vertical Scaling

For single-instance scaling:

Increase Workers: Based on CPU cores
Increase Memory: For larger file processing
Optimize Storage: Use SSD for temp files
Network: Ensure sufficient bandwidth

Example Production Deployment

# docker-compose.prod.yml
version: "3.8"
services:
  ffmpeg-service:
    image: funnyzak/ffmpeg-service:latest
    container_name: ffmpeg-service-prod
    environment:
      # Production settings
      - GUNICORN_WORKERS=8
      - GUNICORN_TIMEOUT=300
      - GUNICORN_MAX_REQUESTS=2000
      - MAX_FILE_SIZE=2147483648
      - FILE_RETENTION_HOURS=24
      - API_KEYS=your_production_key_here
    ports:
      - "8080:8080"
    volumes:
      - /data/temp:/tmp/videos
    restart: unless-stopped
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8080/health"]
      interval: 30s
      timeout: 10s
      retries: 3
    deploy:
      resources:
        limits:
          memory: 8G
          cpus: '4.0'
        reservations:
          memory: 4G
          cpus: '2.0'

This configuration provides a robust, scalable deployment suitable for production use.

FilesExpand file tree

DEPLOYMENT.md

Latest commit

History

DEPLOYMENT.md

File metadata and controls

FFmpeg Service Deployment Guide

Overview

Quick Start

1. Using Docker (Recommended)

2. Using Docker Compose

Gunicorn Configuration

Environment Variables

Configuration Examples

Small Server (2 CPU cores, 4GB RAM)

Medium Server (4 CPU cores, 8GB RAM)

Large Server (8 CPU cores, 16GB RAM)

Memory-Constrained Environment

Performance Tuning

Worker Count Calculation

Memory Usage Estimation

Recommended Settings by Use Case

High-Throughput Processing

Memory-Constrained Environment

Development/Testing

Monitoring and Health Checks

Built-in Health Check

Docker Health Check

Logging

Troubleshooting

Common Issues

1. Worker Timeout Errors

2. Memory Issues

3. High CPU Usage

4. File Processing Failures

Debugging Commands

Production Checklist

Security Considerations

API Key Authentication

Resource Limits

Network Security

Scaling Considerations

Horizontal Scaling

Vertical Scaling

Example Production Deployment