Multi-Tenant Logging Pipeline

A proof-of-concept multi-tenant logging pipeline on AWS, designed to be scalable and cost-effective, built on a "Centralized Ingestion, Decentralized Delivery" architecture.

🚀 What This Does

Collects logs from Kubernetes/OpenShift clusters using Vector agents
Stores logs centrally in S3 with intelligent compression and partitioning
Delivers logs to multiple customer AWS accounts simultaneously
Supports multiple delivery types per tenant (CloudWatch Logs + S3)
Reduces costs by ~90% compared to direct CloudWatch Logs ingestion

🏗️ Architecture Overview

graph LR
    K8s[Kubernetes Clusters] --> Vector[Vector Agents]
    Vector --> S3[Central S3 Storage]
    S3 --> SNS[Event Processing]
    SNS --> Lambda[Log Processor]
    Lambda --> CW1[Customer 1<br/>CloudWatch Logs]
    Lambda --> CW2[Customer 2<br/>CloudWatch Logs]
    Lambda --> S3_1[Customer 1<br/>S3 Bucket]
    Lambda --> S3_2[Customer 2<br/>S3 Bucket]

Key Benefits:

Multi-Delivery: Each tenant can receive logs via CloudWatch Logs AND S3 simultaneously
Direct S3 Writes: Eliminates Kinesis Firehose costs (~$50/TB saved)
Cross-Account Security: Secure delivery using IAM role assumption
Container-Based Processing: Modern Lambda functions using ECR containers
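
The multi-delivery idea can be illustrated with a small fan-out sketch. This is not the project's processor code: the handler functions and the `bucket_name` field are hypothetical, while `type`, `log_group_name`, and `enabled` mirror the tenant configuration shown later in this README.

```python
from typing import Callable, Dict, List

# Hypothetical delivery handlers. Real ones would write to CloudWatch Logs or
# S3 in the customer's account after assuming the tenant's role.
def deliver_to_cloudwatch(config: dict, events: List[dict]) -> str:
    return f"cloudwatch:{config['log_group_name']}:{len(events)}"


def deliver_to_s3(config: dict, events: List[dict]) -> str:
    # "bucket_name" is an assumed field name, used only for illustration.
    return f"s3:{config['bucket_name']}:{len(events)}"


HANDLERS: Dict[str, Callable[[dict, List[dict]], str]] = {
    "cloudwatch": deliver_to_cloudwatch,
    "s3": deliver_to_s3,
}


def fan_out(tenant_configs: List[dict], events: List[dict]) -> List[str]:
    """Send the same batch to every enabled delivery configured for a tenant."""
    results = []
    for config in tenant_configs:
        if not config.get("enabled", True):
            continue  # skip deliveries the tenant has switched off
        handler = HANDLERS.get(config["type"])
        if handler is None:
            raise ValueError(f"unknown delivery type: {config['type']}")
        results.append(handler(config, events))
    return results
```

A tenant with one `cloudwatch` and one `s3` configuration would receive the same batch twice, once per delivery type.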

📚 Documentation

Quick Start

🚀 5-Minute Setup - Get running quickly
🏗️ Architecture Deep Dive - Comprehensive system design
💻 Development Guide - Local development and testing

Component Guides

☁️ Infrastructure Deployment - CloudFormation templates
🚢 Kubernetes Deployment - Vector and processor deployment
🔌 API Management - Tenant configuration API
🐛 Troubleshooting - Common issues and solutions

🎯 Quick Start

Prerequisites

AWS CLI configured with appropriate permissions
S3 bucket for storing CloudFormation templates
kubectl configured for your Kubernetes clusters
Python 3.13+ and Podman for local development

1. Deploy Infrastructure

# Deploy global infrastructure (one-time)
cd cloudformation/
./deploy.sh -t global

# Deploy regional infrastructure with processing
./deploy.sh -t regional \
  -b your-cloudformation-templates-bucket \
  --central-role-arn arn:aws:iam::123456789012:role/ROSA-CentralLogDistributionRole-abcd1234 \
  --include-sqs --include-lambda \
  --ecr-image-uri 123456789012.dkr.ecr.us-east-1.amazonaws.com/log-processor:latest

2. Deploy Vector to Kubernetes

# Create logging namespace
kubectl create namespace logging

# Deploy Vector collector (OpenShift with specific overlay)
kubectl apply -k k8s/collector/overlays/cuppett

# Verify deployment
kubectl get pods -n logging

3. Configure Tenants

# Add tenant configuration to DynamoDB
aws dynamodb put-item \
  --table-name multi-tenant-logging-development-tenant-configs \
  --item '{
    "tenant_id": {"S": "acme-corp"},
    "type": {"S": "cloudwatch"},
    "log_distribution_role_arn": {"S": "arn:aws:iam::123456789012:role/LogDistributionRole"},
    "log_group_name": {"S": "/aws/logs/acme-corp"},
    "target_region": {"S": "us-east-1"},
    "enabled": {"BOOL": true}
  }'
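
For scripted setups, the same item could be written from Python. The sketch below assumes boto3 is installed and credentials are configured; the helper names `to_dynamodb_item` and `put_tenant_config` are made up for this example and are not part of the repository.

```python
def to_dynamodb_item(config: dict) -> dict:
    """Convert a flat tenant config into DynamoDB attribute-value format."""
    item = {}
    for key, value in config.items():
        if isinstance(value, bool):
            item[key] = {"BOOL": value}
        elif isinstance(value, str):
            item[key] = {"S": value}
        else:
            raise TypeError(f"unsupported type for {key!r}: {type(value).__name__}")
    return item


def put_tenant_config(table_name: str, config: dict) -> None:
    """Write the config with boto3 (imported lazily so the helper above has no dependencies)."""
    import boto3

    boto3.client("dynamodb").put_item(TableName=table_name, Item=to_dynamodb_item(config))


# Same values as the CLI example above.
acme = {
    "tenant_id": "acme-corp",
    "type": "cloudwatch",
    "log_distribution_role_arn": "arn:aws:iam::123456789012:role/LogDistributionRole",
    "log_group_name": "/aws/logs/acme-corp",
    "target_region": "us-east-1",
    "enabled": True,
}
# put_tenant_config("multi-tenant-logging-development-tenant-configs", acme)
```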

📖 Complete Deployment Guide

🔧 Development

Local Testing

# Source environment variables
source .env

# Test log processor directly
cd container/
python3 log_processor.py --mode sqs

# Test with containers
podman build -f Containerfile.processor -t log-processor:latest .
podman run --rm -e AWS_PROFILE=your-profile log-processor:latest

Container Architecture

Collector Container: Base container with Vector binary
Processor Container: Multi-stage build including Vector for CloudWatch delivery
Multi-Mode Support: Lambda runtime, SQS polling, and manual testing
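
A multi-mode entry point of this kind might be wired up roughly as follows. Only `--mode sqs` appears in this README; the `lambda` and `manual` mode names and the handler bodies are assumptions made for illustration.

```python
import argparse

# Only "sqs" is confirmed by the README; the other two names are assumed.
MODES = ("lambda", "sqs", "manual")


def parse_args(argv=None):
    parser = argparse.ArgumentParser(description="Log processor entry point (sketch)")
    parser.add_argument("--mode", choices=MODES, default="sqs",
                        help="how the processor receives work")
    return parser.parse_args(argv)


def run(mode: str) -> str:
    """Dispatch to a placeholder handler for the chosen mode."""
    handlers = {
        "lambda": lambda: "waiting for Lambda invocations",
        "sqs": lambda: "polling SQS",
        "manual": lambda: "reading a test event",
    }
    return handlers[mode]()
```

Calling `run(parse_args(["--mode", "sqs"]).mode)` selects the SQS branch, matching the local testing command above.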

💻 Full Development Guide

🎛️ Current Capabilities

✅ Implemented Features

Vector log collection with namespace filtering and intelligent parsing
Direct S3 storage with GZIP compression and dynamic partitioning
Multi-delivery support - CloudWatch Logs + S3 per tenant
Container-based Lambda processing with ECR images
Cross-account security via double-hop role assumption
Cost optimization with S3 lifecycle policies and compression
Development tools with fake log generator and local testing
API management for tenant configuration via REST API

🚧 Proof-of-Concept Limitations

Basic monitoring - AWS native services only (no custom metrics/dashboards)
Simple error handling - DLQ and retry logic without advanced workflow
Regional deployment - Manual multi-region setup required
Minimal UI - Configuration via API/CLI only

📊 Performance & Costs

Estimated Monthly Costs (1TB logs)

This Pipeline: ~$50/month (S3 + Lambda + supporting services)
Direct CloudWatch: ~$500/month (ingestion costs)
Kinesis Firehose: ~$100/month (additional processing costs)
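
The relative savings follow directly from these estimates. The short calculation below uses only the figures listed above:

```python
# Estimated monthly costs in USD for 1 TB of logs, taken from the list above.
monthly_cost_usd = {
    "pipeline": 50,
    "cloudwatch_direct": 500,
    "kinesis_firehose": 100,
}


def savings_pct(baseline: float, actual: float) -> int:
    """Percentage saved by paying `actual` instead of `baseline`."""
    return round(100 * (baseline - actual) / baseline)


vs_cloudwatch = savings_pct(monthly_cost_usd["cloudwatch_direct"], monthly_cost_usd["pipeline"])
vs_firehose = savings_pct(monthly_cost_usd["kinesis_firehose"], monthly_cost_usd["pipeline"])
print(f"vs direct CloudWatch: {vs_cloudwatch}% saved")  # 90% saved
print(f"vs Kinesis Firehose: {vs_firehose}% saved")     # 50% saved
```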

Performance Characteristics

Throughput: ~20,000 events/second per cluster node
Latency: ~2-5 minutes from log generation to delivery
Compression: ~30:1 ratio with GZIP
Scalability: Horizontal scaling via multiple processor instances

🔒 Security Model

Namespace Isolation: Vector only collects from labeled namespaces
Cross-Account Access: Customer roles with ExternalId validation
Encryption: SSE-S3/KMS encryption for all data at rest
Least Privilege: Minimal IAM permissions with resource restrictions
Audit Trail: All role assumptions logged in CloudTrail
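
The double-hop role assumption with ExternalId validation can be sketched with boto3's STS client. This is an illustrative outline, not the project's implementation: the function names and session names are hypothetical, and it assumes boto3 is installed and the caller may assume the central role.

```python
from typing import Optional


def assume_role_kwargs(role_arn: str, session_name: str,
                       external_id: Optional[str] = None) -> dict:
    """Build the arguments for STS AssumeRole, adding ExternalId only when given."""
    kwargs = {"RoleArn": role_arn, "RoleSessionName": session_name}
    if external_id is not None:
        kwargs["ExternalId"] = external_id
    return kwargs


def _session_from(credentials: dict):
    """Create a boto3 session from the Credentials block of an AssumeRole response."""
    import boto3  # imported lazily so the helper above has no dependencies

    return boto3.Session(
        aws_access_key_id=credentials["AccessKeyId"],
        aws_secret_access_key=credentials["SecretAccessKey"],
        aws_session_token=credentials["SessionToken"],
    )


def double_hop_session(central_role_arn: str, customer_role_arn: str, external_id: str):
    """Hop 1: processor identity to central role. Hop 2: central role to customer role."""
    import boto3

    first = boto3.client("sts").assume_role(
        **assume_role_kwargs(central_role_arn, "central-hop"))
    central = _session_from(first["Credentials"])

    second = central.client("sts").assume_role(
        **assume_role_kwargs(customer_role_arn, "customer-hop", external_id))
    return _session_from(second["Credentials"])
```

Each `assume_role` call shows up as a separate CloudTrail event, which is what makes the audit trail described above possible.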

🤝 Contributing

Check the Development Guide for local setup
Review the Architecture Design to understand the system
Test changes in a development environment first
Submit pull requests with detailed descriptions

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🏗️ POC Status: This project demonstrates core functionality with minimal complexity. Advanced monitoring, alerting, and management features should be added incrementally after pipeline validation.


openshift-online/rosa-log-router
