Waverless Developer Guide

Table of Contents

  1. Code Structure
  2. Core Design Patterns
  3. Task Assignment
  4. Graceful Shutdown
  5. Autoscaler Design
  6. Provider Integration
  7. Background Jobs
  8. Testing

1. Code Structure

```
waverless/
├── app/                      # Application layer
│   ├── handler/              # HTTP handlers (thin)
│   ├── middleware/           # Auth, logging, recovery
│   └── router/               # Route definitions
│
├── cmd/                      # Entry points
│   ├── main.go               # Application entry
│   ├── initializers.go       # Dependency injection
│   └── jobs.go               # Background jobs
│
├── internal/                 # Internal packages
│   ├── service/              # Business logic
│   │   ├── endpoint/         # Endpoint CRUD + deployment
│   │   ├── lifecycle/        # Worker lifecycle
│   │   ├── task_service.go
│   │   └── worker_service.go
│   ├── model/                # Domain models
│   └── vo/                   # Value objects (DTOs)
│
├── pkg/                      # Shared packages
│   ├── autoscaler/           # Autoscaling engine
│   │   ├── manager.go        # Control loop
│   │   ├── decision_engine.go
│   │   ├── executor.go
│   │   └── metrics_collector.go
│   ├── provider/             # Deployment providers
│   │   ├── k8s/              # Kubernetes
│   │   ├── novita/           # Novita Serverless
│   │   └── docker/           # Docker (dev)
│   ├── interfaces/           # Shared interfaces
│   └── store/                # Data access
│       ├── mysql/            # MySQL repositories
│       └── redis/            # Redis operations
│
└── config/                   # Config files
```

Layer Responsibilities

| Layer    | Responsibility                    |
|----------|-----------------------------------|
| Handler  | HTTP request/response, validation |
| Service  | Business logic, orchestration     |
| Provider | Infrastructure abstraction        |
| Store    | Data persistence                  |

2. Core Design Patterns

Dependency Injection

All dependencies are wired in cmd/initializers.go:

```mermaid
flowchart TB
    Config --> Stores
    Stores --> Providers
    Providers --> Services
    Services --> Handlers
    Handlers --> Router
```
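The wiring above can be sketched as constructor injection, where each layer receives its dependencies explicitly. The type and constructor names below are illustrative stand-ins, not the real ones in cmd/initializers.go:

```go
package main

import "fmt"

// Minimal stand-ins for the real stores, providers, services, and
// handlers wired in cmd/initializers.go.
type Store struct{ dsn string }
type Provider struct{ store *Store }
type Service struct{ provider *Provider }
type Handler struct{ service *Service }

// Each constructor takes its dependencies as arguments, so the wiring
// order mirrors the flowchart: config -> store -> provider -> service -> handler.
func NewStore(dsn string) *Store     { return &Store{dsn: dsn} }
func NewProvider(s *Store) *Provider { return &Provider{store: s} }
func NewService(p *Provider) *Service { return &Service{provider: p} }
func NewHandler(s *Service) *Handler  { return &Handler{service: s} }

func main() {
	store := NewStore("user:pass@tcp(localhost:3306)/waverless")
	provider := NewProvider(store)
	service := NewService(provider)
	handler := NewHandler(service)
	fmt.Println(handler.service == service) // the graph is explicit and inspectable
}
```

Because nothing is constructed through globals, any layer can be swapped for a fake in tests.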

Service Layer Pattern

```mermaid
flowchart LR
    Handler --> Service
    Service --> Store[(MySQL/Redis)]
    Service --> Provider
    Service --> OtherService
```

Transaction Pattern

Database operations that need atomicity are wrapped in transactions:

  • Multiple operations execute as a single unit
  • Automatic rollback on any failure
  • Commit only when all operations succeed

Async Non-Critical Updates

Non-critical operations (like statistics updates) run asynchronously to avoid blocking the main request flow.
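A minimal sketch of this fire-and-forget pattern, assuming a statistics counter as the non-critical update (the type and method names are illustrative):

```go
package main

import (
	"fmt"
	"sync"
)

// statsCounter stands in for a statistics table; updating it is
// non-critical, so RecordAsync fires the write on a goroutine and
// returns immediately instead of blocking the request path.
type statsCounter struct {
	mu    sync.Mutex
	wg    sync.WaitGroup
	calls int
}

func (s *statsCounter) RecordAsync() {
	s.wg.Add(1)
	go func() {
		defer s.wg.Done()
		s.mu.Lock()
		s.calls++
		s.mu.Unlock()
	}()
}

// Wait exists only so demos and tests can observe the result; a server
// would let the goroutines drain in the background.
func (s *statsCounter) Wait() int {
	s.wg.Wait()
	return s.calls
}

func main() {
	var stats statsCounter
	for i := 0; i < 3; i++ {
		stats.RecordAsync() // request path returns immediately
	}
	fmt.Println(stats.Wait())
}
```

In production code such goroutines should also `recover()` so a panicking stats update cannot crash the process.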


3. Task Assignment

Double-Check Pattern

Prevents race conditions when workers are draining:

```mermaid
sequenceDiagram
    Worker->>Service: PullJobs(worker_id)
    Service->>MySQL: Check worker status

    alt DRAINING
        Service-->>Worker: Empty
    else ONLINE
        Service->>Redis: RPOP task
        Service->>MySQL: Update IN_PROGRESS
        Service->>MySQL: Re-check status

        alt Became DRAINING
            Service->>MySQL: Revert PENDING
            Service->>Redis: LPUSH back
            Service-->>Worker: Empty
        else Still ONLINE
            Service-->>Worker: Task
        end
    end
```
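The check/pop/re-check sequence can be sketched in pure Go. The in-memory fakes below stand in for MySQL and Redis, and `pullJobs` is an illustrative reconstruction, not the real service code:

```go
package main

import "fmt"

// In-memory stand-ins for the MySQL worker table and the Redis queue.
type fakeDB struct{ status map[string]string }

type fakeQueue struct{ tasks []string }

func (q *fakeQueue) RPop() (string, bool) {
	if len(q.tasks) == 0 {
		return "", false
	}
	t := q.tasks[len(q.tasks)-1]
	q.tasks = q.tasks[:len(q.tasks)-1]
	return t, true
}

func (q *fakeQueue) LPush(t string) { q.tasks = append([]string{t}, q.tasks...) }

// pullJobs checks worker status, pops a task, then re-checks status.
// If the worker began draining between the two checks, the task is
// reverted and pushed back onto the queue.
func pullJobs(db *fakeDB, q *fakeQueue, workerID string) (string, bool) {
	if db.status[workerID] != "ONLINE" { // first check
		return "", false
	}
	task, ok := q.RPop()
	if !ok {
		return "", false
	}
	// ...mark the task IN_PROGRESS in MySQL here...
	if db.status[workerID] != "ONLINE" { // second check (the "double-check")
		// ...revert the task to PENDING in MySQL here...
		q.LPush(task)
		return "", false
	}
	return task, true
}

func main() {
	db := &fakeDB{status: map[string]string{"w1": "ONLINE"}}
	q := &fakeQueue{tasks: []string{"t1", "t2"}}

	task, ok := pullJobs(db, q, "w1")
	fmt.Println(task, ok) // RPop takes from the right: t2 true

	db.status["w1"] = "DRAINING"
	_, ok = pullJobs(db, q, "w1")
	fmt.Println(ok, len(q.tasks)) // false 1: nothing assigned to a draining worker
}
```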

Atomic Task Selection

Task selection uses database-level locking to prevent race conditions:

  • Select pending tasks with row-level lock
  • Update status atomically
  • Ensures no duplicate task assignment
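The locking described above is commonly expressed with `SELECT ... FOR UPDATE SKIP LOCKED` (MySQL 8+). The statements below are an assumed sketch of that shape, run inside one transaction, and may differ from the actual Waverless queries:

```go
package main

import "fmt"

// Sketch of the two statements executed inside a single transaction.
// The row locks taken by FOR UPDATE SKIP LOCKED make concurrent
// selectors skip rows already claimed, so no task is assigned twice.
const selectPendingSQL = `
SELECT id FROM tasks
WHERE endpoint_id = ? AND status = 'PENDING'
ORDER BY created_at
LIMIT ?
FOR UPDATE SKIP LOCKED`

const markInProgressSQL = `
UPDATE tasks SET status = 'IN_PROGRESS', worker_id = ?
WHERE id IN (?)`

func main() {
	// In real code these would run via (*sql.Tx).Query / Exec.
	fmt.Println(len(selectPendingSQL) > 0 && len(markInProgressSQL) > 0)
}
```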

4. Graceful Shutdown

Flow

```mermaid
sequenceDiagram
    K8s->>Informer: SIGTERM
    Informer->>Lifecycle: OnWorkerDraining
    Lifecycle->>MySQL: Set DRAINING

    loop Until done
        Worker->>Worker: Complete tasks
    end

    K8s->>Informer: Pod Deleted
    Informer->>Lifecycle: OnWorkerDelete
    Lifecycle->>MySQL: Set OFFLINE
```

Rolling Update Optimization

When a deployment changes:

```mermaid
flowchart LR
    A[Deployment Change] --> B[OnDeploymentChange]
    B --> C{Worker Status}
    C -->|Idle| D[Cost = -1000<br/>Delete First]
    C -->|Busy| E[Cost = 1000<br/>Delete Last]
```
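The cost values map naturally onto Kubernetes' `controller.kubernetes.io/pod-deletion-cost` annotation, which ReplicaSets consult when choosing which pods to delete first (lower cost is deleted first). A sketch of how the cost could be chosen, assuming that annotation is what the flowchart refers to:

```go
package main

import "fmt"

// Pods annotated with a lower pod-deletion-cost are deleted first
// during a scale-down or rolling update.
const podDeletionCostAnnotation = "controller.kubernetes.io/pod-deletion-cost"

// deletionCost picks the annotation value from worker status; the
// mapping matches the flowchart above.
func deletionCost(busy bool) int {
	if busy {
		return 1000 // delete last: let in-flight tasks finish
	}
	return -1000 // delete first: idle workers are cheap to remove
}

func main() {
	fmt.Println(deletionCost(false), deletionCost(true))
}
```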

Key Points

  1. Informer detects pod termination signal
  2. Worker marked DRAINING immediately
  3. No new tasks assigned to DRAINING workers
  4. Worker completes current tasks
  5. Pod deleted after tasks complete
  6. Worker marked OFFLINE

5. Autoscaler Design

Control Loop

```mermaid
flowchart TB
    subgraph "Every 30s"
        A[Acquire Lock] --> B[Collect Metrics]
        B --> C[Calculate Resources]
        C --> D[Make Decisions]
        D --> E[Execute]
        E --> F[Release Lock]
    end
```
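One iteration of the loop can be sketched with each step injected as a function, so the sequence is testable without Redis or a real cluster. The names (`loopSteps`, `runOnce`) are illustrative, not the autoscaler's actual API; in production the iteration would run on a 30-second `time.Ticker`:

```go
package main

import "fmt"

// loopSteps bundles the five steps of one control-loop iteration.
type loopSteps struct {
	acquireLock func() bool
	collect     func() int // e.g. pending task count
	decide      func(int) string
	execute     func(string)
	releaseLock func()
}

// runOnce performs a single iteration: skip entirely if another
// instance holds the lock, and always release the lock on exit.
func runOnce(s loopSteps) {
	if !s.acquireLock() {
		return
	}
	defer s.releaseLock()
	s.execute(s.decide(s.collect()))
}

func main() {
	var trace []string
	runOnce(loopSteps{
		acquireLock: func() bool { trace = append(trace, "lock"); return true },
		collect:     func() int { trace = append(trace, "collect"); return 7 },
		decide:      func(int) string { trace = append(trace, "decide"); return "scale-up" },
		execute:     func(d string) { trace = append(trace, "execute:"+d) },
		releaseLock: func() { trace = append(trace, "unlock") },
	})
	fmt.Println(trace)
}
```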

Decision Engine

```mermaid
flowchart TB
    subgraph "Per Endpoint"
        M1[Pending Tasks]
        M2[Replicas]
        M3[Idle Time]
        M4[Priority]
    end

    M1 & M2 & M3 & M4 --> D{Decision}

    D -->|pending >= threshold| Up[Scale Up]
    D -->|idle >= timeout| Down[Scale Down]
    D -->|otherwise| None[No Action]
```

Multi-Instance Safety

Uses Redis distributed lock to ensure only one instance makes scaling decisions at a time, preventing conflicting operations in multi-replica deployments.
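The essential semantics of such a lock (Redis `SET key token NX PX ttl`, with a token check on release) can be shown with an in-memory stand-in. This is not the Waverless implementation, just the invariant it relies on:

```go
package main

import "fmt"

// lock is an in-memory stand-in for the Redis lock. The token check on
// release prevents one instance from releasing a lock that has since
// expired and been re-acquired by another instance.
type lock struct{ holder string }

func (l *lock) Acquire(token string) bool {
	if l.holder != "" {
		return false // someone else holds it
	}
	l.holder = token
	return true
}

func (l *lock) Release(token string) bool {
	if l.holder != token {
		return false // not ours anymore: do not release
	}
	l.holder = ""
	return true
}

func main() {
	var l lock
	fmt.Println(l.Acquire("instance-a")) // true
	fmt.Println(l.Acquire("instance-b")) // false: a holds the lock
	fmt.Println(l.Release("instance-b")) // false: wrong token
	fmt.Println(l.Release("instance-a")) // true
}
```

In Redis the release step must be a Lua script (compare token, then delete) so the check and delete are atomic.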

Priority & Preemption

  1. Sort scale-up requests by priority
  2. Minimum guarantee: one replica for every endpoint with pending tasks
  3. Allocate remaining resources by priority
  4. High-priority endpoints can preempt idle workers of low-priority endpoints

Starvation Protection

If an endpoint waits more than 5 minutes without receiving resources:

  • Temporarily elevate priority
  • Ensure eventual resource allocation

6. Provider Integration

DeploymentProvider Interface

The provider interface defines standard operations:

  • Deploy: Create new worker deployment
  • GetApp/ListApps: Query deployment status
  • DeleteApp: Remove deployment
  • ScaleApp: Adjust replica count
  • UpdateDeployment: Update deployment configuration
  • ListSpecs/GetSpec: Query available resource specs
  • WatchReplicas: Monitor replica changes
  • IsPodTerminating: Check termination status
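Reconstructed from the operation list above, the interface might look as follows. Parameter and return types are assumptions; the real signatures live in pkg/interfaces and may differ. The no-op implementation shows the minimal surface a new provider must cover:

```go
package main

import "fmt"

// App and Spec are simplified stand-ins for the real domain types.
type App struct {
	Name     string
	Replicas int
}

type Spec struct{ ID string }

// DeploymentProvider sketches the provider contract.
type DeploymentProvider interface {
	Deploy(app App) error
	GetApp(name string) (App, error)
	ListApps() ([]App, error)
	DeleteApp(name string) error
	ScaleApp(name string, replicas int) error
	UpdateDeployment(app App) error
	ListSpecs() ([]Spec, error)
	GetSpec(id string) (Spec, error)
	WatchReplicas(name string, onChange func(int)) error
	IsPodTerminating(pod string) (bool, error)
}

// noopProvider satisfies the interface with empty behavior; a real
// provider (e.g. pkg/provider/myprovider) replaces each method body.
type noopProvider struct{}

func (noopProvider) Deploy(App) error                      { return nil }
func (noopProvider) GetApp(string) (App, error)            { return App{}, nil }
func (noopProvider) ListApps() ([]App, error)              { return nil, nil }
func (noopProvider) DeleteApp(string) error                { return nil }
func (noopProvider) ScaleApp(string, int) error            { return nil }
func (noopProvider) UpdateDeployment(App) error            { return nil }
func (noopProvider) ListSpecs() ([]Spec, error)            { return nil, nil }
func (noopProvider) GetSpec(string) (Spec, error)          { return Spec{}, nil }
func (noopProvider) WatchReplicas(string, func(int)) error { return nil }
func (noopProvider) IsPodTerminating(string) (bool, error) { return false, nil }

var _ DeploymentProvider = noopProvider{} // compile-time interface check

func main() {
	fmt.Println(noopProvider{}.ScaleApp("demo", 2) == nil)
}
```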

Implementing a New Provider

  1. Create package: pkg/provider/myprovider/

  2. Implement the DeploymentProvider interface with all required methods

  3. Implement lifecycle callbacks (recommended):

    • OnWorkerStatusChange: Handle status transitions
    • OnWorkerDelete: Cleanup on worker removal
    • OnWorkerDraining: Stop task assignment
    • OnWorkerFailure: Record failure information
    • OnEndpointStatusChange: Track deployment health
  4. Register in provider factory: Add initialization logic

  5. Register in lifecycle manager: Connect lifecycle callbacks

Provider Comparison

| Provider | Use Case       | Lifecycle        |
|----------|----------------|------------------|
| K8s      | Production     | Full (Informers) |
| Novita   | Serverless GPU | Full (Polling)   |
| Docker   | Development    | Basic            |

7. Background Jobs

Job Manager (cmd/jobs.go)

| Job                   | Interval | Purpose                         |
|-----------------------|----------|---------------------------------|
| CleanupOfflineWorkers | 30s      | Mark stale workers offline      |
| CleanupOrphanedTasks  | 60s      | Re-queue tasks from dead workers |
| CleanupTimedOutTasks  | 60s      | Fail tasks exceeding timeout    |

Orphaned Task Recovery

```mermaid
flowchart TB
    A[Find IN_PROGRESS tasks] --> B{Worker exists?}
    B -->|No| C[Re-queue]
    B -->|Yes| D{Worker OFFLINE?}
    D -->|Yes| C
    D -->|No| E[Keep]
```
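The orphan check itself is a small pure predicate over worker state. A sketch, with illustrative types:

```go
package main

import "fmt"

// shouldRequeue reports whether an IN_PROGRESS task has been orphaned:
// its worker either no longer exists or is marked OFFLINE.
func shouldRequeue(workerStatus map[string]string, workerID string) bool {
	status, exists := workerStatus[workerID]
	return !exists || status == "OFFLINE"
}

func main() {
	workers := map[string]string{"w1": "ONLINE", "w2": "OFFLINE"}
	fmt.Println(shouldRequeue(workers, "w1")) // false: worker alive, keep task
	fmt.Println(shouldRequeue(workers, "w2")) // true: worker offline, re-queue
	fmt.Println(shouldRequeue(workers, "w3")) // true: worker gone, re-queue
}
```

The background job would run this predicate over all IN_PROGRESS tasks every 60 seconds and push the orphans back onto the queue.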

8. Testing

Unit Tests

```sh
go test ./...
go test -cover ./...
go test ./pkg/autoscaler/...
```

Integration Tests

```sh
go test -tags=integration ./...
```

Property-Based Tests

Located in *_property_test.go files, these tests verify system behavior across randomly generated inputs to catch edge cases.

Key Log Points

  • Task submission events
  • Job pull operations
  • Worker draining notifications
  • Scaling decisions

Contributing

  1. Fork repository
  2. Create feature branch
  3. Write tests
  4. Submit pull request

Code Style

  • Follow Go conventions
  • Use gofmt and golint
  • Add comments for exported functions
  • Meaningful commit messages

Document Version: v3.0
Last Updated: 2026-02