omem — Deployment Guide

Docker Quick Start (Development)
Production Docker Deployment
AWS Deployment (ECS Fargate + S3)
Object Storage (OSS / S3)
Environment Variables Reference
Monitoring & Observability

1. Docker Quick Start (Development)

The development setup uses MinIO as a local S3 replacement.

Prerequisites

Docker Engine 20+
Docker Compose v2

Start

# Clone the repository
git clone https://github.com/ourmem/omem.git
cd omem

# Copy environment file
cp .env.example .env

# Start services (omem-server + MinIO)
docker-compose up -d

This starts:

Service	Port	Description
`omem-server`	8080	omem REST API
`minio`	9000	S3-compatible storage
`minio`	9001	MinIO web console

Verify

# Health check
curl http://localhost:8080/health
# → {"status":"ok"}

# Create a tenant
curl -X POST http://localhost:8080/v1/tenants \
  -H "Content-Type: application/json" \
  -d '{"name":"dev"}'
# → {"id":"...","api_key":"...","status":"active"}

MinIO Console

Open http://localhost:9001 in your browser. Login with minioadmin / minioadmin.

docker-compose.yml

services:
  omem-server:
    build: .
    ports:
      - "8080:8080"
    env_file: .env
    depends_on:
      minio:
        condition: service_started
    environment:
      AWS_ENDPOINT_URL: http://minio:9000
      AWS_ACCESS_KEY_ID: minioadmin
      AWS_SECRET_ACCESS_KEY: minioadmin
      AWS_REGION: us-east-1
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8080/health"]
      interval: 30s
      timeout: 5s

  minio:
    image: minio/minio:latest
    command: server /data --console-address ":9001"
    ports:
      - "9000:9000"
      - "9001:9001"
    environment:
      MINIO_ROOT_USER: minioadmin
      MINIO_ROOT_PASSWORD: minioadmin
    volumes:
      - minio-data:/data

volumes:
  minio-data:

2. Production Docker Deployment

For production, use real AWS S3 instead of MinIO.

Setup

# Create production .env
cat > .env << 'EOF'
OMEM_PORT=8080
OMEM_LOG_LEVEL=info
OMEM_S3_BUCKET=your-omem-bucket

# AWS credentials (or use IAM role)
AWS_REGION=us-east-1
# AWS_ACCESS_KEY_ID=...      # Only if not using IAM role
# AWS_SECRET_ACCESS_KEY=...  # Only if not using IAM role

# Embedding (choose one)
OMEM_EMBED_PROVIDER=bedrock
# Or:
# OMEM_EMBED_PROVIDER=openai-compatible
# OMEM_EMBED_API_KEY=sk-xxx
# OMEM_EMBED_BASE_URL=https://api.openai.com
# OMEM_EMBED_MODEL=text-embedding-3-small

# LLM for smart extraction (optional)
OMEM_LLM_PROVIDER=openai-compatible
OMEM_LLM_API_KEY=sk-xxx
OMEM_LLM_BASE_URL=https://api.openai.com
OMEM_LLM_MODEL=gpt-4o-mini
EOF

# Start with production compose
docker-compose -f docker-compose.prod.yml up -d

docker-compose.prod.yml

services:
  omem-server:
    build: .
    ports:
      - "8080:8080"
    env_file: .env
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8080/health"]
      interval: 30s
      timeout: 5s

Pre-create S3 Bucket

aws s3 mb s3://your-omem-bucket --region us-east-1

LanceDB will automatically create the necessary table directories under s3://your-omem-bucket/omem/.

3. AWS Deployment (ECS Fargate + S3)

Architecture

Internet → ALB (port 80/443) → ECS Fargate → S3 (LanceDB storage)
                                    ↓
                              CloudWatch Logs

Step 1: Create ECR Repository

aws ecr create-repository --repository-name omem-server --region us-east-1

Step 2: Build and Push Docker Image

# Login to ECR
aws ecr get-login-password --region us-east-1 | \
  docker login --username AWS --password-stdin ACCOUNT_ID.dkr.ecr.us-east-1.amazonaws.com

# Build and push
docker build -t omem-server .
docker tag omem-server:latest ACCOUNT_ID.dkr.ecr.us-east-1.amazonaws.com/omem-server:latest
docker push ACCOUNT_ID.dkr.ecr.us-east-1.amazonaws.com/omem-server:latest

Step 3: Create IAM Task Role

The ECS task needs S3 access and optionally Bedrock access:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": [
        "s3:GetObject",
        "s3:PutObject",
        "s3:DeleteObject",
        "s3:ListBucket"
      ],
      "Resource": [
        "arn:aws:s3:::your-omem-bucket",
        "arn:aws:s3:::your-omem-bucket/*"
      ]
    },
    {
      "Effect": "Allow",
      "Action": [
        "bedrock:InvokeModel"
      ],
      "Resource": "*",
      "Condition": {
        "StringEquals": {
          "aws:RequestedRegion": "us-east-1"
        }
      }
    }
  ]
}

Step 4: Create ECS Task Definition

{
  "family": "omem-server",
  "networkMode": "awsvpc",
  "requiresCompatibilities": ["FARGATE"],
  "cpu": "512",
  "memory": "1024",
  "executionRoleArn": "arn:aws:iam::ACCOUNT_ID:role/ecsTaskExecutionRole",
  "taskRoleArn": "arn:aws:iam::ACCOUNT_ID:role/omem-task-role",
  "containerDefinitions": [
    {
      "name": "omem-server",
      "image": "ACCOUNT_ID.dkr.ecr.us-east-1.amazonaws.com/omem-server:latest",
      "portMappings": [
        {
          "containerPort": 8080,
          "protocol": "tcp"
        }
      ],
      "environment": [
        {"name": "OMEM_PORT", "value": "8080"},
        {"name": "OMEM_LOG_LEVEL", "value": "info"},
        {"name": "OMEM_S3_BUCKET", "value": "your-omem-bucket"},
        {"name": "AWS_REGION", "value": "us-east-1"},
        {"name": "OMEM_EMBED_PROVIDER", "value": "bedrock"}
      ],
      "secrets": [
        {
          "name": "OMEM_LLM_API_KEY",
          "valueFrom": "arn:aws:secretsmanager:us-east-1:ACCOUNT_ID:secret:omem/llm-api-key"
        }
      ],
      "healthCheck": {
        "command": ["CMD-SHELL", "curl -f http://localhost:8080/health || exit 1"],
        "interval": 30,
        "timeout": 5,
        "retries": 3
      },
      "logConfiguration": {
        "logDriver": "awslogs",
        "options": {
          "awslogs-group": "/ecs/omem-server",
          "awslogs-region": "us-east-1",
          "awslogs-stream-prefix": "ecs"
        }
      }
    }
  ]
}

Step 5: Create ECS Service

# Create cluster
aws ecs create-cluster --cluster-name omem-cluster

# Create service with ALB
aws ecs create-service \
  --cluster omem-cluster \
  --service-name omem-server \
  --task-definition omem-server \
  --desired-count 1 \
  --launch-type FARGATE \
  --network-configuration "awsvpcConfiguration={subnets=[subnet-xxx],securityGroups=[sg-xxx],assignPublicIp=ENABLED}" \
  --load-balancers "targetGroupArn=arn:aws:elasticloadbalancing:...,containerName=omem-server,containerPort=8080"

Resource Sizing

Workload	CPU	Memory	Estimated Cost
Dev/Test	256 (.25 vCPU)	512 MB	~$5/month
Small (1 user)	512 (.5 vCPU)	1 GB	~$15/month
Medium (10 users)	1024 (1 vCPU)	2 GB	~$35/month
Large (100 users)	2048 (2 vCPU)	4 GB	~$70/month

S3 storage cost: ~$0.023/GB/month (typically <$1/month for most workloads).

4. Object Storage (OSS / S3)

By default ourmem stores data on local disk (./omem-data/). For production durability and scalability, configure object storage. ourmem supports two schemes:

Scheme	Variable	Best for
`oss://`	`OMEM_OSS_BUCKET`	Alibaba Cloud (ECS, ACK)
`s3://`	`OMEM_S3_BUCKET`	AWS, MinIO, any S3-compatible

Priority rule: If both OMEM_OSS_BUCKET and OMEM_S3_BUCKET are set, OSS is used.

Alibaba Cloud OSS

# Required
OMEM_OSS_BUCKET=ourmem                                        # bucket name
OSS_ENDPOINT=https://oss-ap-southeast-1-internal.aliyuncs.com  # use internal endpoint on ECS

# Credentials (choose one)
# Option A: Static AK/SK
OSS_ACCESS_KEY_ID=your-ak
OSS_ACCESS_KEY_SECRET=your-sk

# Option B: ECS RAM role (recommended on Alibaba Cloud ECS)
# No credentials needed — the server auto-discovers them from instance metadata.
# If using a wrapper script for STS tokens, also set:
# OSS_SECURITY_TOKEN=<sts-token>

Data is stored at oss://{bucket}/omem/ (e.g., oss://ourmem/omem/).

S3-Compatible Storage

OMEM_S3_BUCKET=your-bucket
AWS_REGION=us-east-1
# AWS_ENDPOINT_URL=http://minio:9000   # for MinIO or other S3-compatible
# AWS_ACCESS_KEY_ID=...                 # not needed with IAM roles
# AWS_SECRET_ACCESS_KEY=...

Data is stored at s3://{bucket}/omem/.

ECS RAM Role with Wrapper Script

On Alibaba Cloud ECS, the recommended approach is to attach a RAM role to the instance and fetch STS temporary credentials from the instance metadata service. Use a wrapper script as the systemd entrypoint:

/opt/omem-start.sh:

#!/bin/bash
# Fetch STS credentials from ECS instance metadata (RAM role)
CREDS=$(curl -s http://100.100.100.200/latest/meta-data/ram/security-credentials/YOUR_ROLE_NAME)
export OSS_ACCESS_KEY_ID=$(echo "$CREDS" | python3 -c "import sys,json; print(json.load(sys.stdin)['AccessKeyId'])")
export OSS_ACCESS_KEY_SECRET=$(echo "$CREDS" | python3 -c "import sys,json; print(json.load(sys.stdin)['AccessKeySecret'])")
export OSS_SECURITY_TOKEN=$(echo "$CREDS" | python3 -c "import sys,json; print(json.load(sys.stdin)['SecurityToken'])")
exec /opt/omem-server

Make it executable: chmod +x /opt/omem-start.sh

Update your systemd service to use the wrapper:

[Service]
EnvironmentFile=/opt/omem.env
ExecStart=/opt/omem-start.sh

Example omem.env with OSS

OMEM_PORT=8080
OMEM_LOG_LEVEL=info

# Storage: Alibaba Cloud OSS
OMEM_OSS_BUCKET=ourmem
OSS_ENDPOINT=https://oss-ap-southeast-1-internal.aliyuncs.com
# Credentials via ECS RAM role (wrapper script handles STS tokens)

# Embedding
OMEM_EMBED_PROVIDER=openai-compatible
OMEM_EMBED_API_KEY=sk-xxx
OMEM_EMBED_BASE_URL=https://dashscope.aliyuncs.com/compatible-mode
OMEM_EMBED_MODEL=text-embedding-v3

# LLM
OMEM_LLM_PROVIDER=openai-compatible
OMEM_LLM_API_KEY=sk-xxx
OMEM_LLM_BASE_URL=https://dashscope.aliyuncs.com/compatible-mode
OMEM_LLM_MODEL=qwen-plus

5. Environment Variables Reference

Server Configuration

Variable	Default	Description
`OMEM_PORT`	`8080`	HTTP server port
`OMEM_LOG_LEVEL`	`info`	Log level: `trace`, `debug`, `info`, `warn`, `error`
`RUST_LOG`	`info`	Alternative log level (Rust standard)

Storage

Variable	Default	Description
`OMEM_OSS_BUCKET`	(none)	Alibaba Cloud OSS bucket name. Enables `oss://` storage scheme
`OSS_ENDPOINT`	(none)	OSS endpoint URL (e.g., `https://oss-ap-southeast-1-internal.aliyuncs.com`)
`OSS_ACCESS_KEY_ID`	(none)	OSS access key (not needed with ECS RAM role)
`OSS_ACCESS_KEY_SECRET`	(none)	OSS secret key (not needed with ECS RAM role)
`OSS_SECURITY_TOKEN`	(none)	STS security token (set by wrapper script for RAM role)
`OMEM_S3_BUCKET`	(empty)	S3 bucket name for LanceDB storage. Empty = local disk (`./omem-data/`). Enables `s3://` storage scheme
`AWS_ENDPOINT_URL`	(none)	Custom S3 endpoint (for MinIO: `http://minio:9000`)
`AWS_ACCESS_KEY_ID`	(none)	AWS access key (not needed with IAM roles)
`AWS_SECRET_ACCESS_KEY`	(none)	AWS secret key (not needed with IAM roles)
`AWS_REGION`	`us-east-1`	AWS region

Priority: OSS takes priority over S3. If both OMEM_OSS_BUCKET and OMEM_S3_BUCKET are set, OSS is used.

Embedding Provider

Variable	Default	Description
`OMEM_EMBED_PROVIDER`	`noop`	Provider: `noop`, `bedrock`, `openai-compatible`
`OMEM_EMBED_API_KEY`	(none)	API key (for openai-compatible)
`OMEM_EMBED_BASE_URL`	(none)	Base URL (for openai-compatible)
`OMEM_EMBED_MODEL`	(none)	Model name (for openai-compatible)

Provider details:

Provider	Model	Dimensions	Notes
`noop`	—	1024 (zeros)	For testing only, no real embeddings
`bedrock`	Titan Embed v2	1024	Uses AWS IAM credentials
`openai-compatible`	Configurable	1024	Works with OpenAI, Jina, Voyage, etc.

LLM Provider

Variable	Default	Description
`OMEM_LLM_PROVIDER`	(empty)	Provider: `openai-compatible`, `bedrock`
`OMEM_LLM_API_KEY`	(none)	API key (for openai-compatible)
`OMEM_LLM_BASE_URL`	`https://api.openai.com`	Base URL
`OMEM_LLM_MODEL`	`gpt-4o-mini`	Model name

Note: Without an LLM provider, smart mode ingestion falls back to raw mode (no fact extraction or reconciliation).

6. Monitoring & Observability

Health Check

curl http://localhost:8080/health
# → {"status":"ok"}

The /health endpoint is used by:

Docker HEALTHCHECK
ECS health checks
ALB target group health checks

Structured Logging

omem outputs structured JSON logs via tracing:

{
  "timestamp": "2025-01-15T10:30:00.123Z",
  "level": "INFO",
  "target": "omem_server::api::middleware::logging",
  "message": "request completed",
  "method": "GET",
  "path": "/v1/memories/search",
  "status": 200,
  "duration_ms": 45
}

Log Levels

Level	Use
`error`	Failures that need attention
`warn`	Degraded behavior (e.g., embedding fallback)
`info`	Request/response logging, lifecycle events
`debug`	Pipeline stage details, query plans
`trace`	Full request/response bodies, vector operations

CloudWatch Integration

When running on ECS, logs are automatically sent to CloudWatch via the awslogs driver. Create useful metric filters:

# Error rate
aws logs put-metric-filter \
  --log-group-name /ecs/omem-server \
  --filter-name ErrorCount \
  --filter-pattern '{ $.level = "ERROR" }' \
  --metric-transformations metricName=ErrorCount,metricNamespace=omem,metricValue=1

# Request latency (p99)
aws logs put-metric-filter \
  --log-group-name /ecs/omem-server \
  --filter-name RequestLatency \
  --filter-pattern '{ $.duration_ms = * }' \
  --metric-transformations metricName=RequestLatency,metricNamespace=omem,metricValue='$.duration_ms'

Key Metrics to Monitor

Metric	Source	Alert Threshold
Health check failures	ALB target group	Any unhealthy
Error rate	CloudWatch logs	>1% of requests
Request latency (p99)	CloudWatch logs	>500ms
Memory count growth	`GET /v1/memories?limit=1`	Unexpected spikes
S3 storage size	S3 bucket metrics	Budget threshold
CPU/Memory utilization	ECS metrics	>80% sustained

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

omem — Deployment Guide

Table of Contents

1. Docker Quick Start (Development)

Prerequisites

Start

Verify

MinIO Console

docker-compose.yml

2. Production Docker Deployment

Setup

docker-compose.prod.yml

Pre-create S3 Bucket

3. AWS Deployment (ECS Fargate + S3)

Architecture

Step 1: Create ECR Repository

Step 2: Build and Push Docker Image

Step 3: Create IAM Task Role

Step 4: Create ECS Task Definition

Step 5: Create ECS Service

Resource Sizing

4. Object Storage (OSS / S3)

Alibaba Cloud OSS

S3-Compatible Storage

ECS RAM Role with Wrapper Script

Example omem.env with OSS

5. Environment Variables Reference

Server Configuration

Storage

Embedding Provider

LLM Provider

6. Monitoring & Observability

Health Check

Structured Logging

Log Levels

CloudWatch Integration

Key Metrics to Monitor

FilesExpand file tree

DEPLOY.md

Latest commit

History

DEPLOY.md

File metadata and controls

omem — Deployment Guide

Table of Contents

1. Docker Quick Start (Development)

Prerequisites

Start

Verify

MinIO Console

docker-compose.yml

2. Production Docker Deployment

Setup

docker-compose.prod.yml

Pre-create S3 Bucket

3. AWS Deployment (ECS Fargate + S3)

Architecture

Step 1: Create ECR Repository

Step 2: Build and Push Docker Image

Step 3: Create IAM Task Role

Step 4: Create ECS Task Definition

Step 5: Create ECS Service

Resource Sizing

4. Object Storage (OSS / S3)

Alibaba Cloud OSS

S3-Compatible Storage

ECS RAM Role with Wrapper Script

Example omem.env with OSS

5. Environment Variables Reference

Server Configuration

Storage

Embedding Provider

LLM Provider

6. Monitoring & Observability

Health Check

Structured Logging

Log Levels

CloudWatch Integration

Key Metrics to Monitor