Barkland is a multi-agent simulation framework in which the simulated agents are represented as dogs. It leverages GKE (Google Kubernetes Engine) based Sandboxes for secure, isolated agent execution environments. A central Orchestrator governs the simulation loop and dashboard and dynamically spawns agent environments.
The system consists of the following components:
- Barkland Orchestrator:
  - A Python-based dashboard and API server (FastAPI + WebSockets).
  - Coordinates the simulation loop and maintains state in memory.
  - Spawns agents via `SandboxClaims` on Kubernetes using defined templates.
  - Exposed externally via a `LoadBalancer` Service for easy dashboard access.
- Agent Sandboxes:
  - Each agent (e.g., a Dog Agent) runs in a highly isolated Sandbox environment.
  - Configurable via `SandboxTemplate` specifications.
  - Utilizes the same base image as the Orchestrator, ensuring consistency in agent dependencies.
- Sandbox Router:
  - Routes workspace and communication traffic for active sandboxes.
  - Typically deployed as part of the core infrastructure.
- Sandbox Warmpools:
  - Pre-provision a configurable number of Pods to eliminate cold-start latency when agents are launched.
  - Maintained automatically so that `SandboxClaims` are satisfied in real time.
- Google ADK (Agent Development Kit): Provides the fundamental LLM agent building blocks (such as `LlmAgent` and tools) used to construct the dog agents' behaviors and personalities.
- agent-sandbox SDK: The core client framework used to programmatically provision, manage, and bridge communication with isolated Kubernetes sandboxes at scale, directly from the Python orchestrator.
- Google Kubernetes Engine (GKE): The underlying infrastructure providing scalability and workload orchestration.
- gVisor: Ensures robust security and runtime isolation for each individual agent environment.
- FastAPI & WebSockets: Powers the real-time Dashboard UI and persistent simulation loop.
- Gemini / Vertex AI: The large language models powering the behavioral logic of the simulated dog agents.
Before deploying, ensure you have set up the following:
- Google Cloud Platform (GCP) account with an active project.
- GKE Cluster:
- Workload Identity Enabled.
- GKE Sandbox (gVisor) Enabled on your node pools.
- agent-sandbox Controller:
  - The `deploy.sh` script automatically installs the core and extension components of `agent-sandbox` directly from the official GitHub releases, so no manual installation is required.
- Model Authentication (choose one):
  - Option A: Workload Identity (Vertex AI): The `barkland-orchestrator-sa` Kubernetes Service Account can be granted access to Vertex AI using Workload Identity Federation (Principal Identifiers). See the Workload Identity Federation Setup section below for step-by-step instructions.
  - Option B: Gemini API Key: Set `GEMINI_API_KEY` in your local environment. The deployment script uses it to create a Kubernetes secret that the agents consume.
- Local Tooling:
  - `gcloud` CLI initialized to your target project/cluster.
  - `kubectl` authenticated.
  - `docker` or `buildx` for building images locally.
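A quick way to confirm this tooling is on your `PATH` before deploying — the `check_tools` helper below is an illustrative snippet, not part of the repository:

```shell
# Report whether each required CLI is available on PATH.
# check_tools is a hypothetical helper, not shipped with Barkland.
check_tools() {
  for tool in "$@"; do
    if command -v "$tool" >/dev/null 2>&1; then
      echo "ok: $tool"
    else
      echo "missing: $tool"
    fi
  done
}

check_tools gcloud kubectl docker
```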
If you are starting from a fresh project, follow these steps to provision the required Google Cloud resources.
Create a Docker repository in Artifact Registry to store the built images.
```shell
export PROJECT_ID="your-project-id"
export REGISTRY_LOCATION="us-central1"
export REPO="barkland"

# Enable the Artifact Registry API
gcloud services enable artifactregistry.googleapis.com --project=$PROJECT_ID

# Create the repository
gcloud artifacts repositories create $REPO \
  --repository-format=docker \
  --location=$REGISTRY_LOCATION \
  --project=$PROJECT_ID \
  --description="Barkland Docker repository"

# Configure Docker to authenticate with the registry
gcloud auth configure-docker ${REGISTRY_LOCATION}-docker.pkg.dev
```

Barkland requires a GKE cluster with Workload Identity and GKE Sandbox (gVisor) support.
For a GKE Autopilot Cluster (Recommended & Simplest): Autopilot has Workload Identity enabled by default and supports GKE Sandbox automatically when requested by pods.
```shell
export CLUSTER_NAME="your-cluster-name"
export CLUSTER_LOCATION="us-central1" # Regional is recommended for Autopilot

# Enable the Kubernetes Engine API
gcloud services enable container.googleapis.com --project=$PROJECT_ID

# Create the Autopilot cluster
gcloud container clusters create-auto $CLUSTER_NAME \
  --location=$CLUSTER_LOCATION \
  --project=$PROJECT_ID
```

For a GKE Standard Cluster: If you prefer a Standard cluster, you must explicitly enable Workload Identity and GKE Sandbox.
```shell
export CLUSTER_NAME="your-cluster-name"
export CLUSTER_LOCATION="us-central1-a" # Zonal

# Enable the Kubernetes Engine API
gcloud services enable container.googleapis.com --project=$PROJECT_ID

# 1. Create the Standard cluster with Workload Identity
gcloud container clusters create $CLUSTER_NAME \
  --location=$CLUSTER_LOCATION \
  --project=$PROJECT_ID \
  --workload-pool=${PROJECT_ID}.svc.id.goog \
  --machine-type=e2-standard-4 \
  --num-nodes=1 # Default pool for system workloads

# 2. Add a node pool with GKE Sandbox (gVisor) enabled for sandboxed workloads
gcloud container node-pools create gvisor-nodepool \
  --cluster=$CLUSTER_NAME \
  --location=$CLUSTER_LOCATION \
  --project=$PROJECT_ID \
  --sandbox type=gvisor \
  --machine-type=e2-standard-4 \
  --num-nodes=10
```

If you choose Workload Identity (Option A) instead of an API key, you can use Workload Identity Federation to grant your Kubernetes workloads direct access to Google Cloud APIs (such as Vertex AI) without creating dedicated Google Cloud Service Accounts or annotating Kubernetes Service Accounts.
You just need to grant the required IAM roles directly to your Kubernetes Service Account (KSA) using its Principal Identifier.
The Orchestrator uses the following KSA:
- Name: `barkland-orchestrator-sa`
- Namespace: `barkland` (or your configured namespace)
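The principal identifier for this KSA is assembled mechanically from your project number, the project's Workload Identity pool, the namespace, and the service-account name. A quick sketch, using a made-up project number:

```shell
# Assemble the Workload Identity principal identifier for the KSA.
# PROJECT_NUMBER here is an example value; use your own project number.
PROJECT_NUMBER="123456789012"
PROJECT_ID="your-project-id"
NAMESPACE="barkland"
KSA="barkland-orchestrator-sa"

POOL="${PROJECT_ID}.svc.id.goog"
MEMBER="principal://iam.googleapis.com/projects/${PROJECT_NUMBER}/locations/global/workloadIdentityPools/${POOL}/subject/ns/${NAMESPACE}/sa/${KSA}"
echo "$MEMBER"
```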
Run the following command to grant the Vertex AI User role directly to the Orchestrator KSA:
```shell
export PROJECT_NUMBER="your-project-number" # Numeric project number, not the project ID

export NAMESPACE="barkland"

gcloud projects add-iam-policy-binding projects/$PROJECT_ID \
  --role="roles/aiplatform.user" \
  --member="principal://iam.googleapis.com/projects/$PROJECT_NUMBER/locations/global/workloadIdentityPools/$PROJECT_ID.svc.id.goog/subject/ns/$NAMESPACE/sa/barkland-orchestrator-sa" \
  --condition=None
```

Note: Ensure you remove or comment out the `GEMINI_API_KEY` environment variables in `k8s/barkland-app.yaml` and `k8s/sandbox_template.yaml` to force the application to use the default Google credentials chain (Workload Identity).
You can run the following sed commands to automatically comment out these blocks:
```shell
# Comment out GEMINI_API_KEY in barkland-app.yaml
sed -i '/- name: GEMINI_API_KEY/,/key: GEMINI_API_KEY/ s/^/#/' k8s/barkland-app.yaml

# Comment out GEMINI_API_KEY in sandbox_template.yaml
sed -i '/- name: GEMINI_API_KEY/,/key: GEMINI_API_KEY/ s/^/#/' k8s/sandbox_template.yaml
```

A bundled script is provided for easy automated deployments.
Before deploying, ensure you have created a `.configuration` file in the root of the repository to define your environment properties. The `deploy.sh` script requires these values.
```shell
cat <<EOF > .configuration
PROJECT_ID="your-project-id"
CLUSTER_LOCATION="us-central1-a" # e.g. zone for the cluster
REGISTRY_LOCATION="us-central1"  # e.g. region for the Artifact Registry
CLUSTER_NAME="your-cluster-name"
NAMESPACE="barkland"
REPO="barkland"
WARMPOOL_REPLICAS="10"
EOF
```

If you are using a Gemini API Key rather than Vertex AI Workload Identity, you must also export it in your environment so the deployment script can create the Kubernetes secret:
```shell
export GEMINI_API_KEY="your-gemini-api-key"
```

Run the full-cycle deployment script from your repository root:
```shell
chmod +x ./scripts/deploy.sh
./scripts/deploy.sh
```

The script will:

- Sync Credentials: Authenticates `kubectl` to your target GKE cluster.
- Namespace: Checks for and creates the `barkland` namespace.
- Secrets Management: Reads `$GEMINI_API_KEY` from your local environment and creates a generic Kubernetes secret (`gemini-api-key`) in the cluster.
- Build & Push Images: Executes `./scripts/push-images` to compile your containers and push them to Artifact Registry.
- Manifest Apply: Uses `envsubst` to inject properties (like `WARMPOOL_REPLICAS` and `NAMESPACE`) from `.configuration` into the YAML definitions (e.g., `k8s/sandbox_warmpool.yaml` and `k8s/sandbox_template.yaml`) before applying them to the cluster.
- Rollout Verification: Waits for readiness confirmations for critical containers and verifies the `SandboxWarmPool` status.
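The manifest-apply step amounts to sourcing `.configuration` and piping each manifest through `envsubst`. A minimal self-contained sketch of that mechanism, using throwaway files in `/tmp` rather than the repository's actual manifests (the exact approach `deploy.sh` takes may differ):

```shell
# Sketch: load configuration values and substitute them into a manifest.
cat > /tmp/demo.configuration <<'EOF'
NAMESPACE="barkland"
WARMPOOL_REPLICAS="10"
EOF

# Export every variable defined in the file so envsubst can see them.
set -a
. /tmp/demo.configuration
set +a

# A minimal manifest fragment using the same ${VAR} placeholders.
cat > /tmp/demo-warmpool.yaml <<'EOF'
metadata:
  namespace: ${NAMESPACE}
spec:
  replicas: ${WARMPOOL_REPLICAS}
EOF

# envsubst replaces each ${VAR} with its exported value.
envsubst < /tmp/demo-warmpool.yaml
```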
If you need to build and push images separately from the full deployment, run:
```shell
# Build and push container images independently
./scripts/push-images --image-prefix=us-central1-docker.pkg.dev/your-project-id/barkland/ --extra-image-tag latest
```

Note: The image pushing script assumes an Artifact Registry path of the form:
`[REGISTRY_LOCATION]-docker.pkg.dev/[PROJECT_ID]/barkland`
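This path is simply the `.configuration` values joined together; for example:

```shell
# Assemble the image prefix from the same values used in .configuration
# (example values shown; substitute your own).
REGISTRY_LOCATION="us-central1"
PROJECT_ID="your-project-id"
REPO="barkland"

IMAGE_PREFIX="${REGISTRY_LOCATION}-docker.pkg.dev/${PROJECT_ID}/${REPO}"
echo "$IMAGE_PREFIX"
```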
Check readiness:
```shell
kubectl get pods,svc -n barkland
```

Retrieve your dashboard endpoint:
```shell
# Obtain the external IP address
kubectl get svc barkland-orchestrator -n barkland
```

Visit the reported IP in your browser to interact with the dashboard directly!