Official Python SDK for CZero Engine
Personal AI Interface: full local AI suite with document processing, semantic search, and RAG system with AI personas
Installation • Quick Start • Examples • API Docs • Contributing
- Semantic Search: Vector-based search with hierarchical context support
- Document Processing: Extract, chunk, and embed multiple file formats
- RAG System: Context-aware AI responses using your documents
- AI Personas: Gestalt adaptive assistant + custom personas
- Workspace Management: Organize and process documents efficiently (CZero Engine's main offering)
- High Performance: Batch processing, streaming responses, GPU acceleration
- LangGraph Integration: Build complex AI agents with CZero Engine as the backend
- Cloud AI Compatible: Combine with OpenAI, Anthropic, Google AI, and more (LangChain compatible)
# From source (currently the only method)
git clone https://github.com/czero-cc/workflow-template.git
cd workflow-template
uv pip install -e .
# Or with pip
pip install -e .
# Install with optional dependencies
uv pip install -e ".[langgraph]" # For LangGraph integration
Requirements: Python 3.11+ | CZero Engine running on port 1421
import asyncio
from czero_engine import CZeroEngineClient

async def main():
    async with CZeroEngineClient() as client:
        # Check health
        health = await client.health_check()
        print(f"API Status: {health.status}")

        # Chat with LLM
        response = await client.chat(
            message="Explain RAG systems",
            use_rag=True  # Use document context if available
        )
        print(response.response)

asyncio.run(main())
from czero_engine.workflows import KnowledgeBaseWorkflow

async with KnowledgeBaseWorkflow() as kb:
    result = await kb.create_knowledge_base(
        name="Technical Docs",
        directory_path="./documents",
        chunk_size=1000,
        chunk_overlap=200
    )
    print(f"Processed {result['files_processed']} chunks")  # Hierarchical chunking creates multiple chunks per file
from czero_engine.workflows import RAGWorkflow

async with RAGWorkflow() as rag:
    # Ask with document context
    answer = await rag.ask(
        question="What are the key features?",
        chunk_limit=5,
        similarity_threshold=0.7
    )

    # Compare with/without RAG
    comparison = await rag.compare_with_without_rag(
        question="Explain semantic search"
    )

# Search at different hierarchy levels (using the low-level CZeroEngineClient, as in the Quick Start)
results = await client.semantic_search(
    query="machine learning concepts",
    hierarchy_level="0",     # Sections only
    include_hierarchy=True   # Include parent/child context
)
from czero_engine.workflows import PersonaWorkflow

async with PersonaWorkflow() as personas:
    # Chat with default Gestalt persona
    await personas.select_persona("gestalt-default")  # Adaptive Intelligence
    response = await personas.chat(
        "Analyze the implications of AGI"
    )

    # Or chat directly without selecting
    response = await personas.chat(
        "What are the key features of CZero Engine?",
        persona_id="gestalt-default"
    )
from typing import Optional

from langgraph.graph import StateGraph, MessagesState
from langchain_core.messages import AIMessage, HumanMessage
from langchain_core.outputs import ChatResult, ChatGeneration
from langchain_core.language_models import BaseChatModel
from czero_engine import CZeroEngineClient

# Create a ChatModel wrapper for CZero Engine (simplified from example 05)
class CZeroLLM(BaseChatModel):
    client: Optional[CZeroEngineClient] = None
    use_rag: bool = True
    base_url: str = "http://localhost:1421"

    def __init__(self, **kwargs):
        super().__init__(**kwargs)
        if not self.client:
            self.client = CZeroEngineClient(base_url=self.base_url)

    async def _agenerate(self, messages, **kwargs):
        # Convert messages to a prompt for CZero Engine
        prompt = messages[-1].content if messages else ""
        # Use CZero Engine for generation with RAG
        response = await self.client.chat(
            message=prompt,
            use_rag=self.use_rag,
            max_tokens=1024
        )
        return ChatResult(generations=[ChatGeneration(
            message=AIMessage(content=response.response)
        )])

    @property
    def _llm_type(self):
        return "czero-engine"

# Use CZero Engine as the LLM backend for LangGraph agents
llm = CZeroLLM(use_rag=True)

# Build complex agent workflows with Command-based routing
# (search_node and analyze_node are user-defined async node functions)
workflow = StateGraph(MessagesState)
workflow.add_node("search", search_node)
workflow.add_node("analyze", analyze_node)
graph = workflow.compile()

# Combine with cloud AI providers
from langchain_openai import ChatOpenAI
from langchain_anthropic import ChatAnthropic

# Use multiple LLMs in your workflow
cloud_llm = ChatOpenAI(model="gpt-4")  # Or Anthropic, Google, etc.
local_llm = CZeroLLM()                 # Your local CZero Engine

# The possibilities are endless!
For fine-grained control:
async with CZeroEngineClient(
    base_url="http://localhost:1421",
    timeout=60.0
) as client:
    # Create workspace
    workspace = await client.create_workspace(
        name="Research Papers",
        path="./papers"
    )

    # Process documents (uses SmallToBig hierarchical chunking by default)
    result = await client.process_files(
        workspace_id=workspace.id,
        files=["paper1.pdf", "paper2.md"],
        chunk_size=500,
        chunk_overlap=100
    )

    # Semantic search
    results = await client.semantic_search(
        query="neural networks",
        limit=10,
        include_hierarchy=True
    )

    # Generate embeddings
    embedding = await client.generate_embedding(
        text="Advanced AI concepts"
    )
# Check system health
uv run czero health
# Create knowledge base
uv run czero create-kb ./docs --name "My KB" --chunk-size 1000
# Search documents
uv run czero search "query text" --limit 10 --threshold 0.7
# Ask with RAG
uv run czero ask "Your question" --rag --chunks 5
# Chat with persona
uv run czero chat --persona gestalt-default
# List available personas
uv run czero personas
# List documents
uv run czero documents
# Process documents in directory
uv run czero process ./docs --workspace "My Docs" --batch-size 10
# Generate embeddings
uv run czero embed "some text" --output embedding.json
# Show version
uv run czero version
| Endpoint | Method | Description |
|---|---|---|
| `/api/health` | GET | System health check |
| `/api/chat/send` | POST | LLM chat with optional RAG |
| `/api/vector/search/semantic` | POST | Semantic search with hierarchy |
| `/api/vector/search/similarity` | POST | Find similar chunks |
| `/api/embeddings/generate` | POST | Generate text embeddings |
| `/api/workspaces/create` | POST | Create workspace |
| `/api/workspaces/process` | POST | Process documents |
| `/api/personas/list` | GET | List AI personas |
| `/api/personas/chat` | POST | Chat with persona |
| `/api/documents` | GET | List all documents |
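The SDK wraps these endpoints, but they can also be called directly over HTTP. Below is a minimal sketch using `httpx`, assuming the default port 1421; the JSON body for `/api/chat/send` simply mirrors the SDK's `message`/`use_rag` parameters and is an assumption, not the confirmed wire schema (see the Pydantic models below).

```python
import asyncio
import httpx

async def raw_api_demo():
    async with httpx.AsyncClient(base_url="http://localhost:1421") as http:
        # Health check: plain GET, no body required
        health = await http.get("/api/health")
        print(health.json())

        # Chat with optional RAG; field names mirror the SDK's chat()
        # parameters and are assumptions, not a confirmed schema.
        reply = await http.post(
            "/api/chat/send",
            json={"message": "Explain RAG systems", "use_rag": True},
            timeout=60.0,
        )
        print(reply.json())

asyncio.run(raw_api_demo())
```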
CZero Engine uses E5 embedding models which are known for their high-quality semantic representations. However, E5 models typically produce similarity scores that cluster above 70%, even for moderately related content.
Automatic Score Rescaling: To provide more intuitive similarity scores, CZero Engine automatically rescales E5 similarity scores:
- Raw E5 scores of 70-100% are rescaled to 0-100%
- This provides better differentiation between content relevance
- Scores below 70% similarity are generally filtered out as irrelevant
Example rescaling (a code sketch of this mapping follows the list):
- Raw E5: 70% → Rescaled: 0% (minimum threshold)
- Raw E5: 85% → Rescaled: 50% (moderate similarity)
- Raw E5: 100% → Rescaled: 100% (exact match)
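In code, the mapping above amounts to a linear rescale of the 70-100% band onto 0-100%. A rough sketch (the engine's exact internal formula isn't shown here, so treat this as an approximation):

```python
def rescale_e5_score(raw: float, floor: float = 0.70) -> float:
    """Linearly map a raw E5 similarity in [floor, 1.0] onto [0.0, 1.0]."""
    if raw <= floor:
        return 0.0  # at or below the floor counts as irrelevant
    return (raw - floor) / (1.0 - floor)

# Reproduces the examples above: 0.70 -> 0.00, 0.85 -> 0.50, 1.00 -> 1.00
for raw in (0.70, 0.85, 1.00):
    print(f"raw {raw:.2f} -> rescaled {rescale_e5_score(raw):.2f}")
```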
When using the API:
# The similarity scores returned are already rescaled
results = await client.semantic_search(
    query="your search query",
    similarity_threshold=0.5  # This is post-rescaling (85% raw E5)
)
All models are fully typed with Pydantic:
from czero_engine.models import (
    ChatRequest, ChatResponse,
    SemanticSearchRequest, SearchResult,
    WorkspaceCreate, ProcessingResult,
    PersonaChat, PersonaResponse
)
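Because responses are Pydantic models, they compose well with type hints and serialization. A small illustrative sketch; only the `.response` attribute is taken from the examples above, and `model_dump_json()` assumes Pydantic v2:

```python
from czero_engine import CZeroEngineClient
from czero_engine.models import ChatResponse

def summarize(reply: ChatResponse) -> str:
    # .response holds the generated text, as in the examples above
    return reply.response[:200]

async def typed_chat(question: str) -> str:
    async with CZeroEngineClient() as client:
        reply = await client.chat(message=question)
        # model_dump_json() assumes Pydantic v2; handy for structured logging
        print(reply.model_dump_json()[:200])
        return summarize(reply)
```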
from czero_engine.workflows import KnowledgeBaseWorkflow, RAGWorkflow

async def build_qa_system(docs_dir: str):
    # 1. Create knowledge base
    async with KnowledgeBaseWorkflow() as kb:
        await kb.create_knowledge_base("QA KB", docs_dir)
        workspace_id = kb.workspace_id

    # 2. Interactive Q&A
    async with RAGWorkflow() as rag:
        while True:
            q = input("Question: ")
            if q == 'quit':
                break
            answer = await rag.ask(q, workspace_filter=workspace_id)
            print(f"Answer: {answer.response}\n")
async def analyze_similarity(doc1: str, doc2: str):
    async with CZeroEngineClient() as client:
        # Generate embeddings
        emb1 = await client.generate_embedding(doc1)
        emb2 = await client.generate_embedding(doc2)

        # Calculate cosine similarity
        import numpy as np
        v1, v2 = np.array(emb1.embedding), np.array(emb2.embedding)
        similarity = np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2))
        print(f"Similarity: {similarity:.3f}")
async with DocumentProcessingWorkflow(verbose=True) as processor:
    files = processor.discover_files("./docs", patterns=["*.pdf"])
    stats = await processor.process_documents(
        files=files,
        workspace_name="Batch Process",
        batch_size=10,  # Process 10 files at a time
        chunk_size=800
    )
    print(f"Files submitted: {stats.total_files}")
    print(f"Chunks created: {stats.total_chunks}")  # Hierarchical chunks
    print(f"Est. success rate: {stats.success_rate:.1f}%")
    print(f"Throughput: {stats.total_chunks/stats.processing_time:.1f} chunks/s")
While CZero Engine is fully self-contained with local LLMs and RAG, you can seamlessly integrate cloud AI providers through LangChain when needed:
from langgraph.graph import StateGraph, MessagesState
from langchain_core.messages import AIMessage
from langchain_openai import ChatOpenAI
from langchain_anthropic import ChatAnthropic
from czero_engine import CZeroEngineClient

# Build a hybrid workflow: local RAG + cloud LLMs
workflow = StateGraph(MessagesState)

# Use CZero for local RAG and embeddings
async def search_local(state):
    async with CZeroEngineClient() as client:
        # CZero handles document search locally
        results = await client.semantic_search(state["query"])
        return {"context": results}

# Use a cloud LLM for specific tasks if needed
async def generate_cloud(state):
    cloud_llm = ChatOpenAI(model="gpt-4")  # or Anthropic, Google, etc.
    response = await cloud_llm.ainvoke(state["messages"])
    return {"messages": [response]}

# Or use CZero Engine for everything
async def generate_local(state):
    async with CZeroEngineClient() as client:
        response = await client.chat(
            message=state["messages"][-1].content,
            use_rag=True
        )
        return {"messages": [AIMessage(content=response.response)]}

# Mix and match as needed - the choice is yours!
workflow.add_node("search", search_local)
workflow.add_node("generate", generate_local)  # or generate_cloud
The beauty of LangChain compatibility is that you can start fully local and add cloud services only when needed!
# Run all tests
uv run pytest
# Run specific test
uv run pytest tests/test_integration.py::test_hierarchical_search
# Run with coverage
uv run pytest --cov=czero_engine
Environment variables (`.env` file):
CZERO_API_URL=http://localhost:1421
CZERO_API_TIMEOUT=60.0
CZERO_VERBOSE=false
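If you prefer wiring these values up explicitly, here is a minimal sketch that reads them from the environment and passes them to the client (whether the SDK picks them up automatically is not covered here):

```python
import os
from czero_engine import CZeroEngineClient

# Optionally load a local .env file first, e.g. with python-dotenv:
#   from dotenv import load_dotenv; load_dotenv()

base_url = os.getenv("CZERO_API_URL", "http://localhost:1421")
timeout = float(os.getenv("CZERO_API_TIMEOUT", "60.0"))

def make_client() -> CZeroEngineClient:
    # base_url and timeout match the constructor arguments shown above
    return CZeroEngineClient(base_url=base_url, timeout=timeout)
```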
- Batch Processing: Process multiple files in parallel (see the concurrency sketch after this list)
- Chunk Size: 500-1000 tokens for general documents
- Hierarchy: Use hierarchical search for structured documents
- Models: Ensure LLM and embedding models are pre-loaded
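As a rough illustration of the batch-processing tip, a sketch that fans embedding requests out concurrently with `asyncio.gather`; the semaphore limit is an arbitrary example value:

```python
import asyncio
from czero_engine import CZeroEngineClient

async def embed_many(texts: list[str], max_concurrency: int = 8):
    sem = asyncio.Semaphore(max_concurrency)  # cap in-flight requests
    async with CZeroEngineClient() as client:

        async def embed_one(text: str):
            async with sem:
                return await client.generate_embedding(text=text)

        # Fan requests out concurrently instead of awaiting them one by one
        return await asyncio.gather(*(embed_one(t) for t in texts))
```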
We welcome contributions! Please see CONTRIBUTING.md for guidelines.
Branch Strategy:
- `main` - Stable releases only
- `develop` - Active development branch
- `feature/*` - New features
- `fix/*` - Bug fixes
- `docs/*` - Documentation updates
Workflow:
- Fork the repository
- Create a feature branch from `develop`
- Make changes and test
- Submit a PR to the `develop` branch
- After review, we'll merge to `develop`
- Periodically, `develop` → `main` for releases
Versioning:
- We follow Semantic Versioning
- Format: `MAJOR.MINOR.PATCH`
- Example: `1.2.3`
# Tag a release
git tag -a v1.0.0 -m "Release version 1.0.0"
git push origin v1.0.0
# Create release on GitHub with changelog
MIT License - see LICENSE file
- Email: [email protected]
- Discord: Join our community
- Issues: GitHub Issues
- Docs: Documentation