
Kaito workspace with CPU-only node #589

Answered by ishaansehgal99
clarenceb asked this question in Q&A
Yes, it is possible to create a workspace with CPU-based nodes. While Kaito is optimized for GPU workloads to maximize performance, CPU-based inference can still work, albeit slower. For CPU-based setups, using the phi-2 or phi-3 presets would be your best bet performance-wise among the existing options. Below is a sample Workspace definition for a CPU-based setup:

apiVersion: kaito.sh/v1alpha1
kind: Workspace
metadata:
  name: workspace-falcon-7b
resource:
  instanceType: <FALLBACK_GPU_SKU>
  labelSelector:
    matchLabels:
      app: falcon-7b
  preferredNodes:
  - your_node_name # Name of your existing CPU node
inference:
  preset:
    name: falc…
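
Since the phi presets are the ones recommended above for CPU, here is a minimal sketch of the same idea with a phi preset. It is not an official sample. It follows the layout of Kaito's published v1alpha1 examples, where `resource` and `inference` are top-level fields of the Workspace, so verify it against the CRD installed in your cluster. The workspace name, the `app: phi-2` label, and the preset name `phi-2` are assumptions for illustration. `your_node_name` and `<FALLBACK_GPU_SKU>` are placeholders you need to fill in.

```yaml
apiVersion: kaito.sh/v1alpha1
kind: Workspace
metadata:
  name: workspace-phi-2-cpu        # hypothetical name
resource:
  instanceType: <FALLBACK_GPU_SKU> # placeholder, same role as in the sample above
  labelSelector:
    matchLabels:
      app: phi-2                   # assumed label; the preferred node likely needs this label too
  preferredNodes:
  - your_node_name                 # placeholder: name of your existing CPU node
inference:
  preset:
    name: phi-2                    # assumed preset name; check the presets your Kaito version supports
```

As far as I understand, Kaito only picks up a preferred node if it carries the labels listed under `labelSelector`, so label the CPU node accordingly before applying the manifest.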

Answer selected by ishaansehgal99