Installation of OpenFL director keeps failing (k8s deployment) #36

@kminhta

I am trying to deploy OpenFL on FedLCM. I set LIFECYCLEMANAGER_EXPERIMENT_ENABLED to true in the k8s_deploy.yaml for the backend.

I followed the instructions at https://github.com/FederatedAI/FedLCM/blob/main/doc/OpenFL_Guide.md, but the installation of the director keeps failing and I am unsure how to troubleshoot it. Do you have any insights, or advice on setting the director parameters?

This is the error description:

failed to install openfl director, error: job is Failed, job info: &{93231661-4d62-49a6-88d0-50fd70788bc8 2023-03-29 21:32:39.121 +0000 UTC 0001-01-01 00:00:00 +0000 UTC ClusterInstall ef50e111-9122-4a41-b22d-eac5525862b9 admin map[director:{director Running Undefined 2023-03-29 21:32:39.121 +0000 UTC 0001-01-01 00:00:00 +0000 UTC} notebook:{notebook Running Undefined 2023-03-29 21:32:39.121 +0000 UTC 0001-01-01 00:00:00 +0000 UTC}] Failed 1h0m0s 0xc0005602a0 [update job status to Running create Cluster in DB Success overwrite current installation helm install Success checkout Cluster status [3362] checkout Cluster status timeOut!] {3 2023-03-29 21:32:39.122 +0000 UTC 2023-03-29 22:32:40.015 +0000 UTC {0001-01-01 00:00:00 +0000 UTC false}}}
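
The job info above is easier to read once the timestamps and step messages are pulled out. Below is a minimal Python sketch that does this for the tail of the message. The variable names and the regular expression are illustrative only and are not part of FedLCM; the string is copied from the error text.

```python
import re
from datetime import datetime

# Tail of the job info from the error above, copied verbatim.
job_info = (
    "helm install Success checkout Cluster status [3362] "
    "checkout Cluster status timeOut!] "
    "{3 2023-03-29 21:32:39.122 +0000 UTC 2023-03-29 22:32:40.015 +0000 UTC "
    "{0001-01-01 00:00:00 +0000 UTC false}}}"
)

# Match timestamps that carry fractional seconds, e.g. "2023-03-29 21:32:39.122".
# The zero-value time "0001-01-01 00:00:00" has no fraction and is skipped.
stamps = re.findall(r"\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+", job_info)
start, end = (datetime.strptime(s, "%Y-%m-%d %H:%M:%S.%f") for s in stamps[:2])

elapsed = end - start
print("elapsed:", elapsed)
print("helm install succeeded:", "helm install Success" in job_info)
print("status check timed out:", "timeOut" in job_info)
```

The elapsed time comes out at just over one hour, which lines up with the `1h0m0s` value in the job info. Read this way, the log suggests that `helm install` itself reported success and that the job was marked Failed only when the cluster status check timed out.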

Thank you.
