Unable to add a duplicate FQDN in provision when Overseer and FL servers specify the same FQDN. #777
Comments
I am running into the same issue, as I want to set up the Overseer and Server on the same FQDN. Is there any reason why this is not enabled (or is it considered bad practice)? |
Also ran into this. For now, one workaround is to set each participant's name/FQDN to a placeholder such as "server.hostname" and "overseer.hostname", then add the mapping to the real IP in the /etc/hosts file on each participant. |
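The workaround above could be sketched roughly as follows. This only illustrates what the /etc/hosts entries might look like: the helper `hosts_lines` is hypothetical (not part of NVFlare), and 192.0.2.10 is a documentation-only address standing in for the real IP.

```python
# Hypothetical helper, not part of NVFlare: build /etc/hosts-style lines
# that point several placeholder participant names at one real IP.
def hosts_lines(ip, names):
    """Return one 'IP<TAB>name' line per placeholder name."""
    return [f"{ip}\t{name}" for name in names]


# Placeholder names used in the project file instead of the shared real FQDN.
placeholders = ["server.hostname", "overseer.hostname"]

# 192.0.2.10 is an assumed example address; substitute the real one.
lines = hosts_lines("192.0.2.10", placeholders)
print("\n".join(lines))
```

The printed lines would then be appended to /etc/hosts on every participant, so each placeholder name resolves to the same real address.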
@IsaacYangSLA and @chesterxgchen can you help answer this question? thanks! |
@parkeraddison This is considered bad practice. The role of the Overseer is to switch clients over to the healthy FL server. Suppose there are two FL servers and one of them is dead: if the Overseer is on the same host as an FL server, the Overseer could crash at the same time when that host's hardware stops working. |
I agree it's bad practice to put the Server and Overseer on the same physical host, but the same FQDN doesn't necessarily imply the same host. E.g. if a load balancer is used in a cloud or k8s deployment, then the same FQDN can forward to different hosts depending on the port. FWIW, if we provision using the Dashboard then the same FQDN can be used (it just might help to rename the files when you download them to avoid confusion). We can't do this in a project.yaml because the service's name is its FQDN... it could be worth separating the two fields, as was proposed in the original post. |
Thanks for the discussion. Supporting the load balancer and k8s ingress cases is something we are considering; in those cases, there is probably no need for an Overseer. I think we have to change the FL server to be stateless first, so it might take more than just adding an additional field to the project YAML file. Let's continue the discussion and suggestions. We might not be able to address this issue in 2.3.0, but we hope to address it in the release after 2.3.0. We welcome your discussion, issues, and PRs.
|
Discussed in #710
Originally posted by asus-ocis July 6, 2022
Hi all,
I want to deploy nvflare 2.1 on k8s. Overseer, server1 and server2 will run on different pods.
We have an HAProxy on k8s, so we need to set the same FQDN.
my project requirements:
But I got the following error after executing provision.
ValueError: Unable to add a duplicate name myk8s.com into this project.
Check spec.py
NVFlare/nvflare/lighter/spec.py
Line 72 in a23afd7
The name field in the participants needs to be unique.
Is it possible to add an FQDN field for the Overseer and FL Server in project.yml in future versions? Or do you have any other suggestions?
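To make the request concrete, here is a minimal sketch of how a separate FQDN field could coexist with the unique-name rule. It is not the actual code in spec.py: the `Participant` and `Project` classes below are simplified stand-ins, the `fqdn` field and `address()` method are hypothetical, and only the error message is taken from the output quoted above.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Participant:
    type: str
    name: str                   # must stay unique within the project
    fqdn: Optional[str] = None  # hypothetical field; falls back to name

    def address(self) -> str:
        # Use the explicit FQDN when given, otherwise the name (current behaviour).
        return self.fqdn or self.name


@dataclass
class Project:
    participants: List[Participant] = field(default_factory=list)

    def add(self, participant: Participant) -> None:
        # Uniqueness is checked on the name only, never on the FQDN.
        if any(p.name == participant.name for p in self.participants):
            raise ValueError(
                f"Unable to add a duplicate name {participant.name} into this project."
            )
        self.participants.append(participant)


project = Project()
project.add(Participant("overseer", "overseer", fqdn="myk8s.com"))
project.add(Participant("server", "server1", fqdn="myk8s.com"))
project.add(Participant("server", "server2", fqdn="myk8s.com"))
addresses = [p.address() for p in project.participants]
```

With this split, the three participants keep distinct names while sharing `myk8s.com` as their address; adding a second participant named `server1` would still raise the `ValueError`.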