Description
What is your proposal:
The current YARN with Koordinator solution synchronizes all batch resources to the YARN ResourceManager (RM). We want to improve this so that BE pods and YARN tasks can share batch resources, which requires a better mechanism for synchronizing and managing batch resources between Koordinator and YARN. We propose a new configuration, thirdPartyResourceConfig, that is used to calculate the amount of batch resources YARN may use and to control the cgroup of YARN tasks in real time.
Example:
slo-colocation-config: |
  {
    ...
    "thirdPartyResourceConfig": [
      {
        "thirdPartyName": "hadoop-yarn",
        "batchResourceRatio": {
          "batchCpu": 80,
          "batchMemory": 80
        },
        "cgroupPath": "/hadoop-yarn"
      }
    ]
    ...
  }
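To make the intent of the configuration concrete, here is a minimal Go sketch of how an entry could be parsed and turned into a batch resource limit for YARN. It rests on assumptions that are not confirmed by the proposal: the ratio values are read as percentages (80 meaning 80% of the node's batch resources), and the type and function names (ThirdPartyResourceConfig, ParseColocationConfig, ThirdPartyBatchLimit) are hypothetical, not existing Koordinator APIs.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// BatchResourceRatio holds the share of batch resources granted to a third party.
// Assumption: values are percentages in the range 0..100.
type BatchResourceRatio struct {
	BatchCPU    int64 `json:"batchCpu"`
	BatchMemory int64 `json:"batchMemory"`
}

// ThirdPartyResourceConfig mirrors one entry of the proposed
// "thirdPartyResourceConfig" list. The Go type name is hypothetical.
type ThirdPartyResourceConfig struct {
	ThirdPartyName     string             `json:"thirdPartyName"`
	BatchResourceRatio BatchResourceRatio `json:"batchResourceRatio"`
	CgroupPath         string             `json:"cgroupPath"`
}

// ColocationConfig models only the part of slo-colocation-config used here.
type ColocationConfig struct {
	ThirdPartyResourceConfig []ThirdPartyResourceConfig `json:"thirdPartyResourceConfig"`
}

// ParseColocationConfig decodes the JSON payload of the config entry.
func ParseColocationConfig(raw string) (*ColocationConfig, error) {
	cfg := &ColocationConfig{}
	if err := json.Unmarshal([]byte(raw), cfg); err != nil {
		return nil, fmt.Errorf("parse colocation config: %w", err)
	}
	return cfg, nil
}

// ThirdPartyBatchLimit returns the portion of a node's batch resource that the
// third party may use. The ratio is clamped to 0..100 before it is applied.
func ThirdPartyBatchLimit(nodeBatchTotal, ratioPercent int64) int64 {
	if ratioPercent < 0 {
		ratioPercent = 0
	}
	if ratioPercent > 100 {
		ratioPercent = 100
	}
	return nodeBatchTotal * ratioPercent / 100
}

func main() {
	raw := `{
	  "thirdPartyResourceConfig": [
	    {
	      "thirdPartyName": "hadoop-yarn",
	      "batchResourceRatio": {"batchCpu": 80, "batchMemory": 80},
	      "cgroupPath": "/hadoop-yarn"
	    }
	  ]
	}`
	cfg, err := ParseColocationConfig(raw)
	if err != nil {
		panic(err)
	}

	// Illustrative node totals: 10000 milli-cores and 20 GiB of batch memory.
	const nodeBatchCPUMilli = int64(10000)
	const nodeBatchMemBytes = int64(20) << 30

	for _, tp := range cfg.ThirdPartyResourceConfig {
		cpu := ThirdPartyBatchLimit(nodeBatchCPUMilli, tp.BatchResourceRatio.BatchCPU)
		mem := ThirdPartyBatchLimit(nodeBatchMemBytes, tp.BatchResourceRatio.BatchMemory)
		fmt.Printf("%s (cgroup %s): cpu=%dm memory=%d bytes\n",
			tp.ThirdPartyName, tp.CgroupPath, cpu, mem)
	}
}
```

Under these assumptions, the computed values would be what gets reported to the YARN RM and enforced on the cgroup at cgroupPath, while the remaining batch resources stay available to BE pods.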
Why is this needed:
Described above
Is there a suggested solution, if so, please add it:
Here is an initial draft for the detailed design