proposal: add enhance mid-tier resource proposal #1762
Conversation
[APPROVALNOTIFIER] This PR is NOT APPROVED. The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files. Approvers can indicate their approval by writing `/approve` in a comment.
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

```
@@           Coverage Diff            @@
##             main    #1762    +/-   ##
========================================
  Coverage   66.11%   66.11%           
========================================
  Files         388      390       +2  
  Lines       42425    42589     +164  
========================================
+ Hits        28048    28159     +111  
- Misses      12305    12346      +41  
- Partials     2072     2084      +12  
```

Flags with carried forward coverage won't be shown. ☔ View full report in Codecov by Sentry.
#### Story 1

There are low-priority online-service tasks whose performance requirements are the same as Prod+LS; they should not be suppressed, but can tolerate being evicted when machine usage spikes.
If Mid pods are allowed to allocate the Prod-unallocated resources, the Prod apps can be affected, since ProdPeak + MidAllocated > NodeAllocatable is possible. To avoid this impact, there should be a design for when and how Prod pods preempt/evict Mid pods when they want to win back the resources that Prod left unallocated but Mid allocated.
Yes. Because Prod pods use cpu/memory resources while Mid pods use mid-cpu/mid-memory, Prod can't preempt Mid directly.
How about evicting based on mid-allocated / mid-allocatable?
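A minimal sketch of what such a ratio-based trigger could look like; `midEvictionRatio` and the helper name are hypothetical, not part of the proposal:

```go
package main

import "fmt"

// midEvictionRatio is a hypothetical threshold: when the ratio of allocated
// Mid resources to Mid allocatable exceeds it, Mid pods become candidates
// for eviction so Prod can win back the unallocated capacity.
const midEvictionRatio = 0.9

// shouldEvictMid reports whether Mid pods should be evicted, based on the
// mid-allocated / mid-allocatable ratio suggested above.
func shouldEvictMid(midAllocated, midAllocatable int64) bool {
	if midAllocatable <= 0 {
		// No Mid capacity left on the node; any Mid allocation is over.
		return midAllocated > 0
	}
	return float64(midAllocated)/float64(midAllocatable) > midEvictionRatio
}

func main() {
	fmt.Println(shouldEvictMid(950, 1000)) // true: 95% > 90%
	fmt.Println(shouldEvictMid(500, 1000)) // false
}
```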
j4ckstraw force-pushed from 0da22aa to 5f4f670, then from 5f4f670 to d41eed8 (Signed-off-by: j4ckstraw <[email protected]>).
I have one more question: How does the scheduler's LoadAware Scheduling plugin support middle tiers?
### Prerequisites

Must use koordinator node reservation if someone wants to use Mid+LS.
Can you provide more details about the prerequisites? Why must one use koordinator node reservation? Is it the Reservation described in this document (20221227-node-resource-reservation.md) or the Reservation defined by the Koordinator SLO?
**native resource or extended resource**

*native resource*:
Hijack the node update and change `node.Status.allocatable`; Mid pods also use native resources. In this situation, Mid is equivalent to a sub-priority within Prod, and resource quota needs adaptive modification.
I don’t quite understand the logic described in this paragraph. Why do we need to hijack node update?
Hijack the node update to add prod-reclaimable on top of the original `node.Status.allocatable`.
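A rough sketch of what that amplification could mean, assuming a hypothetical `prodReclaimable` input produced by the node resource estimation (not the proposal's actual code):

```go
package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/api/resource"
)

// amplifyAllocatable adds the reclaimable Prod resources on top of the
// original node.Status.Allocatable, so Mid pods can be scheduled against
// native cpu/memory. prodReclaimable is a hypothetical input here.
func amplifyAllocatable(node *corev1.Node, prodReclaimable corev1.ResourceList) {
	for name, extra := range prodReclaimable {
		alloc := node.Status.Allocatable[name]
		alloc.Add(extra)
		node.Status.Allocatable[name] = alloc
	}
}

func main() {
	node := &corev1.Node{}
	node.Status.Allocatable = corev1.ResourceList{
		corev1.ResourceCPU: resource.MustParse("32"),
	}
	amplifyAllocatable(node, corev1.ResourceList{
		corev1.ResourceCPU: resource.MustParse("4"),
	})
	fmt.Println(node.Status.Allocatable.Cpu().String()) // 36
}
```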
For Mid+BE pods, they can be placed in the Burstable, or even Guaranteed, QoS class, disobeying the QoS-level policy.

*extended resource*:
Add mid-cpu/mid-memory and insert the extended-resource fields via webhook.
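For illustration, a minimal sketch of such a mutation; the resource names here follow the existing `kubernetes.io/batch-cpu` convention and are assumptions, as is the helper name:

```go
package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/api/resource"
)

// Assumed resource names, modeled on the Batch extended resources.
const (
	MidCPU    corev1.ResourceName = "kubernetes.io/mid-cpu"
	MidMemory corev1.ResourceName = "kubernetes.io/mid-memory"
)

// mutateMidResources sketches what the webhook mutation could do: translate
// native cpu/memory requests of a Mid pod into mid-cpu/mid-memory extended
// resources (mid-cpu accounted in milli-cores, as Batch does).
func mutateMidResources(container *corev1.Container) {
	requests := container.Resources.Requests
	if cpu, ok := requests[corev1.ResourceCPU]; ok {
		requests[MidCPU] = *resource.NewQuantity(cpu.MilliValue(), resource.DecimalSI)
		delete(requests, corev1.ResourceCPU)
	}
	if mem, ok := requests[corev1.ResourceMemory]; ok {
		requests[MidMemory] = mem
		delete(requests, corev1.ResourceMemory)
	}
}

func main() {
	c := &corev1.Container{}
	c.Resources.Requests = corev1.ResourceList{
		corev1.ResourceCPU:    resource.MustParse("500m"),
		corev1.ResourceMemory: resource.MustParse("1Gi"),
	}
	mutateMidResources(c)
	fmt.Println(c.Resources.Requests)
}
```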
And here, do you want to add a new webhook plugin?
Nope.
I need to think about it.
Let us look at the scenario without overselling.

If Prod and Mid pods share a resource account, preemption is required for an upcoming Prod pod; koord-scheduler needs Filter and Preempt plugins to handle this.
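A hedged sketch of what that Filter/Preempt decision could look like, with made-up field names and plain milli-CPU accounting:

```go
package main

import "fmt"

// nodeUsage is a simplified per-node accounting view for the case where
// Prod and Mid share one resource account. All fields are illustrative
// (milli-CPU), not the proposal's actual types.
type nodeUsage struct {
	allocatable   int64
	prodRequested int64
	midAllocated  int64
}

// filterProdPod sketches a possible Filter rule: an upcoming Prod pod fits
// only if Prod requests plus Mid allocations still fit the node. If not,
// it reports whether preempting Mid pods could free enough room.
func filterProdPod(n nodeUsage, podRequest int64) (fits, needPreempt bool) {
	free := n.allocatable - n.prodRequested - n.midAllocated
	if podRequest <= free {
		return true, false
	}
	return false, podRequest <= free+n.midAllocated
}

func main() {
	n := nodeUsage{allocatable: 32000, prodRequested: 28000, midAllocated: 3000}
	fits, needPreempt := filterProdPod(n, 2000)
	fmt.Println(fits, needPreempt) // false true: Mid pods must be preempted
}
```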
Please clarify the specific rules of filtering and preemption.
**share resource account or not**

Let us look at the scenario without overselling.
What does "overselling" mean here?
**cpuShares**

Configured according to requests.mid-cpu:
- for Mid+LS, same as Prod+LS
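For reference, a sketch of the standard kubelet-style milli-CPU to cpu.shares conversion that requests.mid-cpu would presumably follow (an assumption, not the proposal's code):

```go
package main

import "fmt"

const (
	sharesPerCPU  = 1024 // cpu.shares granted per whole CPU
	milliCPUToCPU = 1000 // milli-CPU units per CPU
	minShares     = 2    // kernel minimum for cpu.shares
)

// milliCPUToShares mirrors the standard kubelet conversion from milli-CPU
// to cgroup cpu.shares; requests.mid-cpu is accounted in milli-cores, so
// the same formula would apply.
func milliCPUToShares(milliCPU int64) int64 {
	if milliCPU == 0 {
		return minShares
	}
	shares := milliCPU * sharesPerCPU / milliCPUToCPU
	if shares < minShares {
		return minShares
	}
	return shares
}

func main() {
	fmt.Println(milliCPUToShares(500)) // 512
}
```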
I think both the pod-level and container-level of the Mid pods follow the same rule as the Batch extended resources if the pods allocate Mid extended resources. So this statement is confusing. I am not sure if you are talking about the QoS-level cgroups. We'd better either make a concise and clear expression or add a comprehensive diagram to clarify the design.
**CPU Eviction**

CPU eviction is currently linked to pod satisfaction.
In the long term, however, it should be done from the perspective of the operating system, like memory eviction.
Could you please provide more information on why and how CPU eviction would be done from the perspective of the OS?
Eviction is sorted by priority and resource model (see the sketch below):
- Batch first and then Mid.
- Mid+LS first and then Mid+BE; for Mid pods, request and usage should be taken into account when evicting, for fairness reasons.
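A minimal sketch of such an eviction ordering; the candidate type and scoring are illustrative assumptions, not the proposal's implementation:

```go
package main

import (
	"fmt"
	"sort"
)

// evictionCandidate is a simplified view of a pod considered for eviction;
// the fields are illustrative, not the proposal's actual types.
type evictionCandidate struct {
	name     string
	priority int // e.g. Batch < Mid
	usage    int64
	request  int64
}

// sortForEviction orders candidates so lower-priority pods are evicted
// first (Batch before Mid); within the same priority, pods using more
// relative to their request go first, for fairness. Cross-multiplication
// avoids dividing by a zero request.
func sortForEviction(pods []evictionCandidate) {
	sort.Slice(pods, func(i, j int) bool {
		if pods[i].priority != pods[j].priority {
			return pods[i].priority < pods[j].priority
		}
		return pods[i].usage*pods[j].request > pods[j].usage*pods[i].request
	})
}

func main() {
	pods := []evictionCandidate{
		{"mid-a", 2, 900, 1000},
		{"batch-a", 1, 100, 1000},
		{"mid-b", 2, 400, 1000},
	}
	sortForEviction(pods)
	fmt.Println(pods) // batch-a, then mid-a, then mid-b
}
```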
> Mid+LS first and then Mid+BE
It may not work well if you deploy online services as Mid+LS pods and deploy stream-computing jobs as Mid+BE. Think about the online service pods being evicted earlier than the job pods. Though the Mid+BE pods can be suppressed to reduce interference with Prod resources, it cannot be a reason for Mid+LS to be a lower priority in eviction. Please avoid unnecessary coupling of the priority and QoS if there is no proper design.
Got it, thank you for your advice.
/milestone 1.5

@ZiMengSheng: The provided milestone is not valid for this repository. Milestones in this repository: […] Use `/milestone clear` to clear the milestone. In response to this: /milestone 1.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

/milestone v1.5
At the moment we need to change this to

```
Allocatable[Mid] := min(Reclaimable[Mid], NodeAllocatable * thresholdRatio) + Unallocated[Mid]
```
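A small sketch evaluating this formula with plain integers (the real implementation would presumably use `resource.Quantity`):

```go
package main

import "fmt"

// midAllocatable computes the proposed formula
//   Allocatable[Mid] := min(Reclaimable[Mid], NodeAllocatable * thresholdRatio) + Unallocated[Mid]
// using int64 milli-units for brevity.
func midAllocatable(reclaimable, nodeAllocatable, unallocated int64, thresholdRatio float64) int64 {
	capped := int64(float64(nodeAllocatable) * thresholdRatio)
	if reclaimable < capped {
		capped = reclaimable
	}
	return capped + unallocated
}

func main() {
	// 32-core node, 6 cores reclaimable from Prod, 4 cores unallocated;
	// a threshold ratio of 0.1 caps the reclaimable part at 3.2 cores.
	fmt.Println(midAllocatable(6000, 32000, 4000, 0.1)) // 7200
}
```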
According to my understanding, adding Unallocated here is to allow Mid+LS to also use unallocated resources in the cluster, but there is a problem: using unallocated resources will affect the view of Prod resources, and ultimately we need to support Prod's preemption of Mid, which in turn affects the stability of Mid resources.
We are also considering applying node prediction to the amplification factor of the Node, so that Prod can be directly oversold and Quota management and priority preemption stay consistent with the native semantics of Kubernetes.
Does this satisfy your Mid+LS need?
Consider such a scenario:
A user deployed a Mid+LS pod while the Mid resources of the cluster were insufficient, so the cluster autoscaler scaled out one new node.
The problem is that there are no Prod pods on the new node, so no Mid resources are available either. That is why we want to allow Mid+LS to use unallocated resources in the cluster.
Ⅰ. Describe what this PR does
Ⅱ. Does this pull request fix one issue?
Ⅲ. Describe how to verify it
Ⅳ. Special notes for reviews
V. Checklist
make test