src/plugins/intel_gpu/docs/dynamic_shape/runtime_skip.md
3 additions & 3 deletions
@@ -6,12 +6,12 @@ Consider a 4D permute operation with the transformation order [0, 2, 1, 3]. Duri
This example demonstrates how runtime optimization can transform potentially expensive operations to be skipped, highlighting the value of deferred optimization strategies in dynamic computation graphs.
## Basic flow of runtime operation skip
-1. **Relefant flags**
+1. **Relevant flags**
First, we need to set two flags for the program_node of such an operation, which we do not apply shape-based optimization during compilation but try runtime optimization with the shape.
- Static flags (Set during `mark_runtime_skippable_nodes` pass at compilation time)
- `program_node::optimized`
- This flag presents that this node is eligible for being optimized out, either at compilation time or runtime.
-- This flag is set true for all optimization schemes, not limited to runtime skippability.
+- This flag is set true for all optimizatio schemes, not limited to runtime skippability.
- `program_node::runtime_skippable`
- Indicates that this node can be optimized during runtime based on the shape.
- Dynamic flag (Set at runtime)
@@ -27,7 +27,7 @@ If program_node::optimized is false and program_node::runtime_skippable is true,
However, some optimization passes such as [memory_dependency_pass](https://github.com/openvinotoolkit/openvino/blob/aa6d3811e6dea93cb818ff483bf6c3ca849d4034/src/plugins/intel_gpu/src/graph/include/pass_manager.h#L313) applies different decisions for compile time optimized nodes and runtime optimized nodes.
2. **Runtime optimization decision**
-- Once the shape is updated in `primitive_inst::prepare_primitive()`, `do_runtime_skip_*node_type*` for each type of operation decides whehther to skip the node at that exeuction or not.
+- Once the shape is updated in `primitive_inst::prepare_primitive()`, `do_runtime_skip_*node_type*` for each type of operation decides whether to skip the node at that execution or not.
3. **Caveats**
- Once the `primitive_inst::_can_be_optimized` is set true, the runtime will only update its metadata such as shape or padding information and skip the actual execution.
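
The flow described in the changed document (static flags set at compilation time, a per-execution decision once the shape is known, and a dynamic flag that gates execution) can be sketched roughly as follows. This is a minimal illustration, not the intel_gpu implementation: `NodeFlags`, `InstState`, `permute_is_noop`, and `decide_runtime_skip` are hypothetical names, the three booleans only mirror `program_node::optimized`, `program_node::runtime_skippable`, and `primitive_inst::_can_be_optimized`, and both the skip criterion for the [0, 2, 1, 3] permute and the requirement that both static flags be set are assumptions made for this example.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical stand-ins for illustration; these are not the real intel_gpu classes.
struct NodeFlags {
    bool optimized = false;          // plays the role of program_node::optimized
    bool runtime_skippable = false;  // plays the role of program_node::runtime_skippable
};

struct InstState {
    bool can_be_optimized = false;   // plays the role of primitive_inst::_can_be_optimized
};

using Dims = std::array<std::size_t, 4>;

// Assumed skip criterion: a permute moves no data when the axes with extent > 1
// keep their relative order, because size-1 axes do not affect a dense layout.
bool permute_is_noop(const Dims& shape, const Dims& order) {
    bool seen = false;
    std::size_t last_axis = 0;
    for (std::size_t axis : order) {
        if (shape[axis] == 1) continue;              // size-1 axes can move freely
        if (seen && axis < last_axis) return false;  // two real axes were swapped
        last_axis = axis;
        seen = true;
    }
    return true;
}

// Called once per execution, after the shape for that execution is known.
void decide_runtime_skip(const NodeFlags& node, InstState& inst,
                         const Dims& shape, const Dims& order) {
    // Assumption: only nodes carrying both static flags are re-evaluated;
    // for any other node the dynamic flag is left untouched.
    if (!node.optimized || !node.runtime_skippable) {
        return;
    }
    inst.can_be_optimized = permute_is_noop(shape, order);
}
```

With order [0, 2, 1, 3], a shape such as [1, 1, 8, 16] sets the dynamic flag because one of the two swapped axes has extent 1, while [1, 4, 8, 16] clears it again on the next call. Recomputing the flag on every execution is what lets the same node be skipped for one input shape and executed for another.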