nshchego
Contributor

Details:

  • Weightless cache support in the CPU plugin
  • ...

Tickets:

  • 161826

github-actions bot added labels on Sep 18, 2025: category: inference (OpenVINO Runtime library - Inference), category: Core (OpenVINO Core, aka ngraph), category: IE Tests (OpenVINO Test: plugins and common), category: CPU (OpenVINO CPU plugin), category: build (OpenVINO cmake script / infra), category: transformations (OpenVINO Runtime library - Transformations), category: samples (OpenVINO Runtime Samples)
nshchego marked this pull request as ready for review September 22, 2025 05:25
nshchego requested review from a team as code owners September 22, 2025 05:25
nshchego requested review from mlukasze and mryzhov and removed request for a team September 22, 2025 05:25
nshchego force-pushed the cpu/compile_model/weightless_cache branch 2 times, most recently from b4f3134 to 12b389c September 23, 2025 05:42
Contributor

t-jankowski left a comment

Ok for core parts

nshchego force-pushed the cpu/compile_model/weightless_cache branch 6 times, most recently from 1a9a4c6 to c118718 September 24, 2025 09:17
nshchego force-pushed the cpu/compile_model/weightless_cache branch from c118718 to 7dc1fd8 September 24, 2025 09:24
Contributor

maxnick commented Sep 24, 2025

@EgorDuplensky, could you please review this PR?

maxnick requested a review from Copilot September 24, 2025 10:08
Contributor

Copilot AI left a comment

Pull Request Overview

This pull request implements weightless cache support for the OpenVINO CPU plugin. When CacheMode::OPTIMIZE_SIZE is set, the cached blob stores only the model structure and omits the weight data, which reduces cache file size.

  • Adds weightless serialization/deserialization capabilities to the CPU plugin
  • Implements cache mode configuration and weight path handling
  • Extends test coverage for the new weightless caching functionality
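
To make the idea in the overview concrete, here is a toy sketch of weightless caching in Python. It is not the CPU plugin's serializer or its blob format: the functions `export_cache`, `import_cache`, and `demo`, the JSON layout, and the layer names are all hypothetical and exist only for illustration. The sketch assumes the original weights file is still available at import time, which is what lets the cache omit the weight bytes.

```python
import json
import os
import tempfile


def export_cache(layers, weights_path, weightless):
    """Serialize a toy 'compiled model' (hypothetical format, not OpenVINO's).

    Each layer records where its weights live in the original weights file.
    A full cache also embeds the bytes; a weightless cache keeps only the
    (offset, size) references.
    """
    blob = {"layers": []}
    with open(weights_path, "rb") as f:
        for layer in layers:
            entry = dict(layer)
            if not weightless:
                f.seek(layer["offset"])
                entry["data"] = f.read(layer["size"]).hex()
            blob["layers"].append(entry)
    return json.dumps(blob).encode()


def import_cache(cache_bytes, weights_path):
    """Rebuild the weights, reading from the original file when the cache has none."""
    blob = json.loads(cache_bytes)
    weights = {}
    with open(weights_path, "rb") as f:
        for entry in blob["layers"]:
            if "data" in entry:
                weights[entry["name"]] = bytes.fromhex(entry["data"])
            else:
                f.seek(entry["offset"])
                weights[entry["name"]] = f.read(entry["size"])
    return weights


def demo():
    """Compare full and weightless cache sizes and check both restore equal weights."""
    with tempfile.TemporaryDirectory() as tmp:
        weights_path = os.path.join(tmp, "model.bin")
        with open(weights_path, "wb") as f:
            f.write(bytes(range(256)) * 16)  # 4096 bytes of stand-in weights
        layers = [
            {"name": "conv1", "offset": 0, "size": 1024},
            {"name": "fc1", "offset": 1024, "size": 3072},
        ]
        full = export_cache(layers, weights_path, weightless=False)
        light = export_cache(layers, weights_path, weightless=True)
        same = import_cache(full, weights_path) == import_cache(light, weights_path)
        return len(full), len(light), same
```

Calling `demo()` returns the two cache sizes and a flag saying whether both caches restore identical weights. The trade-off in this sketch is that the weightless cache is only usable while the original weights file remains at a known path, which is presumably why the PR pairs cache mode configuration with weight path handling.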

Reviewed Changes

Copilot reviewed 28 out of 28 changed files in this pull request and generated 5 comments.

| File | Description |
| --- | --- |
| src/tests/functional/plugin/shared/src/behavior/compiled_model/model_cache.cpp | New test implementation for weightless cache accuracy validation |
| src/tests/functional/plugin/shared/include/behavior/compiled_model/model_cache.hpp | Header for weightless cache test framework |
| src/plugins/intel_cpu/src/utils/graph_serializer/serializer.cpp | Core serialization logic with weightless mode support |
| src/plugins/intel_cpu/src/utils/graph_serializer/deserializer.cpp | Core deserialization logic with origin weights handling |
| src/plugins/intel_cpu/src/plugin.cpp | Plugin integration with weightless cache mode detection and model building |
| src/plugins/intel_cpu/src/config.cpp | Configuration handling for cache mode property |
| src/plugins/intel_cpu/src/compiled_model.cpp | Compiled model export with weightless serialization |
| src/inference/src/dev/core_impl.cpp | Core implementation updates for weightless cache support |
| src/core/src/op/util/weightless_caching_attributes.cpp | Weightless cache attribute implementation |

nshchego force-pushed the cpu/compile_model/weightless_cache branch 2 times, most recently from eb9f048 to 3cbf9ee September 25, 2025 08:20
nshchego force-pushed the cpu/compile_model/weightless_cache branch from 3cbf9ee to cd96250 September 25, 2025 11:46
4 participants