Reduce code duplication for ci-llama tests #1031
base: main
Conversation
Some first comments.
  shell: bash
  run: python -m venv ${{ inputs.venv-dir }}

- name: Install Dependencies
You might want to cache those.
Could you please elaborate with an example?
If using UV you could look at the "- name: Setup UV caching" and "- name: Cache UV packages" steps, where the actions/cache action is used. The first step is normally skipped when using pip but could be considered as well. In addition to that, the setup-python action also comes with built-in cache support, see https://github.com/actions/setup-python?tab=readme-ov-file#caching-packages-dependencies. Thus there are multiple options, and we use several of them in the different workflows.
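For illustration, here is a minimal sketch of two of those options; the Python version, cache key, and cache path below are assumptions rather than values taken from this repository:

    # Option A: let setup-python manage the pip cache (its built-in cache support).
    - name: Set up Python
      uses: actions/setup-python@v5
      with:
        python-version: "3.11"   # assumed version
        cache: "pip"

    # Option B: cache the UV package directory explicitly with actions/cache.
    - name: Cache UV packages
      uses: actions/cache@v4
      with:
        path: ~/.cache/uv        # UV's default cache location on Linux
        key: uv-${{ runner.os }}-${{ hashFiles('requirements*.txt') }}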
The tests are currently failing but this should be good to merge if it's the same failure as on main. LGTM, but I think you should get @ScottTodd's review as well. This might come in as a separate PR, but might be a good idea to eventually merge this action with https://github.com/nod-ai/shark-ai/blob/main/.github/actions/pkgci-setup/action.yml and make the different behavior between the two depend on an option such that if we e.g. add caching / migrate to UV, changing one place suffices.
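As a rough sketch of that idea, the merged action could expose a single switch that selects the install tool; the input name use-uv, the requirements file, and the step body below are assumptions and not the actual contents of either action:

    name: "Setup test venv"
    description: "Hypothetical merged setup action"
    inputs:
      venv-dir:
        description: "Directory for the virtual environment"
        required: true
      use-uv:
        description: "Install dependencies with UV instead of pip"
        default: "false"
    runs:
      using: "composite"
      steps:
        - name: Create venv and install dependencies
          shell: bash
          run: |
            python -m venv "${{ inputs.venv-dir }}"
            source "${{ inputs.venv-dir }}/bin/activate"
            if [ "${{ inputs.use-uv }}" = "true" ]; then
              pip install uv
              uv pip install -r requirements.txt   # assumed requirements file
            else
              pip install -r requirements.txt      # assumed requirements file
            fi

With a single switch like this, later changes such as adding caching or migrating fully to UV only need to touch one place.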
This might come in as a separate PR, but might be a good idea to eventually merge this action with
https://github.com/nod-ai/shark-ai/blob/main/.github/actions/pkgci-setup/action.yml
and make the different behavior between the two depend on an option such that if we e.g. add caching / migrate to UV, changing one place suffices.
Let's do that here.
Our integration test workflows should have the same architecture:
1. Sanity check the runner environment and perform any necessary bookkeeping
2. Prepare the environment by installing shark-ai packages
3. Run tests/benchmarks
4. Report test/benchmark results
The "install" part of step 2 is critical for an integration test. Installing should be either from stable releases, nightly releases, or dev releases (e.g. pkgci.yml). Installing should not be using pip install -e
with editable sources.
See also https://iree.dev/reference/bindings/python/#prebuilt-packages. The concepts are the same here.
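To make the distinction concrete, the install step of an integration test job would look roughly like the sketch below; the package name, the wheel locations, and the omitted nightly index URL are assumptions, and the exact commands live in the project's release docs:

    - name: Install shark-ai packages (no editable installs)
      shell: bash
      run: |
        source ${VENV_DIR}/bin/activate
        # Stable: released packages from PyPI.
        pip install shark-ai
        # Nightly: the same packages resolved against the nightly release index
        # (exact --find-links / index URL intentionally omitted here).
        # Dev (pkgci-style): wheels built earlier in the same workflow run, e.g.
        #   pip install wheelhouse/*.whl
        # Avoid for integration tests: pip install -e .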
I have three concrete suggestions for the "quick" llama tests, in order of preference:
1. Get the llama tests running as part of the existing "Run LLM Integration Tests" step in shark-ai/.github/workflows/pkgci_shark_ai.yml (lines 149 to 155 in 5029737):

   - name: Run LLM Integration Tests
     run: |
       source ${VENV_DIR}/bin/activate
       pytest -v --test_device=${{ matrix.test_device }} \
         --junitxml=integration-test-${{ matrix.name }}.xml \
         app_tests/integration_tests/llm/shortfin/open_llama_3b_llm_server_test.py \
         --log-cli-level=INFO

2. Add a new job to https://github.com/nod-ai/shark-ai/blob/main/.github/workflows/pkgci_shark_ai.yml
3. Add a new workflow in the style of https://github.com/nod-ai/shark-ai/blob/main/.github/workflows/pkgci_shark_ai.yml and call it from https://github.com/nod-ai/shark-ai/blob/main/.github/workflows/pkgci.yml
I would strongly prefer option (1) there. There should be a unified way to run all tests across the project. Fragmenting across different test commands and workflows is going to be an ongoing source of complexity and confusion. Option (2) keeps the workflows relatively simple while allowing for different pytest commands and other job steps. Option (3) allows for more custom workflow code per test type.
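For reference, option (2) could look roughly like the job sketched below; the job name, runner label, needs dependency, and test selection are assumptions, and the real inputs of the pkgci-setup action are omitted:

    test_quick_llama:
      name: "Quick llama integration tests"
      needs: [build_packages]          # assumed name of the package-build job
      runs-on: linux-mi300-gpu-ossci   # placeholder runner label
      steps:
        - uses: actions/checkout@v4
        - name: Setup venv from dev packages
          uses: ./.github/actions/pkgci-setup
          # (the action's actual inputs are omitted in this sketch)
        - name: Run quick llama tests
          run: |
            # VENV_DIR is assumed to be set at the workflow level, as in the existing jobs.
            source ${VENV_DIR}/bin/activate
            pytest -v --log-cli-level=INFO \
              app_tests/integration_tests/llm/shortfin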
As for the "large" tests, I could see either a single "nightly_ci" workflow like "pkgci" that uses a common trigger to launch subjobs or the status quo of individual workflows that each have their own scheduled triggers. In either case, those workflows should be using a setup action that either builds and installs dev packages (à la pkgci.yml) or installs nightly packages. Only unit tests and package workflows should be building the projects from source. Integration tests should be only using already built packages.
Got a question: can this be pkgci rather than CI, so that we only build the packages once per PR?
Yes. All of my suggestions in #1031 (comment) involve moving into pkgci.
Currently it is a trial using action.yml.