Skip to content

Commit 868afc3

Browse files
Eliasj42Elias Josephsaienduri
authored
Moved sharktank runner to ossci cluster (#990)
Progress on #793 Moved sharktank runner to ossci cluster --------- Signed-off-by: Elias Joseph <[email protected]> Co-authored-by: Elias Joseph <[email protected]> Co-authored-by: saienduri <[email protected]>
1 parent da96b47 commit 868afc3

File tree

2 files changed

+6
-5
lines changed

2 files changed

+6
-5
lines changed

.github/workflows/ci-sharktank.yml

+4-3
Original file line numberDiff line numberDiff line change
@@ -93,15 +93,16 @@ jobs:
9393
strategy:
9494
matrix:
9595
python-version: [3.11]
96-
runs-on: [llama-mi300x-3]
96+
runs-on: [linux-mi300-1gpu-ossci]
9797
fail-fast: false
9898
runs-on: ${{matrix.runs-on}}
9999
defaults:
100100
run:
101101
shell: bash
102102
env:
103103
VENV_DIR: ${{ github.workspace }}/.venv
104-
HF_HOME: "/data/huggingface"
104+
HF_HOME: "/shark-cache/data/huggingface"
105+
HF_TOKEN: ${{ secrets.HF_FLUX_TOKEN }}
105106
steps:
106107
- uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
107108

@@ -193,7 +194,7 @@ jobs:
193194
run: |
194195
pytest -v sharktank/ -m punet_quick \
195196
--durations=0 \
196-
--timeout=600
197+
--timeout=900
197198
198199
# Depends on other jobs to provide an aggregate job status.
199200
# TODO(#584): move test_with_data and test_integration to a pkgci integration test workflow?

sharktank/tests/models/llama/quark_parity_test.py

+2-2
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@
1919
class QuarkParityTest(TempDirTestBase):
2020
def setUp(self):
2121
super().setUp()
22-
self.path_prefix = Path("/shark-dev/quark_test")
22+
self.path_prefix = Path("/shark-cache/quark_test")
2323

2424
@with_quark_data
2525
def test_compare_against_quark(self):
@@ -55,7 +55,7 @@ def test_compare_against_quark(self):
5555
"sharktank.examples.paged_llm_v1",
5656
"The capitol of Texas is",
5757
f"--irpa-file={self.path_prefix}/fp8_bf16_weight.irpa",
58-
f"--tokenizer-config-json=/data/llama3.1/8b/tokenizer.json",
58+
f"--tokenizer-config-json=/shark-dev/data/llama3.1/8b/tokenizer.json",
5959
"--fake-quant",
6060
"--attention-kernel=torch",
6161
"--activation-dtype=bfloat16",

0 commit comments

Comments
 (0)