Skip to content

Actions: sgl-project/sglang

Execute Notebooks

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,294 workflow runs
3,294 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[router] cache-aware load-balancing router v1 (#2114)
Execute Notebooks #906: Commit cbedd1d pushed by ByronHsu
November 23, 2024 16:34 7m 47s main
November 23, 2024 16:34 7m 47s
[TEST] flashinfer version upgrade to v0.2.0
Execute Notebooks #905: Pull request #2054 synchronize by james-p-xu
November 23, 2024 16:17 31m 35s james-p-xu:test_flashinfer_version_upgrade
November 23, 2024 16:17 31m 35s
fix: resolve bench_serving args (#2139)
Execute Notebooks #904: Commit ad47749 pushed by zhyncs
November 23, 2024 09:45 8m 21s main
November 23, 2024 09:45 8m 21s
fix: resolve bench_serving args
Execute Notebooks #903: Pull request #2139 synchronize by zhyncs
November 23, 2024 09:36 7m 45s zhyncs/fix
November 23, 2024 09:36 7m 45s
fix: resolve bench_serving args
Execute Notebooks #902: Pull request #2139 synchronize by zhyncs
November 23, 2024 09:35 14s zhyncs/fix
November 23, 2024 09:35 14s
fix: resolve bench_serving args
Execute Notebooks #901: Pull request #2139 opened by zhyncs
November 23, 2024 09:33 2m 21s zhyncs/fix
November 23, 2024 09:33 2m 21s
Fix dp print message (#2138)
Execute Notebooks #900: Commit 751c3a0 pushed by merrymercy
November 23, 2024 09:22 9m 35s main
November 23, 2024 09:22 9m 35s
Fix dp print message
Execute Notebooks #899: Pull request #2138 opened by merrymercy
November 23, 2024 09:18 5m 14s pr-fix-dp-rank
November 23, 2024 09:18 5m 14s
Add concurrency option for benchmark (#2136)
Execute Notebooks #898: Commit 60769be pushed by zhyncs
November 23, 2024 09:07 15m 47s main
November 23, 2024 09:07 15m 47s
[CI] Fix test cases (#2137)
Execute Notebooks #896: Commit a78d8f8 pushed by merrymercy
November 23, 2024 09:00 7m 22s main
November 23, 2024 09:00 7m 22s
Add concurrency option for benchmark
Execute Notebooks #895: Pull request #2136 synchronize by zhyncs
November 23, 2024 08:54 12m 56s cermeng:add-bench-concurrency
November 23, 2024 08:54 12m 56s
[CI] Fix test cases
Execute Notebooks #894: Pull request #2137 opened by merrymercy
November 23, 2024 08:52 7m 55s pr-fix
November 23, 2024 08:52 7m 55s
Fix grid size in Triton decoding kernel (#2134)
Execute Notebooks #893: Commit c5f8650 pushed by zhyncs
November 23, 2024 08:51 8m 38s main
November 23, 2024 08:51 8m 38s
Add concurrency option for benchmark
Execute Notebooks #892: Pull request #2136 opened by cermeng
November 23, 2024 08:06 7m 38s cermeng:add-bench-concurrency
November 23, 2024 08:06 7m 38s
feat(srt): support prefill and generate with input_embeds
Execute Notebooks #889: Pull request #2082 synchronize by XuehaiPan
November 23, 2024 07:03 15m 7s XuehaiPan:generation-input-embeds
November 23, 2024 07:03 15m 7s
Fix grid size in Triton decoding kernel
Execute Notebooks #888: Pull request #2134 synchronize by ispobock
November 23, 2024 06:59 7m 28s ispobock:fix-decode-attn
November 23, 2024 06:59 7m 28s
Fix grid size in Triton decoding kernel
Execute Notebooks #887: Pull request #2134 opened by ispobock
November 23, 2024 06:58 39s ispobock:fix-decode-attn
November 23, 2024 06:58 39s
Add simple CPU offloading support. (#2081)
Execute Notebooks #886: Commit d98fa1e pushed by merrymercy
November 23, 2024 06:23 8m 5s main
November 23, 2024 06:23 8m 5s
Add simple CPU offloading support.
Execute Notebooks #885: Pull request #2081 synchronize by merrymercy
November 23, 2024 05:59 12m 33s janimo:cpu-offload
November 23, 2024 05:59 12m 33s
Add simple CPU offloading support.
Execute Notebooks #884: Pull request #2081 synchronize by merrymercy
November 23, 2024 05:59 8s janimo:cpu-offload
November 23, 2024 05:59 8s
Add simple CPU offloading support.
Execute Notebooks #883: Pull request #2081 synchronize by merrymercy
November 23, 2024 05:59 5m 8s janimo:cpu-offload
November 23, 2024 05:59 5m 8s
[router] cache-aware load-balancing router v1
Execute Notebooks #882: Pull request #2114 synchronize by ByronHsu
November 23, 2024 04:38 8m 14s ByronHsu:byhsu/approx-v2-new
November 23, 2024 04:38 8m 14s
Add initial support for intel Gaudi accelerators (#2121)
Execute Notebooks #881: Commit 865233e pushed by merrymercy
November 23, 2024 04:22 7m 44s main
November 23, 2024 04:22 7m 44s
Revert "Only stream output on tp rank 0" (#2130)
Execute Notebooks #879: Commit 66d4859 pushed by merrymercy
November 22, 2024 23:46 8m 6s main
November 22, 2024 23:46 8m 6s
ProTip! You can narrow down the results and go further in time using created:<2024-11-22 or the other filters available.