Open
Conversation
qhzhuang
reviewed
Dec 18, 2025
| @@ -0,0 +1,279 @@ | |||
| # MindSpore Transformers & vLLM-MindSpore 插件式服务化部署与评测 | |||
|
|
|||
| # MindSpore Transformers & vLLM-MindSpore 插件式服务化部署与评测 | ||
|
|
||
| ## 目录 | ||
| - [安装部署](#安装部署) |
Collaborator
There was a problem hiding this comment.
这里直接给个链接参考官网环境安装, 不在这里再维护一份了
| ais_bench/benchmark/configs/models/vllm_api/vllm_api_general_chat.py | ||
|
|
||
| # 启动评测 | ||
| python run_benchmark.py --models vllm_api_general --datasets gsm8k_gen_0_shot_cot_str |
Collaborator
There was a problem hiding this comment.
这里的run_benchmark是在哪可以贴一下地址
| ### 精度评测 | ||
| ```bash | ||
| # 修改配置文件 | ||
| ais_bench/benchmark/configs/models/vllm_api/vllm_api_general_chat.py |
2b3cfa5 to
7285069
Compare
qhzhuang
reviewed
Jan 6, 2026
| | 4 | xxx | xxx | [PPT](跳转链接) · [代码](跳转链接) · [视频](跳转链接) · [云沙箱实验](跳转链接) · [学习路径](跳转链接) | [中级认证入口](xxxx) | | ||
| | 1 | MindSpore Transformers基础 | 介绍MindSpore Transformers架构及基本使用。 | [PPT](#) · [代码](#) · [视频](#) · [云沙箱实验](#) · [学习路径](#) | | | ||
| | 2 | vLLM-MindSpore服务化部署 | 学习vLLM-MindSpore的安装、启动及参数配置。 | [PPT](#) · [代码](#) · [视频](https://www.bilibili.com/video/BV1Ys1aBxEFD/?share_source=copy_web&vd_source=fd4588b77d7b0209a532d9279088f606) · [云沙箱实验](#) · [学习路径](#) | [初级认证入口](#) | | ||
| | 3 | 大模型推理高级特性 | 深入理解Chunked Prefill、Prefix Caching等优化技术。 | [PPT](#) · [代码](#) · [视频](#) · [云沙箱实验](#) · [学习路径](#) | | |
Collaborator
There was a problem hiding this comment.
ppt 代码 云沙箱啥的,没有的是不是先不写了,就保留有的就好了。视频如果是共用的话可以改成一列
Comment on lines
297
to
301
| | 3 | 大模型推理高级特性 | 深入理解Chunked Prefill、Prefix Caching等优化技术。 | [PPT](#) · [代码](#) · [视频](#) · [云沙箱实验](#) · [学习路径](#) | | | ||
| | 4 | 混合并行与量化推理 | 掌握混合并行部署及模型量化推理的最佳实践。 | [PPT](#) · [代码](#) · [视频](#) · [云沙箱实验](#) · [学习路径](#) | [中级认证入口](#) | | ||
|
|
||
| # MindSpore Transformers & vLLM-MindSpore 插件式服务化部署与评测 |
Collaborator
There was a problem hiding this comment.
这里标题比较长,直接改成“服务化部署与评测指导”是不是好点
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.