SendoRay / simple_LLM Public


Simple or mock implementations of LLM infrastructure functions

Apache-2.0 license

LICENSE
README.md
llm_parallel_simulator.py


simple_LLM

Simple or mock implementations of LLM infrastructure functions

todo:

- training:
  - parallel_implementation
- inference (vllm features): clean_vllm
  - chunked
  - paged attention
- cuda algorithm
  - flash attention
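
To illustrate the "paged attention" item in the todo list above, here is a minimal sketch of the block-table bookkeeping behind the idea. The KV cache is split into fixed-size blocks, and each sequence keeps a table mapping its logical blocks to physical ones. This is not code from this repository or from vLLM. The names `BlockAllocator`, `BLOCK_SIZE`, `append_token`, and `free` are hypothetical and chosen only for illustration, and the sketch tracks block ids only, not actual key/value tensors.

```python
from __future__ import annotations

BLOCK_SIZE = 4  # tokens per KV-cache block (assumed value for this sketch)


class BlockAllocator:
    """Toy allocator: maps each sequence's tokens to fixed-size physical blocks."""

    def __init__(self, num_blocks: int) -> None:
        # Physical block ids that are currently unused.
        self.free_blocks: list[int] = list(range(num_blocks))
        # seq_id -> list of physical block ids (the "block table").
        self.block_tables: dict[int, list[int]] = {}
        # seq_id -> number of tokens stored so far.
        self.seq_lens: dict[int, int] = {}

    def append_token(self, seq_id: int) -> tuple[int, int]:
        """Reserve a slot for one new token; return (physical_block, offset)."""
        table = self.block_tables.setdefault(seq_id, [])
        length = self.seq_lens.get(seq_id, 0)
        if length % BLOCK_SIZE == 0:
            # The last block is full (or none exists yet): take a fresh block.
            if not self.free_blocks:
                raise MemoryError("no free KV-cache blocks")
            table.append(self.free_blocks.pop(0))
        self.seq_lens[seq_id] = length + 1
        return table[length // BLOCK_SIZE], length % BLOCK_SIZE

    def free(self, seq_id: int) -> None:
        """Return all blocks of a finished sequence to the free pool."""
        self.free_blocks.extend(self.block_tables.pop(seq_id, []))
        self.seq_lens.pop(seq_id, None)
```

In this sketch a sequence only claims a new block when its last one is full, so unused cache memory per sequence is bounded by one block, and blocks released by `free` can be reused by any other sequence.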


Languages

Python 100.0%