Skip to content

SendoRay/simple_LLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

simple_LLM

Some simple or fake implementations of LLM infra function

todo:

training:

parallel_implementation

inference (vllm features): clean_vllm

  • chunked
  • paged attention

cuda algorithm

  • flash attention

About

Some simple or fake implementations of LLM infra function

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages