Add OLMo model #1676

Merged: 3 commits into sgl-project:main from the olmo branch on Oct 16, 2024

Conversation

@janimo (Contributor) commented on Oct 15, 2024:

Support for OLMo models

Note: only the newer checkpoints with the -hf suffix are supported, not the older ones built before OLMo was integrated into transformers; those are deprecated and require an extra package.

https://huggingface.co/collections/allenai/olmo-suite-65aeaae8fe5b6b2122b46778
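For reference, a quick way to try one of the supported -hf checkpoints once this lands (the model name below is just an example from the linked collection; the port is arbitrary):

```
python3 -m sglang.launch_server --model-path allenai/OLMo-1B-hf --port 30000
```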

@merrymercy (Contributor) commented on Oct 16, 2024:

Thanks for the contribution. Can you add it here:

```python
ALL_OTHER_MODELS = [
    ModelCase("Qwen/Qwen2-1.5B"),
    ModelCase("Qwen/Qwen2.5-14B-Instruct"),
    ModelCase("HuggingFaceTB/SmolLM-135M-Instruct"),
]
```

and run the test, as described in the docs below?
### Add the model to the test suite
To make sure the new model is well maintained in the future, it is better to add it to the test suite.
You can add it to the `ALL_OTHER_MODELS` list in the [test_generation_models.py](https://github.com/sgl-project/sglang/blob/main/test/srt/models/test_generation_models.py) and run the following command to test it.
For example, if the model is Qwen/Qwen2-1.5B
```
ONLY_RUN=Qwen/Qwen2-1.5B python3 -m unittest test_generation_models.TestGenerationModels.test_others
```
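For context, the requested change would look roughly like the following; the OLMo checkpoint name below is illustrative, and any supported -hf model from the collection works:

```python
ALL_OTHER_MODELS = [
    ModelCase("Qwen/Qwen2-1.5B"),
    ModelCase("Qwen/Qwen2.5-14B-Instruct"),
    ModelCase("HuggingFaceTB/SmolLM-135M-Instruct"),
    # New entry for this PR (example checkpoint name)
    ModelCase("allenai/OLMo-1B-hf"),
]
```

and then run, for example:

```
ONLY_RUN=allenai/OLMo-1B-hf python3 -m unittest test_generation_models.TestGenerationModels.test_others
```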

@janimo (Contributor, Author) commented on Oct 16, 2024:

I had to bump the decode error tolerance for the test to pass. The resulting tokens are all the same.
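A minimal sketch of such a tolerance bump on the test entry, assuming `ModelCase` accepts a per-model `decode_tolerance` field (the field name, value, and checkpoint name here are assumptions, not taken from this PR):

```python
# Illustrative only: loosen the decode tolerance for the OLMo entry,
# since the compared logits differ slightly while the greedy tokens match.
ModelCase("allenai/OLMo-1B-hf", decode_tolerance=1e-1)
```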

@merrymercy merged commit a5114b6 into sgl-project:main on Oct 16, 2024 (6 of 10 checks passed).
@janimo deleted the olmo branch on Oct 16, 2024 at 07:16.
Review comment on the new model's import block:

```python
from torch import nn
from transformers import OlmoConfig
from vllm.distributed import get_tensor_model_parallel_world_size
from vllm.model_executor.layers.linear import (
```
@zhyncs (Member) commented:
@janimo Could you submit another PR to replace this with SGLang's linear?

@janimo (Contributor, Author) replied:
@zhyncs done #1696
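For reference, the follow-up essentially swaps the import source; a minimal sketch, assuming SGLang exposes equivalent parallel linear layers under `sglang.srt.layers.linear` (the module path and the specific class names below are assumptions, not taken from this PR):

```python
# Before (vllm-provided layers):
# from vllm.model_executor.layers.linear import (
#     MergedColumnParallelLinear,
#     QKVParallelLinear,
#     RowParallelLinear,
# )

# After (SGLang's own layers; assumed module path and class names):
from sglang.srt.layers.linear import (
    MergedColumnParallelLinear,
    QKVParallelLinear,
    RowParallelLinear,
)
```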
