How to use an auth token to use Llama 2 in vLLM? And a question about presence_penalty #717
I can't find anything in the docs or in the repo (https://github.com/vllm-project/vllm/tree/main) about using a Hugging Face token to download a gated model like Llama 2 (e.g. Llama-2-13b-chat-hf). On SageMaker I get this error:

Repo model meta-llama/Llama-2-13b-chat-hf is gated. You must be authenticated to access it.
[INFO ] PyProcess - Cannot access gated repo for url https://huggingface.co/meta-llama/Llama-2-13b-hf/resolve/main/config.json.

What should I do?

from vllm import LLM, SamplingParams

model_name = "meta-llama/Llama-2-13b-chat-hf"
# top_k must be an integer number of candidate tokens (a float like 0.4 is rejected)
sampling_params = SamplingParams(temperature=0.1, top_p=0.75, top_k=40, presence_penalty=1.17)
llm = LLM(model=model_name, tensor_parallel_size=4)

By the way, is presence_penalty the "repetition_penalty" known from other models, or does frequency_penalty in SamplingParams play that role?

Replies: 3 comments
- I found a claim in #539 that frequency_penalty is the same as repetition_penalty, though I'm not sure it's true.
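For reference, vLLM's presence_penalty and frequency_penalty are both additive adjustments to token logits, while Hugging Face's repetition_penalty is multiplicative, so neither is an exact match. A minimal sketch of the parameters in question, reusing the values from the question above:

from vllm import SamplingParams

# presence_penalty (OpenAI-style): a flat additive penalty applied to any
# token that has already appeared in the output, regardless of count.
# frequency_penalty (OpenAI-style): an additive penalty that scales with
# how many times the token has appeared so far.
# Hugging Face's repetition_penalty instead multiplies/divides the logit,
# so neither vLLM parameter reproduces it exactly.
params = SamplingParams(
    temperature=0.1,
    top_p=0.75,
    top_k=40,
    presence_penalty=1.17,
    frequency_penalty=0.0,
)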
- For anyone who needs a solution:
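A minimal sketch of the usual approach, assuming a Hugging Face access token (generated at https://huggingface.co/settings/tokens) for an account that has been granted access to the gated repo; the "hf_..." value is a placeholder:

import os
from huggingface_hub import login

# Make the token visible to huggingface_hub, which vLLM uses for downloads.
os.environ["HUGGING_FACE_HUB_TOKEN"] = "hf_..."

# Optionally persist the token in the local Hugging Face cache as well.
login(token=os.environ["HUGGING_FACE_HUB_TOKEN"])

from vllm import LLM

# With the token in place, vLLM can pull the gated weights as usual.
llm = LLM(model="meta-llama/Llama-2-13b-chat-hf", tensor_parallel_size=4)

On SageMaker, the usual equivalent is to pass HUGGING_FACE_HUB_TOKEN as an environment variable in the container or endpoint configuration rather than hard-coding it.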
- Also found some docs on integrating it with vLLM: https://docs.mistral.ai/deployment/self-deployment/vllm/
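Guides like that one generally come down to starting vLLM's OpenAI-compatible server in an authenticated environment; a rough sketch using the model from the question above (the token value is a placeholder):

HUGGING_FACE_HUB_TOKEN=hf_... python -m vllm.entrypoints.openai.api_server \
    --model meta-llama/Llama-2-13b-chat-hf \
    --tensor-parallel-size 4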