chore: add normalize_embeddings to job input

monotykamary · monotykamary · commit 15fc6ea8771e · 2023-09-19T01:11:48.000+07:00
diff --git a/README.md b/README.md
@@ -137,6 +137,6 @@ RUNPOD_AI_API_KEY='**************' RUNPOD_ENDPOINT_ID='*******' python predict.p
 ```
 To run with streaming enabled, use the `--stream` option. To set generation parameters, use the `--params_json` option to pass a JSON string of parameters:
 ```bash
-RUNPOD_AI_API_KEY='**************' RUNPOD_ENDPOINT_ID='*******' python predict.py --params_json '{"sentences": ["Explain The Great Gatsby in 4000 words.", "What is The Great Gatsby about?"]}'
+RUNPOD_AI_API_KEY='**************' RUNPOD_ENDPOINT_ID='*******' python predict.py --params_json '{"sentences": ["Explain The Great Gatsby in 4000 words.", "What is The Great Gatsby about?"], normalize_embeddings: true}'
 ```
 You can generate the API key [here](https://www.runpod.io/console/serverless/user/settings) under API Keys.
diff --git a/handler.py b/handler.py
@@ -23,9 +23,10 @@ def load_model():
 def handler(job):
     job_input = job['input']
     sentences = job_input.pop("sentences")
+    normalize_embeddings = job_input.pop("normalize_embeddings", False)
     model = load_model()
 
-    embeddings = model.encode(sentences)
+    embeddings = model.encode(sentences, normalize_embeddings=normalize_embeddings)
     encoded_embeddings = json.dumps(embeddings, cls=NumpyArrayEncoder)
     decoded_embeddings = json.loads(encoded_embeddings)
     yield decoded_embeddings