feat: Add vector search to Python Backend #13

shuangela · 2025-10-30T21:07:16Z

Adds vector search capabilities to the Python backend.

Completion of Python Backend

This pull request completes the Python backend.

What to Review

Verify that the new vector search endpoint works.

Method	Endpoint
GET	/api/movies/vector-search

Testing Recommendations

Create a virtual environment: python -m venv venv
Activate the virtual environment: source venv/bin/activate
Install pip-tools: pip install pip-tools
Install dependencies: pip-sync requirements.txt
Set up your .env. You will need a Voyage API key to run this; ask Angela or Taylor for it.
Run the server: fastapi run main.py --reload
Open localhost/docs for a UI where you can test every endpoint.
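
For a quick check outside the docs UI, the request can also be built by hand. This is a minimal sketch, not part of the PR: the query parameter names `q` and `limit` are hypothetical (check localhost/docs for the real ones), and port 8000 is assumed because it is the FastAPI CLI default.

```python
import json
from urllib.parse import urlencode, urlunsplit
from urllib.request import urlopen

VECTOR_SEARCH_PATH = "/api/movies/vector-search"  # endpoint listed in this PR


def build_vector_search_url(query: str, host: str = "localhost",
                            port: int = 8000, limit: int = 5) -> str:
    """Build the GET URL for the vector search endpoint.

    The parameter names "q" and "limit" are assumptions for illustration.
    """
    params = urlencode({"q": query, "limit": limit})
    return urlunsplit(("http", f"{host}:{port}", VECTOR_SEARCH_PATH, params, ""))


def fetch_json(url: str):
    """Send the GET request and decode the JSON body (needs a running server)."""
    with urlopen(url) as response:
        return json.load(response)
```

With the server running, `fetch_json(build_vector_search_url("space adventure"))` should return the decoded response body, provided the parameter names match the actual endpoint.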

Leftover question

Should we reorder the comment at the top that lists all the endpoints so that it matches the order in which the endpoints are implemented? Search and Vector Search are currently implemented before the other endpoints but are listed last in that comment.

tmcneil-mdb

Very nice PR! 🎃

I am approving, but there is one blocking change.

We need to update requirements.txt to include the new voyageai module.
We have pip-tools installed, so it's a quick fix.

Double-check your version of voyageai via pip show voyageai
Manually add voyageai to requirements.in (I would put it under section 4). I would use pinning to make it more flexible.
Run pip-compile requirements.in. This automatically regenerates requirements.txt.
Run pip install -r requirements.txt to ensure everything still works, then test an endpoint. It works on my side!
Commit the new requirements
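
One possible reading of "pinning to make it more flexible" is the compatible-release operator (`~=`) from PEP 440, which accepts later patch releases of the installed version. The sketch below shows how such a requirements.in line could be derived; the helper names are hypothetical, and the version numbers in the usage note are placeholders, not the actual voyageai version.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def compatible_release_line(package: str, installed_version: str) -> str:
    """Return a requirements.in line based on the installed version.

    "~=" needs at least two version segments (e.g. "~=1.2.3"), so a
    single-segment version falls back to a plain lower bound.
    """
    if "." not in installed_version:
        return f"{package}>={installed_version}"
    return f"{package}~={installed_version}"


def suggest_requirement(package: str) -> Optional[str]:
    """Look up the installed version and suggest a line, or None if not installed."""
    try:
        return compatible_release_line(package, version(package))
    except PackageNotFoundError:
        return None
```

For example, `compatible_release_line("voyageai", "1.2.3")` gives `voyageai~=1.2.3`, which pip reads as `>=1.2.3, ==1.2.*`. Calling `suggest_requirement("voyageai")` inside the activated virtual environment uses whatever version is actually installed.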

Regarding your note about reordering the docstring for the endpoints: I concur. It should be revised to match the order in the file.

tmcneil-mdb · 2025-10-31T18:57:52Z

server/python/src/routers/movies.py

+                    # Handle invalid ObjectId conversion
+                    result["_id"] = str(result["_id"]) if result["_id"] else None
+
+        results = [VectorSearchResult(**doc) for doc in raw_results]


N: I would perhaps leave a comment here explaining what this line is doing. Something about converting each result into a VectorSearchResult object.
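
A sketch of what such a comment could look like. The dataclass below is only a stand-in for the real VectorSearchResult model, and its fields (`title`, `score`) are hypothetical; the `_id` handling mirrors the line in the diff above.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class VectorSearchResult:
    # Stand-in for the real model; these fields are assumptions.
    _id: Optional[str]
    title: str
    score: float


def to_results(raw_results: list[dict[str, Any]]) -> list[VectorSearchResult]:
    cleaned = []
    for doc in raw_results:
        # Convert the database id to a string so it can be serialized as JSON;
        # a missing or empty id becomes None.
        doc_id = str(doc["_id"]) if doc.get("_id") else None
        cleaned.append({**doc, "_id": doc_id})

    # Convert each raw document into a VectorSearchResult by unpacking
    # the document's keys as keyword arguments to the model.
    return [VectorSearchResult(**doc) for doc in cleaned]
```

Copying each document before changing `_id` is a design choice of this sketch; it leaves the caller's `raw_results` untouched.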

tmcneil-mdb · 2025-10-31T18:59:21Z

server/python/src/routers/movies.py


+async def execute_aggregation_on_collection(collection, pipeline: list) -> list:
+    """Helper function to execute aggregation pipeline on a specified collection and return results"""
+    print(f"Executing pipeline: {pipeline}")  # Debug logging


More a question/thought than anything else: should we introduce logging for the aggregation instead of printing to the console?

i asked the same thing in the previous PR but i forgot to follow-up -- we shouldn't be printing to the console in the final version of the app. we can introduce logging now or as a fast follow

that makes sense, i can remove printing to the console from the python backend as a cleanup task in a new pr and then make a fast follow for proper logging.
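
A minimal sketch of what the follow-up could look like, using the standard library logging module in place of print. The logger name and the FakeCollection stand-in are hypothetical, and the way results are read from `aggregate` is an assumption; the real driver call may be shaped differently.

```python
import asyncio
import logging

logger = logging.getLogger("movies.aggregation")  # hypothetical logger name


class FakeCollection:
    """Stand-in for a database collection; not a real driver class."""

    def __init__(self, docs):
        self._docs = docs

    def aggregate(self, pipeline):
        async def _cursor():
            for doc in self._docs:
                yield doc
        return _cursor()


async def execute_aggregation_on_collection(collection, pipeline: list) -> list:
    """Execute an aggregation pipeline on a collection and return the results."""
    # Lazy %-formatting: the message is only built if DEBUG is enabled.
    logger.debug("Executing pipeline: %s", pipeline)
    return [doc async for doc in collection.aggregate(pipeline)]
```

With this in place, the pipeline is only logged when the logger level is set to DEBUG (for example via `logging.basicConfig(level=logging.DEBUG)` at startup), so nothing is written to the console by default.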

cbullinger

Requesting the changes Taylor pointed out in her comments. Overall this looks good, though, and everything worked for me once I got the voyageai package installed.

cbullinger · 2025-10-31T20:30:41Z

server/python/src/routers/movies.py


+async def execute_aggregation_on_collection(collection, pipeline: list) -> list:
+    """Helper function to execute aggregation pipeline on a specified collection and return results"""
+    print(f"Executing pipeline: {pipeline}")  # Debug logging


i asked the same thing in the previous PR but i forgot to follow-up -- we shouldn't be printing to the console in the final version of the app. we can introduce logging now or as a fast follow

shuangela added 3 commits October 30, 2025 16:52

add new info

caf7755

test vs

efd31b8

add new comments

4976059

shuangela requested review from cbullinger, jordan-smith721 and tmcneil-mdb October 30, 2025 21:10

add error handling

748af9f

tmcneil-mdb approved these changes Oct 31, 2025

cbullinger requested changes Oct 31, 2025

feedback changes

2d79bde

cbullinger approved these changes Nov 4, 2025

reorder order of endpoints in function

252d59b

shuangela merged commit eceb07d into development Nov 4, 2025
1 check passed
