Created image search functionality using MobileNet Model by ryanrahman27 · Pull Request #9 · adanomad/pdf-highlight-oa

ryanrahman27 · 2024-10-01T17:43:12Z

The ImageSearch feature allows users to search for images within a PDF document using text-based queries. After the PDF is uploaded, images are extracted using PDF.js and converted into embeddings via MobileNet. The user's search query is also transformed into an embedding using the Universal Sentence Encoder. To find the most relevant images, I used cosine similarity to compare the query embedding with the image embeddings, identifying the closest matches.
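For reference, the comparison step described above could look roughly like the sketch below. It is a minimal illustration, not the code in this PR: `cosineSimilarity`, `rankByCosineSimilarity`, `RankedMatch`, and the `topK` parameter are hypothetical names. It also assumes the query and image embeddings have the same length; the description does not say how the MobileNet and Universal Sentence Encoder outputs are aligned, so the sketch throws on a mismatch rather than guessing.

```typescript
// Hypothetical helper: cosine similarity between two equal-length vectors.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    // Assumption: embeddings must share a dimension to be comparable.
    throw new Error(`Dimension mismatch: ${a.length} vs ${b.length}`);
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  // A zero vector has no direction; treat its similarity as 0.
  return denom === 0 ? 0 : dot / denom;
}

// Hypothetical result shape: index of the image and its similarity score.
interface RankedMatch {
  index: number;
  score: number;
}

// Score every candidate against the query and keep the topK best matches.
function rankByCosineSimilarity(
  query: number[],
  candidates: number[][],
  topK: number = 3
): RankedMatch[] {
  return candidates
    .map((vec, index) => ({ index, score: cosineSimilarity(query, vec) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK);
}
```

Calling `rankByCosineSimilarity(queryEmbedding, imageEmbeddings, 5)` with plain number arrays would return the five highest-scoring image indices, best match first.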

One of the challenges I faced was loading the MobileNet model from TensorFlow Hub, as the server response was invalid. I overcame this by downloading the model and serving it locally within the project, which allowed me to bypass the external URL issues. While there were some challenges getting the MobileNet model fully functional during testing, the primary focus was to demonstrate the implementation process, and the logic was successfully put in place to handle real-world image search functionality.

sunapi386 · 2024-10-03T00:25:26Z

app/components/ImageSearch.tsx

+      // Step 1: Extract images from the PDF using the helper function
+      const images = await extractImagesFromPDF(pdfUrl);
+
+      // Step 2: Generate embeddings for each image
+      const imageEmbeddings = await Promise.all(images.map(async (img) => getImageEmbedding(img)));
+
+      // Step 3: Generate text query embedding
+      const queryEmbedding = await getTextEmbedding(searchQuery);
+
+      // Step 4: Perform search by comparing the query embedding with image embeddings
+      const matchingResults = searchEmbeddings(queryEmbedding, imageEmbeddings);


Nice, good clarity. Have you thought about doing this on the server side?

ryanrahman27 · Oct 3, 2024

Thank you! I did actually plan on doing it on the server side in Node.js because I thought it would work just as well if not better, but I decided to keep it consistent with all the other components of the web app and wrote it in TypeScript on the frontend.


sunapi386 and others added 2 commits September 25, 2024 18:07

update with highlight functionality

d2eb3a8

Created image search functionality using MobileNet Model

ff1802c

sunapi386 force-pushed the main branch from d2eb3a8 to f257e67 October 2, 2024 20:31

sunapi386 reviewed Oct 3, 2024


ryanrahman27 added 2 commits October 12, 2024 16:51

Current status of feature

d1c16a4

Feature has a bug with the actual similarity comparison, however the rest is working

cb32029

ryanrahman27 wants to merge 4 commits into adanomad:main from ryanrahman27:ryan-rahman-imagesearch
