Skip to content

Questions after using this package #305

Closed
@byt3bl33d3r

Description

@byt3bl33d3r

Hello,

First off really appreciate the fast turn around on #300. cheers🍻 .

After using this package I have a few questions, mostly cause it completely tossed out the window a lot of the assumptions I had about Neo4j and was hoping the team here could provide some clarity about best practices.

  1. I thought you weren't supposed to store documents or really any type of data in Neo4j for performance reasons (and use something like Postgres for that) however KGPipeline does store the full contents of each document in the Chunk nodes. I also had no clue you could use Neo4j as a Vector DB and the embeddings are also stored. Does this scale? How does this affect performance with large databases?

  2. KGPipeline stores the embeddings of each Chunk in the embeddings property however from the examples it seems like you have to create a separate vector store to do vector searches? Is there a way of just using the embeddings from the Chunk.embeddings property for searching without creating a complete new vector store ?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions