Description
Hello,
First off really appreciate the fast turn around on #300. cheers🍻 .
After using this package I have a few questions, mostly cause it completely tossed out the window a lot of the assumptions I had about Neo4j and was hoping the team here could provide some clarity about best practices.
-
I thought you weren't supposed to store documents or really any type of data in Neo4j for performance reasons (and use something like Postgres for that) however
KGPipeline
does store the full contents of each document in theChunk
nodes. I also had no clue you could use Neo4j as a Vector DB and the embeddings are also stored. Does this scale? How does this affect performance with large databases? -
KGPipeline
stores the embeddings of eachChunk
in theembeddings
property however from the examples it seems like you have to create a separate vector store to do vector searches? Is there a way of just using the embeddings from theChunk.embeddings
property for searching without creating a complete new vector store ?