Description
I'm experiencing an issue where LangExtract is extracting content from the provided examples rather than from the actual input text I want to process. This results in duplicate extractions and incorrect data being returned.
Expected Behavior
LangExtract should only extract information from the input text/document, using examples solely as guidance for the extraction format and structure.
Actual Behavior
LangExtract is generating extractions for both the examples and the input text, resulting in:
- Content from examples appearing in the final results
Questions
- Is this the intended behavior? Should examples generate their own extractions?
- How can I ensure only input text extractions are returned?
- Is there a parameter to disable example extractions?
- What do
group_index and extraction_index represent in this context?
Description
I'm experiencing an issue where LangExtract is extracting content from the provided examples rather than from the actual input text I want to process. This results in duplicate extractions and incorrect data being returned.
Expected Behavior
LangExtract should only extract information from the input text/document, using examples solely as guidance for the extraction format and structure.
Actual Behavior
LangExtract is generating extractions for both the examples and the input text, resulting in:
Questions
group_indexandextraction_indexrepresent in this context?