
Parse fails on certain documents, throws poor error #69

@jnu

Getting a "NoneType is not iterable" error on some documents when parse:openai is in the pipeline.

Reproduce:

extract:azuredi
parse:openai
redact:openai

Use the file 002_md_baltimore_pd_CCN-5-170308460.pdf. Extract doesn't fail (or it would raise EmptyExtractionError), but there is a None value in the input to the redact step. This must mean that parse failed. It should throw a better error, but it would also be nice to know why it fails in the first place.
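
For the "better error" part, a minimal sketch of a guard that could sit between the parse and redact steps. Everything named here is hypothetical (`EmptyParseError`, `check_parse_output`, and the assumption that parse hands a sequence of text chunks to redact); the real pipeline's types may look different.

```python
from typing import Iterable, List, Optional


class EmptyParseError(ValueError):
    """Hypothetical error: a parse step yielded no usable output."""


def check_parse_output(
    chunks: Optional[Iterable[Optional[str]]],
    step: str = "parse:openai",
) -> List[str]:
    """Validate parse output before passing it to the next step.

    Assumes (not confirmed by the codebase) that parse output is an
    iterable of strings. Raises EmptyParseError naming the failing step
    instead of letting a bare None reach the redact step.
    """
    if chunks is None:
        raise EmptyParseError(f"{step} returned None instead of parsed content")

    result: List[str] = []
    for index, chunk in enumerate(chunks):
        if chunk is None:
            raise EmptyParseError(f"{step} produced None at position {index}")
        result.append(chunk)

    if not result:
        raise EmptyParseError(f"{step} produced no content")
    return result
```

With something like this in place, the failure would point at the parse step rather than surfacing as a TypeError inside redact. It doesn't explain why parse returns None for this file, only makes the failure easier to read.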

Using the prompt from the kscms blind-charging-api deployment.


Labels: bug, needs investigation
