Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Identify Document Type Using AI (Back-End) #21

Open
4 tasks
samanthannoor opened this issue Feb 26, 2025 · 0 comments
Open
4 tasks

Identify Document Type Using AI (Back-End) #21

samanthannoor opened this issue Feb 26, 2025 · 0 comments
Labels

Comments

@samanthannoor
Copy link
Collaborator

samanthannoor commented Feb 26, 2025

User Story: As a system, I need to automate document classification by identifying the essential type of document uploaded (W-2, 1099, pay stub) so that I can efficiently extract relevant fields and produce measurable outcomes.

Acceptance Criteria:

  • System ingests the uploaded document and automatically evaluates its document structure, layout artifacts, and content patterns using Amazon Bedrock LLM (or another AI model).
  • System applies classification algorithms to predict document type (W-2, 1099, pay stub), leveraging pre-trained AI models and any additional programmatic rules necessary to enhance classification effectiveness.
  • The predicted document type is displayed to the user via the user interface and is also logged for downstream reporting.
  • A confidence score for classification is logged for evaluation.

This story establishes the essential foundation for a broader document intelligence capability, enabling future programmatic evaluation of extracted field accuracy, fraud detection, confidence level benchmarking, and automation effectiveness across the document lifecycle.

Technical Details
Input Handling:

  • Documents uploaded via API or front-end UI.
  • Files normalized to standard input format (PDF, TIFF, or PNG).
  • System stores document metadata artifacts, including filename, size, file type, and upload timestamp.

Classification Pipeline:

  • Document text and layout are extracted using OCR preprocessing pipeline (could leverage AWS Textract or equivalent).
  • Extracted data passed to Amazon Bedrock LLM for semantic and structural classification evaluation.
  • Classification logic uses prompt engineering with document type exemplars to maximize classification accuracy.

Confidence Scoring:

  • AI-generated classification results include native confidence scores from the model.
  • System logs both raw confidence scores and any post-processed confidence evaluation (e.g., adjusted scores based on prior classification patterns).
  • Confidence scores and classification outputs are captured in a reporting artifact for evaluation by the product team.

Output and Reporting:

  • Predicted document type and confidence score displayed to the user in the UI.
  • Full processing log (including predicted type, confidence score, and raw OCR text if needed) saved to system logs for future evaluation.
  • Classification results included in measurable effectiveness reports, aligning with broader program management objectives for evaluating AI-enabled automation.

Quality Assurance:

  • Periodic manual sampling of classified documents performed to validate accuracy and ensure that automation delivers high standards of performance.
  • If discrepancies exceed predefined thresholds, models and/or rules are evaluated and retrained to maximize outcomes and reduce waste.
@jillcfoley1 jillcfoley1 changed the title Identify Document Type Using AI (Optional – Back-End) Identify Document Type Using AI (Back-End) Mar 3, 2025
@jillcfoley1 jillcfoley1 added the dev label Mar 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Backlog
Development

No branches or pull requests

2 participants