Performance Optimization & Code Cleanup #15

@pouria-mirebrahimi

Story MP-015: Performance Optimization & Code Cleanup

Description:
Optimize I/O, memory footprint, and per-stage latency; reduce logging noise; remove debug artifacts; and ensure a clean, maintainable source tree.

Tasks:

  • Profile pipeline to identify hotspots (ingestion batching, NaN handling, dedup, export)
  • Optimize Parquet write settings (row group size, compression) and reduce unnecessary DataFrame copies
  • Tune parallelism/async where safe; ensure no race conditions in log or export writers
  • Reduce log verbosity to actionable INFO; keep DEBUG only for dev toggles
  • Remove dead code and debug scaffolding; enforce lint/format across repository
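
For the log-verbosity task, one possible shape is a single setup function that defaults to INFO and enables DEBUG only through an explicit developer toggle. This is a minimal sketch using the standard `logging` module; the environment variable `MP_DEBUG` and the logger name `pipeline` are hypothetical names chosen for illustration, not taken from the project.

```python
import logging
import os


def configure_logging(env=None):
    """Return a logger that emits INFO by default and DEBUG only when toggled."""
    env = os.environ if env is None else env

    # MP_DEBUG is a hypothetical dev toggle; substitute the project's own switch.
    debug_enabled = env.get("MP_DEBUG", "").strip().lower() in {"1", "true", "yes"}
    level = logging.DEBUG if debug_enabled else logging.INFO

    logger = logging.getLogger("pipeline")  # hypothetical logger name
    logger.setLevel(level)

    # Attach the handler only once so repeated calls do not duplicate output.
    if not logger.handlers:
        handler = logging.StreamHandler()
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)

    # Avoid double-printing through the root logger.
    logger.propagate = False
    return logger
```

Calling `configure_logging()` once at startup keeps the level decision in one place. The standard `logging` handlers serialize their writes with an internal lock, which helps with the log-writer race-condition concern, though the export writers would still need their own review.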

Acceptance Criteria:

  • Measurable latency reduction on at least two critical stages
  • Memory spikes mitigated under multi-symbol runs
  • Logs are concise and actionable (no excessive noise)
  • Lint/format checks pass; repository free of debug artifacts
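
To make the latency and memory criteria checkable, each stage could be wrapped in a small measurement helper and the numbers compared before and after the changes. The sketch below uses only the standard library (`time.perf_counter` and `tracemalloc`); the helper name `measure_stage` and the stand-in `dedup` function are hypothetical and only illustrate the pattern.

```python
import time
import tracemalloc
from contextlib import contextmanager


@contextmanager
def measure_stage(name, results):
    """Record wall-clock seconds and peak traced memory (bytes) for one stage.

    Not nestable: the inner block would stop tracing for the outer one.
    """
    tracemalloc.start()
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()
        tracemalloc.stop()
        results[name] = {"seconds": elapsed, "peak_bytes": peak}


def dedup(rows):
    # Stand-in for a real pipeline stage: order-preserving de-duplication.
    return list(dict.fromkeys(rows))


results = {}
with measure_stage("dedup", results):
    out = dedup([1, 2, 2, 3, 1] * 1000)

print(results["dedup"])
```

`tracemalloc` adds overhead and only sees allocations made through Python's allocator, so the timings are best used for before/after comparison under identical conditions rather than as absolute figures.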

Estimation: 1.5 days
Dependencies: MP-013, MP-014
Status: Planned
