Skip to content

feat(ml): configure Cloudinary upload for CDSCO scraper proof archiving#1900

Merged
dipexplorer merged 2 commits into
RatLoopz:mainfrom
Anmol-Bhatnagar:feat/cloudinary-cdsco-proofs
Jun 15, 2026
Merged

feat(ml): configure Cloudinary upload for CDSCO scraper proof archiving#1900
dipexplorer merged 2 commits into
RatLoopz:mainfrom
Anmol-Bhatnagar:feat/cloudinary-cdsco-proofs

Conversation

@Anmol-Bhatnagar

Copy link
Copy Markdown
Contributor

🛑 STOP: Assignment & File Scope Check

  • I am assigned to this issue.
  • I verified that this PR ONLY touches the required files.

📋 PR Summary & Link

  • **Closes [FEATURE] Configure Cloudinary upload for CDSCO scraper proof archiving #1899 **
  • Summary: This PR implements Cloudinary integration within the CDSCO scraper in the ML service to archive visual document proofs. When scanned/image-based PDFs are processed, each page is rendered to a PNG screenshot and uploaded to Cloudinary (under sahidawa/cdsco_proofs), returning a proof_image_url which is stored in the database (drug_alerts table) and validated by the Express API.
    Additionally, this PR fixes a raw merge conflict marker syntax error in apps/ml/routers/asr.py that was committed upstream, and aligns the ASR stream unit tests accordingly.

📸 Proof of Work (Screenshots / Logs)

ML Unit Tests execution results (92 passed, 3 skipped):

============================= test session starts =============================
platform win32 -- Python 3.13.14, pytest-9.0.3, pluggy-1.6.0
rootdir: C:\Users\anmol\Desktop\Open-Source\SahiDawa
plugins: anyio-4.13.0, langsmith-0.8.15, locust-2.44.3, timeout-2.4.0
collected 95 items

apps\ml\tests\test_alert_extractor.py .                                  [  1%]
apps\ml\tests\test_analyze.py .........                                  [ 10%]
apps\ml\tests\test_api.py ...                                            [ 13%]
apps\ml\tests\test_asr.py ..................sss                          [ 35%]
apps\ml\tests\test_asr_duration.py ......                                [ 42%]
apps\ml\tests\test_asr_language_hint.py ........                         [ 50%]
apps\ml\tests\test_asr_stream.py ...........                             [ 62%]
apps\ml\tests\test_medicine_ner.py ....................                  [ 83%]
apps\ml\tests\test_router_loader.py ...                                  [ 86%]
apps\ml\tests\test_telemetry.py ..                                       [ 88%]
apps\ml\tests\test_triage_graph.py .                                     [ 89%]
apps\ml\tests\test_tts.py ...                                            [ 92%]
apps\ml\tests\test_verify.py .......                                     [100%]

================== 92 passed, 3 skipped, 2 warnings in 8.47s ==================

🏷️ PR Type

  • 🐛 type: bug
  • type: feature
  • 📖 type: docs
  • 🧪 type: testing
  • 🔒 type: security
  • type: performance
  • 🎨 type: design
  • ♻️ type: refactor
  • 🛠️ type: devops
  • type: accessibility

✅ Checklist

  • My PR has a linked issue (Closes #1899 )
  • I have pulled the latest main and resolved any conflicts

@github-project-automation github-project-automation Bot moved this to 📥 Backlog in SahiDawa Workflow Jun 14, 2026
@github-actions github-actions Bot added gssoc:approved Approved for gssoc type:feature New feature or request type:devops CI/CD, Docker, deployment labels Jun 14, 2026
@github-actions github-actions Bot added the type:security Auth, rate limiting, security label Jun 14, 2026
@Anmol-Bhatnagar Anmol-Bhatnagar force-pushed the feat/cloudinary-cdsco-proofs branch from 0f27991 to 4837fe3 Compare June 14, 2026 14:23
@github-actions github-actions Bot removed the type:security Auth, rate limiting, security label Jun 14, 2026
@dipexplorer dipexplorer added level:intermediate 35 pts quality:clean multiplier x1.2 type:testing Unit tests, integration tests labels Jun 15, 2026
@dipexplorer dipexplorer merged commit 6a44963 into RatLoopz:main Jun 15, 2026
20 of 23 checks passed
@github-project-automation github-project-automation Bot moved this from 📥 Backlog to 🎉 Merged in SahiDawa Workflow Jun 15, 2026
@github-actions

Copy link
Copy Markdown
Contributor

🎉 Congratulations @Anmol-Bhatnagar! Your Pull Request "feat(ml): configure Cloudinary upload for CDSCO scraper proof archiving" has been successfully merged by @dipexplorer.

Thank you for your valuable contribution to SahiDawa! 🇮🇳
If this was for GSSoC 2026, your work is officially merged and valid. Keep up the great work and feel free to claim other open issues. 🚀

Follow us on LinkedIn: https://www.linkedin.com/company/ratloopz/ to get shoutout

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

gssoc:approved Approved for gssoc level:intermediate 35 pts quality:clean multiplier x1.2 type:devops CI/CD, Docker, deployment type:feature New feature or request type:testing Unit tests, integration tests

Projects

Status: 🎉 Merged

Development

Successfully merging this pull request may close these issues.

[FEATURE] Configure Cloudinary upload for CDSCO scraper proof archiving

2 participants