Skip to content

Expand drug interactions CSV — add 500+ high-risk pairs from DrugBank open data #6

@sarvanithin

Description

@sarvanithin

Overview

The current drug_interactions.csv has ~50 manually curated pairs. DrugBank publishes a free open data extract with thousands of known interactions. Expanding this table improves offline drug safety coverage without any API calls.

Data source

DrugBank Open Data (CC BY-NC 4.0): https://go.drugbank.com/releases/latest#open-data
Download drugbank_all_drug_interactions.csv — contains drug pairs, severity, description.

What to do

  1. Download the DrugBank open interactions CSV
  2. Filter to severity = major and moderate
  3. Normalize drug names to lowercase
  4. Map to the existing CSV format: drug_a,drug_b,severity,description,source
  5. Deduplicate against existing entries
  6. PR with the expanded CSV (target: 500+ rows)

Files to modify

  • medguard/knowledge/data/drug_interactions.csv — expand from ~50 to 500+ rows

Acceptance criteria

  • CSV has 500+ unique drug pairs
  • All drug names lowercase, consistent format
  • No duplicate pairs
  • Severity values: low, moderate, high, contraindicated
  • Source column cites drugbank / openfda / manual

Notes

This is a data contribution — no code changes required, great for first-time contributors.
DrugBank open data license requires attribution — add to README data sources table.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions