Skip to content

Latest commit

 

History

History
54 lines (37 loc) · 924 Bytes

README.md

File metadata and controls

54 lines (37 loc) · 924 Bytes

MDverse database

Setup environment

We use uv to manage dependencies and the project environment.

Clone the GitHub repository:

git clone [email protected]:MDverse/md_data_schema.git
cd md_data_schema

Sync dependencies:

uv sync

Retrieve data

Download parquet files from Zenodo to build the database:

uv run src/download_data.py

Files will be downloaded to data/parquet_files:

data
└── parquet_files
    ├── datasets.parquet
    ├── files.parquet
    ├── gromacs_gro_files.parquet
    ├── gromacs_mdp_files.parquet
    ├── gromacs_xtc_files.parquet

Build the database

Create the empty database:

uv run src/create_database.py

Populate the tables with the data from parquet files:

uv run src/ingest_data.py