Skip to content
This repository was archived by the owner on Jan 9, 2023. It is now read-only.
This repository was archived by the owner on Jan 9, 2023. It is now read-only.

Simply fine-tuning ETL #131

@chathasphere

Description

@chathasphere

Instead of manipulating data during preprocessing to identify cases and controls, I think it would be a lot simpler to optionally supply a list of patient IDs and labels to retrieve.

In a Jupyter notebook (say), we could build a suitable cohort of cases and controls, and then select for only these IDs during chunk iteration. I think this could speed up ETL significantly.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions