
Training logic on powerset #46

Open
@njallskarp

Description


Training logic for powerset

We have already added a command-line argument (main.py) where users can specify the domains or datasets. Next, each domain needs to load its own Dataset class. Then, for each set of sources in the powerset, we use torch's ConcatDataset to concatenate the selected domains into a single Dataset. If we have N domains, we will end up creating 2^N - 1 concatenated datasets, one per iteration.
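As a minimal sketch of the enumeration step: the non-empty powerset of N domains can be generated with itertools, giving the 2^N - 1 subsets mentioned above. The function name `non_empty_powerset` and the example domain names are illustrative, not part of the codebase; in the real training code each subset's datasets would then be merged with `torch.utils.data.ConcatDataset`.

```python
from itertools import chain, combinations

def non_empty_powerset(items):
    """Return all non-empty subsets of `items`: 2^N - 1 subsets for N items."""
    return list(chain.from_iterable(
        combinations(items, r) for r in range(1, len(items) + 1)
    ))

# Hypothetical domain names, for illustration only.
domains = ["news", "wiki", "social"]
subsets = non_empty_powerset(domains)
assert len(subsets) == 2 ** len(domains) - 1  # 7 subsets for 3 domains
```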

Where this could happen

It seems to me that this might happen inside the run training function. That is, around the `for ... in range(epochs)` loop there will be something like `for domain_subset in powerset:`.

This means that we will need to pass the Dataset classes into this function, not the dataloaders.
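The shape described above might look roughly like the sketch below. Everything here is an assumption about the eventual code: the name `run_training`, the loop ordering (subset loop outside the epoch loop), and the use of plain Python lists as stand-ins for torch Dataset objects so the sketch is self-contained. Real code would pass actual Datasets and build a `torch.utils.data.ConcatDataset` plus a DataLoader per subset.

```python
from itertools import chain, combinations

def run_training(domain_datasets, epochs):
    """Hypothetical training driver: receives per-domain Datasets
    (not DataLoaders) so each powerset subset can be concatenated
    on the fly. Lists stand in for torch Datasets here; real code
    would call torch.utils.data.ConcatDataset(domain_subset) and
    wrap the result in a DataLoader before the epoch loop."""
    history = []
    # Non-empty powerset of the per-domain datasets.
    subsets = chain.from_iterable(
        combinations(domain_datasets, r)
        for r in range(1, len(domain_datasets) + 1)
    )
    for domain_subset in subsets:
        # Stand-in for ConcatDataset: one merged dataset per subset.
        merged = list(chain.from_iterable(domain_subset))
        for epoch in range(epochs):
            # Placeholder for the actual training step.
            history.append((len(domain_subset), len(merged), epoch))
    return history
```

With 2 domains and 2 epochs this performs (2^2 - 1) * 2 = 6 training iterations, one per (subset, epoch) pair.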

We can schedule a meeting to discuss this in detail.
