Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support auto-splits for file datasets #676

Open
PGijsbers opened this issue Dec 14, 2024 · 0 comments
Open

Support auto-splits for file datasets #676

PGijsbers opened this issue Dec 14, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@PGijsbers
Copy link
Collaborator

Listing TODOs on the issue tracker (#172).

class ArffDataset(FileDataset):
def __init__(self, train_path, test_path, target=None, features=None, type=None):
# todo: handle auto-split (if test_path is None): requires loading the training set, split, save
super().__init__(
ArffDatasplit(self, train_path),
ArffDatasplit(self, test_path),
target=target,
features=features,
type=type,
)

# todo: handle auto-split (if test_path is None): requires loading the training set, split, save

Given the file with all data, and the target column, it should be trivial to support calls where the test split is not yet a file for ARFF and CSV datasets.

@PGijsbers PGijsbers added the enhancement New feature or request label Dec 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant