Skip to content

Duplicate samples in training data #17

@twesterhout

Description

@twesterhout

I've just noticed that train_sample.csv contains two occurrences of the following line:

871,12,1,13,178,2017-11-08 10:00:05,,0

Is it of any importance? train.csv contains loads more data and thus, I'm afraid, many more duplicate samples... Thoughts?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions