How to prepare the training data #42

@ycsun1972

Hi,
"We fine-tune the 7B and 13B models with 80k and 18k conversations, respectively."
Could you provide more details about the training data? How were the 80k conversations prepared? Do they all have a length of 16k?

Is the data used for training longchat-v1.5 the same as for the previous version?
