Skip to content

[Feature request] Add Toronto BookCorpus dataset #131

@jarednielsen

Description

@jarednielsen

I know the copyright/distribution of this one is complex, but it would be great to have! That, combined with the existing wikitext, would provide a complete dataset for pretraining models like BERT.

Metadata

Metadata

Assignees

No one assigned

    Labels

    dataset requestRequesting to add a new dataset

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions