Skip to content

Conversation

@meg-huggingface
Copy link
Contributor

Not entirely sure, following the links here, but it seems the relevant license is at https://github.com/soskek/bookcorpus/blob/master/LICENSE

Adding license, as best I can figure out from the relevant links.
@meg-huggingface meg-huggingface self-assigned this Jan 4, 2022
@albertvillanova albertvillanova added the dataset contribution Contribution to a dataset script label Sep 23, 2022
@albertvillanova albertvillanova changed the title Update README.md Update license of bookcorpus dataset Sep 23, 2022
@albertvillanova albertvillanova changed the title Update license of bookcorpus dataset Update license to bookcorpus dataset card Sep 23, 2022
Copy link
Member

@albertvillanova albertvillanova left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure but I would say the MIT license covers the code of the script to generate the dataset, not the dataset itself.

However, I'm not sure what the license of the dataset is. Note the dataset is not publicly accessible.

Maybe @lhoestq, do you have some hint? I have seen you generated the current data file hosted at Hugging Face:

@lhoestq
Copy link
Member

lhoestq commented Sep 27, 2022

The smashwords ToS apply for this dataset, we did the same for #3525

@HuggingFaceDocBuilderDev
Copy link

HuggingFaceDocBuilderDev commented Sep 30, 2022

The documentation is not available anymore as the PR was closed or merged.

@albertvillanova albertvillanova merged commit db2e5b5 into main Sep 30, 2022
@albertvillanova albertvillanova deleted the meg-huggingface-patch-3 branch September 30, 2022 10:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dataset contribution Contribution to a dataset script

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants