Skip to content

Conversation

@meg-huggingface
Copy link
Contributor

Not entirely sure, following the links here, but it seems the relevant license is at https://github.com/soskek/bookcorpus/blob/master/LICENSE

@lhoestq
Copy link
Member

lhoestq commented Jan 5, 2022

The MIT license seems to be for the crawling code, no ? Then maybe we can also redirect users to the terms of smashwords.com regarding copyrights, in particular the paragraph 10 for end-users. In particular it seems that end users can download and use the content "for their personal enjoyment in any reasonable non-commercial manner in compliance with copyright law" and the smashwords end-users agreement.

It should be the same for #3526 as well

Copy link
Collaborator

@severo severo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, @meg-huggingface for the updates!

Thanks also for setting me as a reviewer, but I'm totally not a specialist of the datasets themselves, so I prefer to just lurk and let @lhoestq @albertvillanova @mariosasko or @patrickvonplaten review these changes.

@lhoestq
Copy link
Member

lhoestq commented Apr 19, 2022

May I merge this one ?

@HuggingFaceDocBuilderDev
Copy link

HuggingFaceDocBuilderDev commented Apr 19, 2022

The documentation is not available anymore as the PR was closed or merged.

@lhoestq lhoestq merged commit cb6e8e7 into master Apr 20, 2022
@lhoestq lhoestq deleted the meg-huggingface-patch-2 branch April 20, 2022 09:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants