Skip to content

Commit bc55ce0

Browse files
authored
Update datasets/bookcorpusopen/README.md
1 parent 874d842 commit bc55ce0

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

datasets/bookcorpusopen/README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -158,7 +158,9 @@ The data fields are the same among all splits.
158158

159159
### Licensing Information
160160

161-
[MIT License](https://github.com/soskek/bookcorpus/blob/master/LICENSE)
161+
The books have been crawled from smashwords.com, see their [terms of service](https://www.smashwords.com/about/tos) for more information.
162+
163+
A data sheet for this dataset has also been created and published in [Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus](https://arxiv.org/abs/2105.05241)
162164

163165
### Citation Information
164166

0 commit comments

Comments
 (0)