diff --git a/datasets/wiki_lingua/README.md b/datasets/wiki_lingua/README.md index cf7aa303f5c..a2b9dd763c2 100644 --- a/datasets/wiki_lingua/README.md +++ b/datasets/wiki_lingua/README.md @@ -89,7 +89,7 @@ task_ids: - summarization paperswithcode_id: wikilingua --- -# Dataset Card Creation Guide +# Dataset Card for "wiki_lingua" ## Table of Contents - [Dataset Description](#dataset-description) @@ -187,7 +187,27 @@ ______________________________ ### Data Splits -[More Information Needed] +| | train | +|:-----------|--------:| +| arabic | 9995 | +| chinese | 6541 | +| czech | 2520 | +| dutch | 10862 | +| english | 57945 | +| french | 21690 | +| german | 20103 | +| hindi | 3402 | +| indonesian | 16308 | +| italian | 17673 | +| japanese | 4372 | +| korean | 4111 | +| portuguese | 28143 | +| russian | 18143 | +| spanish | 6616 | +| thai | 5093 | +| turkish | 1512 | +| vietnamese | 6616 | + ## Dataset Creation ### Curation Rationale @@ -244,12 +264,22 @@ ______________________________ ### Licensing Information -[More Information Needed] +- Article provided by wikiHow https://www.wikihow.com/Main-Page, a wiki building the world's largest, highest quality how-to manual. Please edit this article and find author credits at wikiHow.com. Content on wikiHow can be shared under a [Creative Commons license](http://creativecommons.org/licenses/by-nc-sa/3.0/). +- Refer to [this webpage](https://www.wikihow.com/wikiHow:Attribution) for the specific attribution guidelines. +- also see https://gem-benchmark.com/data_cards/WikiLingua ### Citation Information -[More Information Needed] +```bibtex +@article{ladhak-wiki-2020, + title = {WikiLingua: A New Benchmark Dataset for Multilingual Abstractive Summarization}, + authors = {Faisal Ladhak, Esin Durmus, Claire Cardie and Kathleen McKeown}, + journal = {arXiv preprint arXiv:2010.03093}, + year = {2020}, + url = {https://arxiv.org/abs/2010.03093} +} +``` ### Contributions -Thanks to [@katnoria](https://github.com/katnoria) for adding this dataset. \ No newline at end of file +Thanks to [@katnoria](https://github.com/katnoria) for adding this dataset.