From 1cb6fbd05083feed351e07e51a265f598a403b60 Mon Sep 17 00:00:00 2001
From: Wilson Lee
Date: Thu, 17 Jun 2021 17:15:14 -0700
Subject: [PATCH 1/6] fill inf CRD3 dataset card with additional info

fill inf CRD3 dataset card with additional info
update tags
---
 datasets/crd3/README.md | 50 ++++++++++++++++++++++++++++-------------
 1 file changed, 34 insertions(+), 16 deletions(-)

diff --git a/datasets/crd3/README.md b/datasets/crd3/README.md
index b7d1fcc51e1..fafbf6c5f20 100644
--- a/datasets/crd3/README.md
+++ b/datasets/crd3/README.md
@@ -1,7 +1,24 @@
 ---
+annotations_creators:
+- no-annotation
+language_creators:
+- crowdsourced
 languages:
 - en
-paperswithcode_id: crd3
+licenses:
+- cc-by-4.0
+multilinguality:
+- monolingual
+source_datasets:
+- original
+task_categories:
+- conditional-text-generation
+- sequence-modeling
+task_ids:
+- summarization
+- dialogue-modeling
+size_categories:
+- 10K
Date: Thu, 17 Jun 2021 18:46:18 -0700
Subject: [PATCH 2/6] add pretty_name

---
 datasets/crd3/README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/datasets/crd3/README.md b/datasets/crd3/README.md
index fafbf6c5f20..c02d920b01b 100644
--- a/datasets/crd3/README.md
+++ b/datasets/crd3/README.md
@@ -1,4 +1,5 @@
 ---
+pretty_name: CRD3
 annotations_creators:
 - no-annotation
 language_creators:

From 928e029e7789a4d6b5d0c1edb6c637b801b872eb Mon Sep 17 00:00:00 2001
From: Wilson Lee
Date: Thu, 17 Jun 2021 19:18:30 -0700
Subject: [PATCH 3/6] add back paperswithcode_id

---
 datasets/crd3/README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/datasets/crd3/README.md b/datasets/crd3/README.md
index c02d920b01b..9c925323759 100644
--- a/datasets/crd3/README.md
+++ b/datasets/crd3/README.md
@@ -20,6 +20,7 @@ task_ids:
 - dialogue-modeling
 size_categories:
 - 10K
Date: Fri, 18 Jun 2021 07:28:45 -0700
Subject: [PATCH 4/6] Update datasets/crd3/README.md

Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
---
 datasets/crd3/README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/datasets/crd3/README.md b/datasets/crd3/README.md
index 9c925323759..3a672c0f3c6 100644
--- a/datasets/crd3/README.md
+++ b/datasets/crd3/README.md
@@ -128,7 +128,7 @@ The data fields are the same among all splits.
 
 ### Curation Rationale
 
-Dialogue understanding and abstractive summarization remain both important and challenging problems for computational linguistics. Current paradigms in summarization modeling have specific failures in capturing semantics and pragmatics, content selection, rewriting, and evaluation in the domain of long, story-telling dialogue. CRD3 offers a linguistically rich dataset to explore these domains.
+Dialogue understanding and abstractive summarization remain both important and challenging problems for computational linguistics. Current paradigms in summarization modeling have specific failures in capturing semantics and pragmatics, content selection, rewriting, and evaluation in the domain of long, story-telling dialogue. CRD3 offers a linguistically rich dataset to explore these domains.
 
 ### Source Data
 
@@ -197,4 +197,4 @@ conference = {ACL}
 
 ### Contributions
 
-Thanks to [@thomwolf](https://github.com/thomwolf), [@lhoestq](https://github.com/lhoestq), [@mariamabarham](https://github.com/mariamabarham), [@lewtun](https://github.com/lewtun) for adding this dataset.
\ No newline at end of file
+Thanks to [@thomwolf](https://github.com/thomwolf), [@lhoestq](https://github.com/lhoestq), [@mariamabarham](https://github.com/mariamabarham), [@lewtun](https://github.com/lewtun) for adding this dataset.

From c34cf0d383f3af82b843eaf15c2cd256a14fac40 Mon Sep 17 00:00:00 2001
From: Wilson Lee
Date: Fri, 18 Jun 2021 07:35:49 -0700
Subject: [PATCH 5/6] correct inconsistencies in licenses

---
 datasets/crd3/README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/datasets/crd3/README.md b/datasets/crd3/README.md
index 3a672c0f3c6..0c7126432bc 100644
--- a/datasets/crd3/README.md
+++ b/datasets/crd3/README.md
@@ -7,7 +7,7 @@ language_creators:
 languages:
 - en
 licenses:
-- cc-by-4.0
+- cc-by-sa-4.0
 multilinguality:
 - monolingual
 source_datasets:
@@ -178,7 +178,7 @@ CRTranscript provided transcripts of the show; contributors of the Critical Role
 
 ### Licensing Information
 
-This work is licensed under a [Creative Commons Attribution-ShareAlike 4.0 International License][cc-by-sa]., as corresponding to the Critical Role Wiki https://criticalrole.fandom.com/
+This work is licensed under a [Creative Commons Attribution-ShareAlike 4.0 International License][cc-by-sa-4.0]., as corresponding to the Critical Role Wiki https://criticalrole.fandom.com/
 
 ### Citation Information
 

From dec4397db75bcdcd447f1c4a531747f2e914ce40 Mon Sep 17 00:00:00 2001
From: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
Date: Mon, 21 Jun 2021 11:58:02 +0200
Subject: [PATCH 6/6] pretty name

---
 datasets/crd3/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/datasets/crd3/README.md b/datasets/crd3/README.md
index 0c7126432bc..b5eced9e7a1 100644
--- a/datasets/crd3/README.md
+++ b/datasets/crd3/README.md
@@ -1,5 +1,5 @@
 ---
-pretty_name: CRD3
+pretty_name: CRD3 (Critical Role Dungeons and Dragons Dataset)
 annotations_creators:
 - no-annotation
 language_creators: