From 0e25823d63acbc093437e0e4734f45ecda346e90 Mon Sep 17 00:00:00 2001 From: Laxya Agarwal Date: Thu, 29 Apr 2021 22:10:43 +0530 Subject: [PATCH] Fixed typo seperate->separate --- datasets/wiki_auto/README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/datasets/wiki_auto/README.md b/datasets/wiki_auto/README.md index c02b9af479b..3a97bc89071 100644 --- a/datasets/wiki_auto/README.md +++ b/datasets/wiki_auto/README.md @@ -132,7 +132,7 @@ The `auto` config shows a pair of an English and corresponding Simple English Wi 'simple_article_url': 'https://simple.wikipedia.org/wiki?curid=702227'}} ``` -Finally, the `auto_acl`, the `auto_full_no_split`, and the `auto_full_with_split` configs were obtained by selecting the aligned pairs of sentences from `auto` to provide a ready-to-go aligned dataset to train a sequence-to-sequence system. While `auto_acl` corresponds to the filtered version of the data used to train the systems in the paper, `auto_full_no_split` and `auto_full_with_split` correspond to the unfiltered versions with and without sentence splits respectively. In the `auto_full_with_split` config, we join the sentences in the simple article mapped to the same sentence in the complex article to capture sentence splitting. Split sentences are seperated by a `` token. In the `auto_full_no_split` config, we do not join the splits and treat them as seperate pairs. An instance is a single pair of sentences: +Finally, the `auto_acl`, the `auto_full_no_split`, and the `auto_full_with_split` configs were obtained by selecting the aligned pairs of sentences from `auto` to provide a ready-to-go aligned dataset to train a sequence-to-sequence system. While `auto_acl` corresponds to the filtered version of the data used to train the systems in the paper, `auto_full_no_split` and `auto_full_with_split` correspond to the unfiltered versions with and without sentence splits respectively. In the `auto_full_with_split` config, we join the sentences in the simple article mapped to the same sentence in the complex article to capture sentence splitting. Split sentences are separated by a `` token. In the `auto_full_no_split` config, we do not join the splits and treat them as separate pairs. An instance is a single pair of sentences: ``` {'normal_sentence': 'In early work , Rutherford discovered the concept of radioactive half-life , the radioactive element radon , and differentiated and named alpha and beta radiation .\n', 'simple_sentence': 'Rutherford discovered the radioactive half-life , and the three parts of radiation which he named Alpha , Beta , and Gamma .\n'} @@ -240,4 +240,4 @@ You can cite the paper presenting the dataset as: ### Contributions -Thanks to [@yjernite](https://github.com/yjernite), [@mounicam](https://github.com/mounicam) for adding this dataset. \ No newline at end of file +Thanks to [@yjernite](https://github.com/yjernite), [@mounicam](https://github.com/mounicam) for adding this dataset.