Commit 39b0c22

Fixed typos (#7572)
1 parent 82e0859 commit 39b0c22

2 files changed: +2 -2 lines changed

CONTRIBUTING.md

Lines changed: 1 addition & 1 deletion
@@ -95,7 +95,7 @@ Note that if any files were formatted by `pre-commit` hooks during committing, y
 git push -u origin a-descriptive-name-for-my-changes
 ```

-Go the webpage of your fork on GitHub. Click on "Pull request" to send your to the project maintainers for review.
+Go the webpage of your fork on GitHub. Click on "Pull request" to send your changes to the project maintainers for review.

 ## Datasets on Hugging Face

docs/source/stream.mdx

Lines changed: 1 addition & 1 deletion
@@ -190,7 +190,7 @@ Define sampling probabilities from each of the original datasets for more contro
 {'text': 'Chevrolet Cavalier Usados en Bogota - Carros en Vent...'}]
 ```

-Around 80% of the final dataset is made of the `en_dataset`, and 20% of the `fr_dataset`.
+Around 80% of the final dataset is made of the `es_dataset`, and 20% of the `fr_dataset`.

 You can also specify the `stopping_strategy`. The default strategy, `first_exhausted`, is a subsampling strategy, i.e the dataset construction is stopped as soon one of the dataset runs out of samples.
 You can specify `stopping_strategy=all_exhausted` to execute an oversampling strategy. In this case, the dataset construction is stopped as soon as every samples in every dataset has been added at least once. In practice, it means that if a dataset is exhausted, it will return to the beginning of this dataset until the stop criterion has been reached.
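
For readers of this hunk, the two stopping strategies described in the `stream.mdx` context lines can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the `datasets` library's implementation: the function `interleave`, its parameters, and the sample lists `es_samples` and `fr_samples` are hypothetical names, sources are assumed to be non-empty lists, and the library's exact behaviour at the stopping boundary may differ.

```python
import random


def interleave(sources, probabilities, stopping_strategy="first_exhausted", seed=0):
    """Hypothetical sketch: draw samples from non-empty lists by probability."""
    if stopping_strategy not in ("first_exhausted", "all_exhausted"):
        raise ValueError(f"unknown stopping_strategy: {stopping_strategy!r}")

    rng = random.Random(seed)
    positions = [0] * len(sources)        # next index to read in each source
    exhausted = [False] * len(sources)    # True once a source was fully read once
    result = []

    while True:
        # Pick a source index according to the sampling probabilities.
        i = rng.choices(range(len(sources)), weights=probabilities)[0]
        result.append(sources[i][positions[i]])
        positions[i] += 1

        if positions[i] == len(sources[i]):
            exhausted[i] = True
            if stopping_strategy == "first_exhausted":
                break                     # subsampling: stop at the first exhausted source
            positions[i] = 0              # oversampling: restart this source
            if all(exhausted):
                break                     # every sample has been emitted at least once

    return result


# Illustrative data only; these are not the datasets used in the documentation.
es_samples = [f"es{i}" for i in range(8)]
fr_samples = [f"fr{i}" for i in range(2)]

subsampled = interleave([es_samples, fr_samples], [0.8, 0.2], "first_exhausted")
oversampled = interleave([es_samples, fr_samples], [0.8, 0.2], "all_exhausted")
```

With `first_exhausted`, no sample repeats and some samples from the longer source may never appear. With `all_exhausted`, every sample appears at least once and samples from the shorter source may repeat.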
