@lhoestq lhoestq commented Jun 29, 2021

Add interleave_datasets for map-style datasets

Add support for map-style datasets (i.e. Dataset objects) in interleave_datasets.
Previously it supported only iterable datasets (i.e. IterableDataset objects).

Implementation details

It works by concatenating the datasets and then reordering the indices to build the new dataset.
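
To illustrate the idea, here is a minimal sketch of the concatenate-then-reorder approach using plain Python lists as stand-ins for map-style datasets. The helper names `interleaved_indices` and `interleave` are hypothetical and not taken from this PR, and the sketch assumes that interleaving stops once the shortest dataset is exhausted.

```python
from itertools import accumulate


def interleaved_indices(lengths):
    """Return indices into the concatenation that alternate between the sources."""
    # Offset of each source dataset inside the concatenated dataset.
    offsets = [0] + list(accumulate(lengths))[:-1]
    # Assumption: stop as soon as the shortest dataset runs out of rows.
    n_rounds = min(lengths)
    # Round i takes row i of every dataset, in the order the datasets were given.
    return [offset + i for i in range(n_rounds) for offset in offsets]


def interleave(datasets):
    """Hypothetical helper: interleave lists standing in for map-style datasets."""
    # Step 1: concatenate all rows into one sequence.
    concatenated = [row for ds in datasets for row in ds]
    # Step 2: reorder by selecting rows through the interleaved indices.
    indices = interleaved_indices([len(ds) for ds in datasets])
    return [concatenated[i] for i in indices]


d1 = [0, 1, 2]
d2 = [10, 11, 12, 13]
d3 = [20, 21, 22]
result = interleave([d1, d2, d3])
print(result)  # [0, 10, 20, 1, 11, 21, 2, 12, 22]
```

With real Dataset objects, the same two steps would presumably correspond to `concatenate_datasets` followed by `Dataset.select` on the computed indices; that mapping is an assumption here, not a description of the merged code.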

TODO

  • tests
  • docs

Close #2563

@lhoestq lhoestq marked this pull request as ready for review June 30, 2021 17:14
@lhoestq lhoestq merged commit 1f29cba into master Jul 1, 2021
@lhoestq lhoestq deleted the interleave_datasets branch July 1, 2021 09:33

Development

Successfully merging this pull request may close these issues.

interleave_datasets for map-style datasets

2 participants