chore(deps): update dependency datasets to v2 - autoclosed #3
This PR contains the following updates:
==1.11.0 -> ==2.21.0
Release Notes
huggingface/datasets (datasets)
v2.21.0
Features
What's Changed
HF_HUB_OFFLINE instead of HF_DATASETS_OFFLINE by @Wauplin in https://github.com/huggingface/datasets/pull/6968
New Contributors
Full Changelog: huggingface/datasets@2.20.0...2.21.0
v2.20.0
Important
trust_remote_code=True by @lhoestq in https://github.com/huggingface/datasets/pull/6954
trust_remote_code=True to be used
Datasets features
checkpoint and resume an iterable dataset (e.g. when streaming):
Returns:
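The checkpoint-and-resume pattern mentioned above can be sketched in plain Python. This is a toy illustration only: `ResumableList` is a hypothetical, stdlib-only stand-in, not the `datasets` implementation. Its `state_dict()` / `load_state_dict()` method names are chosen to mirror the methods that, as far as can be told from the 2.20.0 release, `IterableDataset` exposes for this purpose. Check the library documentation for the real behaviour with shards and streaming.

```python
from typing import Any, Dict, Iterator, List


class ResumableList:
    """Hypothetical stand-in for a resumable iterable (not the `datasets` code)."""

    def __init__(self, examples: List[Dict[str, Any]]) -> None:
        self._examples = examples
        self._position = 0  # index of the next example to yield

    def __iter__(self) -> Iterator[Dict[str, Any]]:
        # Iteration starts at the saved position, so a restored object
        # skips the examples that were already consumed.
        while self._position < len(self._examples):
            example = self._examples[self._position]
            self._position += 1
            yield example

    def state_dict(self) -> Dict[str, int]:
        # Everything needed to resume: here, just the read position.
        return {"position": self._position}

    def load_state_dict(self, state: Dict[str, int]) -> None:
        self._position = state["position"]


data = [{"a": i} for i in range(6)]
source = ResumableList(data)

seen = []
checkpoint = None
for idx, example in enumerate(source):
    seen.append(example)
    if idx == 2:
        # Save progress after the third example, then stop.
        checkpoint = source.state_dict()
        break

# Later (for instance after a restart): rebuild the iterable and resume.
resumed = ResumableList(data)
resumed.load_state_dict(checkpoint)
rest = list(resumed)
```

Here `seen` holds the examples consumed before the checkpoint and `rest` holds the remainder, so no example is read twice or skipped. In a real training loop the checkpoint dictionary would typically be saved alongside the model and optimizer state.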
General improvements and bug fixes
.pth support for torch tensors by @lhoestq in https://github.com/huggingface/datasets/pull/6920
dataset_module_factory by @Wauplin in https://github.com/huggingface/datasets/pull/6959
New Contributors
Full Changelog: huggingface/datasets@2.19.0...2.20.0
v2.19.2
Bug fixes
Full Changelog: huggingface/datasets@2.19.1...2.19.2
v2.19.1
Bug fixes
Full Changelog: huggingface/datasets@2.19.0...2.19.1
v2.19.0
Dataset Features
.to_polars()
fsspec support for to_json, to_csv, and to_parquet by @alvarobartt in https://github.com/huggingface/datasets/pull/6096
mode parameter to Image feature by @mariosasko in https://github.com/huggingface/datasets/pull/6735
datasets-cli convert_to_parquet <dataset_id>
General improvements and bug fixes
remove_columns/rename_columns doc fixes by @mariosasko in https://github.com/huggingface/datasets/pull/6772
uv in CI by @mariosasko in https://github.com/huggingface/datasets/pull/6779
_check_legacy_cache2 by @lhoestq in https://github.com/huggingface/datasets/pull/6792
DatasetBuilder._split_generators incomplete type annotation by @JonasLoos in https://github.com/huggingface/datasets/pull/6799
CachedDatasetModuleFactory and Cache by @izhx in https://github.com/huggingface/datasets/pull/6754
os.path.relpath in resolve_patterns by @mariosasko in https://github.com/huggingface/datasets/pull/6815
Dataset.__getitem__ by @mariosasko in https://github.com/huggingface/datasets/pull/6817
New Contributors
Full Changelog: huggingface/datasets@2.18.0...2.19.0
v2.18.0
Dataset features
General improvements and bug fixes
num_workers could lead to incorrect shards assignments to workers and cause errors
xlistdir by @mariosasko in https://github.com/huggingface/datasets/pull/6698
New Contributors
Full Changelog: huggingface/datasets@2.17.1...2.18.0
v2.17.1
Bug Fixes
arrow_writer.py from #6636 by @bryant1410 in https://github.com/huggingface/datasets/pull/6664
Full Changelog: huggingface/datasets@2.17.0...2.17.1
v2.17.0
Dataset Features
General improvements and bug fixes
drop_last_batch in map after shuffling or sharding by @lhoestq in https://github.com/huggingface/datasets/pull/6575
setup.cfg to pyproject.toml by @mariosasko in https://github.com/huggingface/datasets/pull/6619
tqdm bars in non-interactive environments by @mariosasko in https://github.com/huggingface/datasets/pull/6627
with_rank param to Dataset.filter by @mariosasko in https://github.com/huggingface/datasets/pull/6608
New Contributors
Full Changelog: huggingface/datasets@2.16.1...2.17.0
v2.16.1
Bug fixes
cache_dir to load_dataset
load_dataset("ted_talks_iwslt", language_pair=("ja", "en"), year="2015")
Full Changelog: huggingface/datasets@2.16.0...2.16.1
v2.16.0
Security features
https://hf.co/datasets/<repo_id>. A warning is shown to let the user know about the custom code, and they can avoid this message in future by passing the argument trust_remote_code=True.
trust_remote_code=True will be mandatory to load these datasets from the next major release of datasets.
HF_DATASETS_TRUST_REMOTE_CODE=0 you can already disable custom code by default without waiting for the next release of datasets
Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR is behind base branch, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate.