Description
Describe the bug
Loading a streaming dataset with fsspec 2023.10.0 fails with the following error:
NotImplementedError: Loading a streaming dataset cached in a LocalFileSystem is not supported yet.
I suspect the issue is with this PR
Steps to reproduce the bug
- Upgrade fsspec to version 2023.10.0
- Attempt to load a streaming dataset, e.g. load_dataset("laion/gpt4v-emotion-dataset", split="train", streaming=True)
- Observe the following exception:
File "/opt/hostedtoolcache/Python/3.11.6/x64/lib/python3.11/site-packages/datasets/load.py", line 2146, in load_dataset
return builder_instance.as_streaming_dataset(split=split)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/hostedtoolcache/Python/3.11.6/x64/lib/python3.11/site-packages/datasets/builder.py", line 1318, in as_streaming_dataset
raise NotImplementedError(
NotImplementedError: Loading a streaming dataset cached in a LocalFileSystem is not supported yet.
Expected behavior
The dataset should stream as normal.
Environment info
datasets@main
fsspec==2023.10.0
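A minimal sketch of the reproduction as a script, in case it helps triage. The helper names (installed_version, matches_reported_version, try_streaming_load) are made up for illustration and are not part of datasets or fsspec. The version check only tests for the exact version named in this report; whether other fsspec versions are affected is not known from this report.

```python
from importlib.metadata import PackageNotFoundError, version

# Version named in this report; other versions are untested here.
REPORTED_FSSPEC = "2023.10.0"


def installed_version(package):
    """Return the installed version string of a package, or None if missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def matches_reported_version(fsspec_version):
    """True only for the exact fsspec version named in this report."""
    return fsspec_version == REPORTED_FSSPEC


def try_streaming_load():
    """Attempt the streaming load from the report (needs network access)."""
    # Imported lazily so the helpers above work without datasets installed.
    from datasets import load_dataset

    try:
        load_dataset("laion/gpt4v-emotion-dataset", split="train", streaming=True)
    except NotImplementedError as err:
        return f"reproduced: {err}"
    return "not reproduced"


if __name__ == "__main__":
    found = installed_version("fsspec")
    print("fsspec:", found, "| matches report:", matches_reported_version(found))
    print(try_streaming_load())
```

Running the script prints the installed fsspec version and then either the NotImplementedError message or "not reproduced".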
nguyenkhoihoang