Skip to content

Downloading “reddit” dataset keeps timing out. #2820

@smeyerhot

Description

@smeyerhot

Describe the bug

A clear and concise description of what the bug is.
Everytime I try and download the reddit dataset it times out before finishing and I have to try again.

There is some timeout error that I will post once it happens again.

Steps to reproduce the bug

from datasets import load_dataset

dataset = load_dataset("reddit", ignore_verifications=True, cache_dir="/Volumes/My Passport for Mac/og-chat-data")

Expected results

A clear and concise description of the expected results.

I would expect the download to finish, or at least provide a parameter to extend the read timeout window.

Actual results

Specify the actual results or traceback.

Shown below in error message.

Environment info

  • datasets version: 1.11.0
  • Platform: macOS
  • Python version: 3.9.6 (conda env)
  • PyArrow version: N/A

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions