Skip to content

load_dataset("financial_phrasebank") NonMatchingChecksumError #2641

@courtmckay

Description

@courtmckay

Describe the bug

Attempting to download the financial_phrasebank dataset results in a NonMatchingChecksumError

Steps to reproduce the bug

from datasets import load_dataset
dataset = load_dataset("financial_phrasebank", 'sentences_allagree')

Expected results

I expect to see the financial_phrasebank dataset downloaded successfully

Actual results

NonMatchingChecksumError: Checksums didn't match for dataset source files:
['https://www.researchgate.net/profile/Pekka_Malo/publication/251231364_FinancialPhraseBank-v10/data/0c96051eee4fb1d56e000000/FinancialPhraseBank-v10.zip']

Environment info

  • datasets version: 1.9.0
  • Platform: Linux-4.14.232-177.418.amzn2.x86_64-x86_64-with-debian-10.6
  • Python version: 3.7.10
  • PyArrow version: 4.0.1

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions