@albertvillanova
Member

Set an explicit UTF-8 encoding in the OSCAR dataset, to avoid falling back to the system default cp1252 on Windows platforms.

Fix #2319.
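
For illustration only, here is a minimal sketch of the kind of change described above. It is not the actual diff of this PR; the helper names `read_lines` and `demo` and the sample sentence are hypothetical. It shows that passing `encoding="utf-8"` to the built-in `open` makes the result independent of the platform default, whereas decoding the same bytes as cp1252 does not round-trip.

```python
import os
import tempfile


def read_lines(path, encoding="utf-8"):
    """Read a text file with an explicit encoding (hypothetical helper).

    Without the `encoding` argument, `open` uses the locale's preferred
    encoding, which is typically cp1252 on Windows.
    """
    with open(path, encoding=encoding) as f:
        return [line.rstrip("\n") for line in f]


def demo():
    """Write a UTF-8 file, then read it back with UTF-8 and with cp1252."""
    text = "Dié sin is in Afrikaans."  # hypothetical sample with a non-ASCII character
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "sample.txt")
        with open(path, "w", encoding="utf-8") as f:
            f.write(text + "\n")

        as_utf8 = read_lines(path)  # explicit UTF-8: round-trips correctly
        try:
            # Simulates what a cp1252 default would do with the same bytes.
            as_cp1252 = read_lines(path, encoding="cp1252")
        except UnicodeDecodeError:
            # Some byte values are undefined in cp1252 and raise instead.
            as_cp1252 = None
    return text, as_utf8, as_cp1252


if __name__ == "__main__":
    original, good, bad = demo()
    print(good == [original])
    print(bad == [original])
```

Depending on the bytes involved, a cp1252 read either produces garbled text or raises `UnicodeDecodeError`, which is why the sketch handles both cases.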

@albertvillanova albertvillanova merged commit 241a0b4 into huggingface:master May 5, 2021

Development

Successfully merging this pull request may close these issues.

UnicodeDecodeError for OSCAR (Afrikaans)
