Generalize tutorials for audio and vision #4468

stevhliu · 2022-06-09T22:00:44Z

This PR updates the tutorials to be more generalizable to all modalities. After reading the tutorials, a user should be able to load any type of dataset, know how to index into and slice a dataset, and do the most basic/common type of preprocessing (tokenization, resampling, applying transforms) depending on their dataset.

Other changes include:

Removed the sections about a dataset's metadata, features, and columns because we cover this in an earlier tutorial about inspecting the DatasetInfo through the dataset builder.
Separate the sharing dataset tutorial into two sections: (1) uploading via the web interface and (2) using the huggingface_hub library.
Renamed some tutorials in the TOC to be more clear and specific.
Added more text to nudge users towards joining the community and asking questions on the forums.
If it's okay with everyone, I'd also like to remove the section about loading and using metrics since we have the evaluate docs now.

HuggingFaceDocBuilderDev · 2022-06-09T22:08:07Z

The documentation is not available anymore as the PR was closed or merged.

mariosasko

Looks good already.

Maybe instead of removing the metrics docs abruptly, we can deprecate them for now (with a warning?) and redirect users to the evaluate docs.

Some nits:

docs/source/access.mdx

docs/source/load_hub.mdx

docs/source/upload_dataset.mdx

docs/source/use_dataset.mdx

lhoestq

Awesome ! And +1 about deprecating metrics instead of removing them, and redirecting to evaluate

CI fails are unrelated to this PR, you can ignore them

docs/source/metrics.mdx

📝 first draft

87522c2

stevhliu added the documentation Improvements or additions to documentation label Jun 9, 2022

stevhliu requested review from lhoestq and mariosasko June 9, 2022 22:00

mariosasko reviewed Jun 10, 2022

View reviewed changes

lhoestq approved these changes Jun 10, 2022

View reviewed changes

🖍 apply reviews

ed14c92

lhoestq reviewed Jun 14, 2022

View reviewed changes

docs/source/metrics.mdx Show resolved Hide resolved

stevhliu merged commit 40553c7 into huggingface:master Jun 14, 2022

stevhliu deleted the fresh-tutorials branch June 14, 2022 16:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Generalize tutorials for audio and vision #4468

Generalize tutorials for audio and vision #4468

Uh oh!

stevhliu commented Jun 9, 2022

Uh oh!

HuggingFaceDocBuilderDev commented Jun 9, 2022 •

edited

Loading

Uh oh!

mariosasko left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lhoestq left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Generalize tutorials for audio and vision #4468

Generalize tutorials for audio and vision #4468

Uh oh!

Conversation

stevhliu commented Jun 9, 2022

Uh oh!

HuggingFaceDocBuilderDev commented Jun 9, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mariosasko left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lhoestq left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

HuggingFaceDocBuilderDev commented Jun 9, 2022 •

edited

Loading

lhoestq left a comment •

edited

Loading