Update docs around audio and vision #4440
Conversation
The documentation is not available anymore as the PR was closed or merged.
mariosasko
left a comment
Love the changes!
We plan to address this with end-to-end examples (for each modality) more focused on preprocessing than the ones in the Transformers docs.
lhoestq
left a comment
Awesome, thanks!
Let me know what you think, especially if we should include some code samples for training a model in the audio/vision sections. I left this out since we already showed it in the NLP section.
I'd add the conversion to pytorch DataLoader and TF Dataset as well for audio and vision, if it's not too much information. But I think the training loop itself needs to be part of an end-to-end example as @mariosasko suggested.
Once there is a PyTorch DataLoader or a TF Dataset, we're ready for training; I don't think it's necessary to show the complexity of one particular task/model and how it's trained in the quickstart.
Maybe the quickstart can redirect to its corresponding end-to-end example for those who would like to see a complete example with more context? Something like "Want to see more in a concrete example? See how a dataset can be prepared for speech recognition with transformers, etc."
lhoestq
left a comment
Love the links you provided! Later I think we can provide end-to-end examples in the datasets docs themselves, using a much simpler training loop instead of redirecting to the Transformers docs that use the Trainer.
My last comments:
Color gradients look all good now, thanks!
CI failures are unrelated to this PR btw - you can ignore them.
Feel free to merge if it's all good for you @stevhliu
As part of the strategy to center the docs around the different modalities, this PR updates the quickstart to include audio and vision examples. This improves the developer experience by making audio and vision content more discoverable, enabling users working in these modalities to also quickly get started without digging too deeply into the docs.
Other changes include:
- Fixed the `tf.data.Dataset` example because it was throwing an error. The `to_tensor()` bit was redundant and removing it fixed the error (please double-check me here!).
- The `torch` text is different from the `tf` text.

Let me know what you think, especially if we should include some code samples for training a model in the audio/vision sections. I left this out since we already showed it in the NLP section. I want to keep the focus on using Datasets to load and process a dataset, and not so much the training part. Maybe we can add links to the Transformers docs instead?