Add new model docs by patrickvonplaten · Pull Request #9667 · huggingface/transformers

patrickvonplaten · 2021-01-18T22:16:01Z

What does this PR do?

This PR adds more information to the docs on how to add a model to Transformers.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

UPDATE

The model_doc/add_new_model.rst is now finished for a first merge IMO. It would be amazing if @LysandreJik @sgugger you could review the file real quick again - I tried to add all of your suggestions. Also, I added a diagram showing the model design of Transformers, which has not been reviewed yet. Note that I did not add a clear design for Tokenizers since it takes a lot of time to do so and I want to iteratively improve this step-by-step explanation. The first model for which I'd like to mentor someone from the community would also be BigBird, which does not need a new tokenizer.

In addition, I would be extremely grateful if @stas00 @abhishekkrthakur @patil-suraj @stefan-it @NielsRogge you could take 10 minutes to review the model_doc/add_new_model.rst file for possible improvements, since you guys just recently added a new model. Your feedback would be especially useful since you might have a much more "unbiased" view of what is difficult or easy when adding a model.

LysandreJik

Fantastic work! Thank you for putting all of that into words.

I really love how enthusiastic it feels, with the 🎉 You are Awesome! 😎 and such!

sgugger

This is a great template, thanks for adding!

stas00

Boy, this was a monumental work, @patrickvonplaten! Absolutely phenomenal work!

I left a bunch of small suggestions - please feel free to ignore any or all, no need for any justifications.

Wrt delivery, I felt it may be a bit overwhelming with the amount of information. So I highly recommend adding numbers to the sub-sections so that it's easier for one to know where they are. It's super-detailed, which is great, but one could also get scared by so many details. Perhaps even add a clear Table of Contents with numbered items, so the reader knows, ok, I'm at 6/10, like in a book.

In my experience I used a very different approach to porting, because I have a hard time working in a vacuum, which this approach suggests. I need to see constant success while progressing towards a bigger goal. So my approach was to have 2 code bases side-by-side, with 2 small scripts, using real text and not just a few numbers, and porting the tokenizer first, so at each step I was matching the ported code with the original, rather than working on each separately.
That's why it was important for me to choose checkpoints that used languages I speak.
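
The side-by-side matching idea described above could be sketched roughly as follows. This is only an illustration, not the scripts actually used for any port: the function names `max_abs_diff` and `first_mismatch`, the stage names, the tolerance, and all the numbers are hypothetical, and a real port would compare tensors from the two code bases rather than hand-written lists.

```python
def max_abs_diff(reference, ported):
    """Return the largest element-wise absolute difference between two flat lists."""
    if len(reference) != len(ported):
        raise ValueError("outputs have different lengths")
    return max((abs(r - p) for r, p in zip(reference, ported)), default=0.0)


def first_mismatch(reference_steps, ported_steps, tol=1e-3):
    """Walk both pipelines stage by stage and return the name of the first
    stage whose outputs differ by more than tol, or None if all stages match."""
    for name, ref_out in reference_steps.items():
        if name not in ported_steps:
            return name
        if max_abs_diff(ref_out, ported_steps[name]) > tol:
            return name
    return None


# Hypothetical intermediate outputs collected from the original code base
# and from the ported code, keyed by stage (tokenizer first).
original = {
    "tokenizer": [101, 7592, 102],
    "embeddings": [0.12, -0.30, 0.55],
    "logits": [1.50, -0.25],
}
ported = {
    "tokenizer": [101, 7592, 102],
    "embeddings": [0.12, -0.30, 0.55],
    "logits": [1.50, -0.75],
}

print(first_mismatch(original, ported))
```

Checking stages in order, tokenizer first, means a mismatch is caught at the earliest point where the two code bases diverge, so there is a visible success after each ported piece.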

But there is more than one way to skin a cat, and my approach is documented in my blog post, although it's outdated now since many things have changed in the code layout since it was written.

sgugger

Left some last typo-fix suggestions. In general, remember that in rst you can't have nested formatting (like code-formatting inside a bold block for instance).

Also this is super mega nitty but you use Object and :obj:Object indiscriminately. In a docstring they both end up in bold and code-formatting, but in a rst file the first is just code-formatting and the second is bold + code formatting.

patrickvonplaten added 3 commits January 18, 2021 23:11

add new model logic

6c9a2c2

fix docs

41ccdb7

change structure

82ed971

LysandreJik reviewed Jan 19, 2021

sgugger approved these changes Jan 19, 2021

patrickvonplaten added 12 commits January 20, 2021 16:49

improve add_new_model

a5b53d9

push new changes

2eb8ce8

up

d9618a0

up

92dcc01

correct spelling

f4f4c7a

improve docstring

057ee8f

Merge branch 'master' of https://github.com/huggingface/transformers into add_new_model_temp

c17530a

Merge branch 'master' of https://github.com/huggingface/transformers into add_new_model_temp

a9550d2

correct line length

6028926

update readme

29c728b

correct links

b94dedf

correct typos

8985b1b

stas00 reviewed Jan 27, 2021

Comment thread docs/source/add_new_model.rst Outdated

Pierrci reviewed Jan 27, 2021

Comment thread docs/source/add_new_model.rst Outdated

stas00 approved these changes Jan 27, 2021

stefan-it reviewed Jan 28, 2021

Comment thread docs/source/add_new_model.rst

BramVanroy suggested changes Jan 28, 2021

only add rst file for now

744133f

patrickvonplaten changed the title ~~[WIP] Add new model docs~~ Add new model docs Feb 1, 2021

patrickvonplaten commented Feb 1, 2021

Comment thread docs/source/add_new_model.rst Outdated

patrickvonplaten commented Feb 1, 2021

Comment thread docs/source/add_new_model.rst Outdated

patrickvonplaten and others added 4 commits February 1, 2021 13:01

Apply suggestions from code review 1

7523410

Co-authored-by: Stas Bekman <[email protected]> Co-authored-by: Bram Vanroy <[email protected]>

Apply suggestions from code review

4369da9

Co-authored-by: Bram Vanroy <[email protected]> Co-authored-by: Stas Bekman <[email protected]>

Apply suggestions from code review

ff8c3f1

Co-authored-by: Stas Bekman <[email protected]>

Apply suggestions from code review

fdeaac0

Co-authored-by: Stas Bekman <[email protected]> Co-authored-by: Stefan Schweter <[email protected]> Co-authored-by: Bram Vanroy <[email protected]>

patrickvonplaten and others added 4 commits February 1, 2021 14:48

Apply suggestions from code review

09d356c

Co-authored-by: Stas Bekman <[email protected]> Co-authored-by: Pierric Cistac <[email protected]>

finish adding all suggestions

cfafc9f

make style

dbaab48

apply Niels feedback

e92e236

sgugger approved these changes Feb 1, 2021

patrickvonplaten and others added 2 commits February 1, 2021 17:07

Apply suggestions from code review

635e602

Co-authored-by: Sylvain Gugger <[email protected]>

apply sylvains suggestions

7ffaa09

patrickvonplaten merged commit 0e3be1a into huggingface:master Feb 1, 2021

patrickvonplaten deleted the add_new_model_temp branch February 1, 2021 14:55

MLDovakin mentioned this pull request Feb 28, 2021

OSError: Error no file named ['pytorch_model.bin', 'tf_model.h5'] When I try to use my model #10450

Closed
