
Conversation

@patil-suraj (Contributor)

This PR adds MBartForConditionalGeneration. Regarding #6416
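For reference, a minimal usage sketch (not taken from this PR; the en-ro checkpoint name comes from the archive list reviewed below, and the exact tokenizer helpers may differ between library versions):

from transformers import MBartForConditionalGeneration, MBartTokenizer

# Illustrative sketch: the en-ro checkpoint is fine-tuned for English-to-Romanian translation.
tokenizer = MBartTokenizer.from_pretrained("facebook/mbart-large-en-ro")
model = MBartForConditionalGeneration.from_pretrained("facebook/mbart-large-en-ro")

# Encode an English sentence and let the model generate the Romanian translation.
inputs = tokenizer("UN Chief Says There Is No Military Solution in Syria", return_tensors="pt")
generated_ids = model.generate(**inputs)
print(tokenizer.batch_decode(generated_ids, skip_special_tokens=True))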

@sshleifer

@patil-suraj changed the title from "Add mbart model" to "[WIP][MBartForConditionalGeneration] Add mbart model" on Aug 12, 2020
@patil-suraj (Contributor, Author)

The failure is coming from test_modeling_tf_electra.py

@patil-suraj changed the title from "[WIP][MBartForConditionalGeneration] Add mbart model" to "[WIP] MBartForConditionalGeneration" on Aug 12, 2020
@sshleifer (Contributor) left a comment

LGTM, thanks Suraj!

@sgugger (Collaborator) left a comment

Great work! Just left a few doc nits.


MBART is a sequence-to-sequence denoising auto-encoder pre-trained on large-scale monolingual corpora in many languages using the BART objective. mBART is one of the first methods for pre-training a complete sequence-to-sequence model by denoising full texts in multiple languages, while previous approaches have focused only on the encoder, decoder, or reconstructing parts of the text.

The Authors' code can be found `here <https://github.com/pytorch/fairseq/tree/master/examples/mbart>`_
Collaborator comment:

Suggested change:
- The Authors' code can be found `here <https://github.com/pytorch/fairseq/tree/master/examples/mbart>`_
+ The Authors' code can be found `here <https://github.com/pytorch/fairseq/tree/master/examples/mbart>`__

(Using only one trailing underscore creates a named reference, so Sphinx will associate every ``here`` link in the docs with that same target, or complain about duplicate targets. A double underscore makes the reference anonymous, so it's best to always use two.)

MBART_PRETRAINED_MODEL_ARCHIVE_LIST = [
    "facebook/mbart-large-cc25",
    "facebook/mbart-large-en-ro",
    # See all BART models at https://huggingface.co/models?filter=mbart
]
Collaborator comment:

Suggested change:
- # See all BART models at https://huggingface.co/models?filter=mbart
+ # See all multilingual BART models at https://huggingface.co/models?filter=mbart


MBART_START_DOCSTRING = r"""
This model is a PyTorch `torch.nn.Module <https://pytorch.org/docs/stable/nn.html#torch.nn.Module>`_ sub-class.
Collaborator comment:

Suggested change:
- This model is a PyTorch `torch.nn.Module <https://pytorch.org/docs/stable/nn.html#torch.nn.Module>`_ sub-class.
+ This model is a PyTorch `torch.nn.Module <https://pytorch.org/docs/stable/nn.html#torch.nn.Module>`__ sub-class.

codecov bot commented Aug 12, 2020

Codecov Report

Merging #6441 into master will increase coverage by 0.29%.
The diff coverage is 88.22%.

@@            Coverage Diff             @@
##           master    #6441      +/-   ##
==========================================
+ Coverage   79.77%   80.06%   +0.29%     
==========================================
  Files         148      156       +8     
  Lines       27214    28024     +810     
==========================================
+ Hits        21710    22438     +728     
- Misses       5504     5586      +82     
Impacted Files                                   | Coverage Δ
-------------------------------------------------|-------------------------
src/transformers/configuration_reformer.py       | 100.00% <ø> (ø)
src/transformers/data/test_generation_utils.py   | 0.00% <0.00%> (ø)
src/transformers/modeling_marian.py              | 90.00% <ø> (-0.91%) ⬇️
src/transformers/modeling_utils.py               | 87.35% <ø> (ø)
src/transformers/tokenization_bart.py            | 100.00% <ø> (+4.22%) ⬆️
src/transformers/trainer_tf.py                   | 12.25% <0.00%> (-0.13%) ⬇️
src/transformers/testing_utils.py                | 51.92% <28.57%> (-20.81%) ⬇️
src/transformers/trainer.py                      | 37.84% <37.50%> (-0.18%) ⬇️
src/transformers/modeling_tf_bert.py             | 98.38% <50.00%> (+1.79%) ⬆️
src/transformers/data/data_collator.py           | 90.90% <52.94%> (-5.68%) ⬇️
... and 69 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@patil-suraj (Contributor, Author)

Thanks @sgugger, I've applied the suggestions.

@LysandreJik (Member) left a comment

LGTM!

"test_modeling_tf_xlm_roberta.py",
"test_modeling_xlm_roberta.py",
"test_modeling_pegasus.py",
"test_modeling_mbart.py",
Member comment:

Looking forward to adding this to the common tests!

@patil-suraj changed the title from "[WIP] MBartForConditionalGeneration" to "MBartForConditionalGeneration" on Aug 13, 2020
@LysandreJik merged commit 680f133 into huggingface:master on Aug 14, 2020
sgugger added a commit that referenced this pull request Aug 14, 2020
* Generation doc

* MBartForConditionalGeneration (#6441)

* add MBartForConditionalGeneration

* style

* rebase and fixes

* add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS

* fix docs

* don't ignore mbart

* doc

* fix mbart fairseq link

* put mbart before bart

* apply doc suggestions

* Use hash to clean the test dirs (#6475)

* Use hash to clean the test dirs

* Use hash to clean the test dirs

* Use hash to clean the test dirs

* fix

* [EncoderDecoder] Add Cross Attention for GPT2 (#6415)

* add cross attention layers for gpt2

* make gpt2 cross attention work

* finish bert2gpt2

* add explicit comments

* remove attention mask since not yet supported

* revert attn mask in pipeline

* Update src/transformers/modeling_gpt2.py

Co-authored-by: Sylvain Gugger <[email protected]>

* Update src/transformers/modeling_encoder_decoder.py

Co-authored-by: Sylvain Gugger <[email protected]>

Co-authored-by: Sylvain Gugger <[email protected]>

* Sort unique_no_split_tokens to make it deterministic (#6461)

* change unique_no_split_tokens's type to set

* use sorted list instead of set

* style

* Import accuracy_score (#6480)

* Apply suggestions from code review

Co-authored-by: Lysandre Debut <[email protected]>

* Address comments

* Styling

* Generation doc

* Apply suggestions from code review

Co-authored-by: Lysandre Debut <[email protected]>

* Address comments

* Styling

Co-authored-by: Suraj Patil <[email protected]>
Co-authored-by: Kevin Canwen Xu <[email protected]>
Co-authored-by: Patrick von Platen <[email protected]>
Co-authored-by: Quentin Lhoest <[email protected]>
Co-authored-by: gijswijnholds <[email protected]>
Co-authored-by: Lysandre Debut <[email protected]>
@sshleifer mentioned this pull request on Aug 17, 2020