Hi, I am trying to reproduce the results for the mT5-Large text model (F0.5 score of 70.1) reported in Table 3 of your paper.
Would it be possible to share the script you used to train the mT5-Large text model on the CLang8 German data and evaluate it on the Falko-MERLIN German dataset? I looked for the hyper-parameter settings used for this experiment, but I could only find the hyper-parameters for evaluating the multimodal GEC models in Appendix A.2 of your paper.
Thank you!