You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Nov 1, 2024. It is now read-only.
I am working on the long text classification task, and given the spectacular results of MEGA for long sequence modelling I wanted to use it for this task. The only thing that I haven't figured out how to do is the tokenization of my text samples, so I was wondering if someone could help me out on how to tokenize my text with the dict that is obtained from a checkpoint like the LRA one from the text task.
First of all thank you very much for your work.
I am working on the long text classification task, and given the spectacular results of MEGA for long sequence modelling I wanted to use it for this task. The only thing that I haven't figured out how to do is the tokenization of my text samples, so I was wondering if someone could help me out on how to tokenize my text with the dict that is obtained from a checkpoint like the LRA one from the text task.
Thank you very much for your time