Tokenization for downstream tasks

First of all thank you very much for your work. 

I am working on the long text classification task, and given the spectacular results of MEGA for long sequence modelling I wanted to use it for this task. The only thing that I haven't figured out how to do is the tokenization of my text samples, so I was wondering if someone could help me out on how to tokenize my text with the dict that is obtained from a checkpoint like the LRA one from the text task.

Thank you very much for your time

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tokenization for downstream tasks #10

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Tokenization for downstream tasks #10

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions