Commit 9a9df85
Fix bug with labels of eurlex config of lex_glue dataset (#5048)
* Update lex_glue.py
Fix a critical bug in the EURLEX dataset label list to make LexGLUE EURLEX results replicable. In LexGLUE (Chalkidis et al., 2022), the following is mentioned w.r.t. EUR-LEX: "It supports four different label granularities, comprising 21, 127, 567, 7390 EuroVoc concepts, respectively. We use the 100 most frequent concepts from level 2 [...]". The current label list contains all 127 level-2 labels instead of the 100 most frequent ones, which leads to different (lower) results, as reported by users.
* Update lex_glue.py
Update code formatting
* Update dataset card
* Update metadata JSON
Co-authored-by: Albert Villanova del Moral <[email protected]>
1 parent a4a571a commit 9a9df85
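The label selection described in the commit message (keeping only the 100 most frequent level-2 concepts rather than all 127) can be sketched as below. This is a minimal illustration under stated assumptions, not the code in `lex_glue.py`: the function names `top_k_labels` and `filter_to_labels` and the toy label lists are hypothetical, and the tie-breaking rule is an arbitrary choice made here for determinism.

```python
from collections import Counter


def top_k_labels(label_lists, k=100):
    """Return the k most frequent labels across all documents.

    Hypothetical helper: labels are ranked by descending frequency,
    with ties broken alphabetically so the result is deterministic.
    """
    counts = Counter(label for labels in label_lists for label in labels)
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [label for label, _ in ranked[:k]]


def filter_to_labels(label_lists, kept):
    """Drop every label that is not in the kept label set."""
    kept_set = set(kept)
    return [[label for label in labels if label in kept_set] for labels in label_lists]


# Toy data standing in for per-document EuroVoc concept lists (assumed, not real).
train = [["a", "b"], ["a", "c"], ["a", "b", "d"]]

kept = top_k_labels(train, k=2)
filtered = filter_to_labels(train, kept)
print(kept)
print(filtered)
```

With the real data, `k=100` would reduce the 127 level-2 concepts to the 100 most frequent ones. Documents whose labels all fall outside the kept set end up with an empty label list.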
3 files changed, with 2 additions and 29 deletions.
Diff excerpt (line contents not preserved): original lines 199-205, with line 202 replaced.