NER model for Armenian

Hello! I have trained a NER model for the Armenian language using the[ ArmTDP dataset](https://github.com/myavrum/ArmTDP-NER) and the [xlm-roberta-base model](https://huggingface.co/xlm-roberta-base).

After that, I attempted to test the model using stanza.Pipeline:

```
import stanza

config = {
'processors': 'tokenize, ner',
'lang': 'hy',
'ner_model_path': '/Lab/Projects/ner/models/hy_armtdp_nertagger_bert_18.pt',
}

nlp = stanza.Pipeline(**config)

nlp("some text in Arminian")

```

While working with the same data, I observed that the outputs after loading the model were different each time. 
Although there was no such problem when testing the code using internal commands. Whenever I run the following code, I get the same output:

`python3 -m stanza.utils.training.run_ner hy_armtdp --score_test`

What could be the cause of this problem? 

Additionally, I have added data conversion and BERT code for Armenian in this [pull request](https://github.com/ShakeHakobyan/stanza/pull/2) (trained model can be downloaded from this [drive](https://drive.google.com/file/d/15Fc1BFj_Rlbio3Vt5c69QUH9YvS7dTzW/view?usp=sharing)).

If the problem is feasible, it would be great to integrate a NER model for Armenian in the main package


Thanks!



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

NER model for Armenian #1206

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

NER model for Armenian #1206

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions