Skip to content

Notes on tests#117

Open
LourensVeen wants to merge 1 commit intoNLeSC:masterfrom
LourensVeen:tests-notes
Open

Notes on tests#117
LourensVeen wants to merge 1 commit intoNLeSC:masterfrom
LourensVeen:tests-notes

Conversation

@LourensVeen
Copy link
Member

See the inline comments

annotators=['tokenize', 'ssplit', 'pos', 'lemma'])
saf = stanford_to_saf(lines)
assert_equal({t['lemma'] for t in saf['tokens']},
{'Cesar', 'hit', 'Hovik'})
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is the order reversed here relative to the input?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

{} is set builder syntax, so the order is considered irrelevant. From the looks of stanford_to_saf output, the order shouldn't matter because each token carries an offset.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants