Skip to content

to_bytes and from_bytes changes the token lemma #636

@rajhans

Description

@rajhans

import spacy
nlp=spacy.load('en')
x1=nlp('I cant do this.')
[t.lemma_ for t in x1]
['i', 'can', 'not', 'do', 'this', '.']
g=x1.to_bytes()
d=spacy.tokens.doc.Doc(nlp.vocab)
d.from_bytes(g)
I cant do this.
[t.lemma_ for t in d]
['i', 'ca', 'nt', 'do', 'this', '.']

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugBugs and behaviour differing from documentation

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions