Skip to content

Different POS tags for same sentence repeated in paragraph #954

@garyspatterson

Description

@garyspatterson

I am seeing odd behavior with regards to fine-grained POS tags for a text with identical repeated sentences: 'The cactus also bears fruit. The cactus also bears fruit.' For the first sentence, the 'cactus' token is tagged as NN, whereas in the second sentence, it is NNS. If you take away the 'also' in the second sentence, the tag is correctly 'NN'. I had assumed that POS tagging was done at the sentence level of analysis, so I'm curious why this is happening. Thanks!

for t in sent:
print t, t.tag_, t.dep_

The DT det
cactus NN nsubj
also RB advmod
bears VBZ ROOT
fruit NN dobj
. . punct
The DT det
cactus NNS nsubj
also RB advmod
bears VBZ ROOT
fruit NN dobj
. . punct

Your Environment

spaCy 1.6
spyder 3.0.2
Error also replicates on displacy.

Metadata

Metadata

Assignees

No one assigned

    Labels

    lang / enEnglish language data and modelsmodelsIssues related to the statistical models

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions