-
-
Notifications
You must be signed in to change notification settings - Fork 4.6k
Closed
Labels
lang / enEnglish language data and modelsEnglish language data and modelsmodelsIssues related to the statistical modelsIssues related to the statistical models
Description
I am seeing odd behavior with regards to fine-grained POS tags for a text with identical repeated sentences: 'The cactus also bears fruit. The cactus also bears fruit.' For the first sentence, the 'cactus' token is tagged as NN, whereas in the second sentence, it is NNS. If you take away the 'also' in the second sentence, the tag is correctly 'NN'. I had assumed that POS tagging was done at the sentence level of analysis, so I'm curious why this is happening. Thanks!
for t in sent:
print t, t.tag_, t.dep_
The DT det
cactus NN nsubj
also RB advmod
bears VBZ ROOT
fruit NN dobj
. . punct
The DT det
cactus NNS nsubj
also RB advmod
bears VBZ ROOT
fruit NN dobj
. . punct
Your Environment
spaCy 1.6
spyder 3.0.2
Error also replicates on displacy.
Metadata
Metadata
Assignees
Labels
lang / enEnglish language data and modelsEnglish language data and modelsmodelsIssues related to the statistical modelsIssues related to the statistical models