Skip to content

NER doesn't identify lowercase entities #701

@bluefuzz01

Description

@bluefuzz01

As the title suggests, entities in lower case are not recognized as entities. I also noticed entities in upper case are not recognized either. It seems to only recognize entities with title/proper case:

EX: United States but not united states or UNITED STATES

Are there any plans to improve detection for these instances? Has anyone attempted this problem yet? If so, what did you do to deal with these cases?

Thanks!

Your Environment

  • Operating System: Windows 7
  • Python Version Used: 2.7.12
  • spaCy Version Used: 1.4.0 (1.60 as of Jan 20, 2016)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions