Skip to content

Improve multi-language handling #72

@grhoten

Description

@grhoten

Some of the inflection variants aren't handled well. Some changes are needed to improve handling them. Here are some examples.

  1. The theater and theatre lemmas (L7083) need separate inflection tables.
  2. Provide a way to combine multiple languages, since language isn't pure.
    1. Handle multiple Norwegian variants.
    2. Allow combining of Serbian and Croatian.
    3. Allow combining of phonetic information for Korean and English for improved Korean particle usage.
    4. and so on...
  3. Improve the dictionary-parser speed to run in less time through better data filtering. Irrelevant data is skipped when necessary. For English, this was an improvement of 5 seconds or 25%.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions