Requested by @justaddcoffee on Berkeley BOP Slack #contextualizer channel (Sept 4):
Maybe we should label each test with one of (say) 5 types so we can break out plots by how they performed for (say):
- Bibliographic metadata - title, authors, affiliation, doi, publisher
- Publication text extraction - getting stuff from specific sections
- Figure / Table extraction - getting figures and tables
- Supplementary material - getting supplemental data
- Retraction status - was paper extracted, etc.
@cmungall and @ct-parker supported the idea.
@justaddcoffee committed changes to mcp_literature_eval config including a case_category for each case.
Requested by @justaddcoffee on Berkeley BOP Slack #contextualizer channel (Sept 4):
@cmungall and @ct-parker supported the idea.
@justaddcoffee committed changes to mcp_literature_eval config including a
case_categoryfor eachcase.