Docs for EvaluationSuite #340
Conversation
The documentation is not available anymore as the PR was closed or merged.
Force-pushed from e64cd83 to f8da2a6
lhoestq left a comment
Cool, thanks! I think you could also mention it in the quick tour :)
docs/source/evaluation_suite.mdx (Outdated)

```python
{'glue/cola': {'accuracy': 0.0, 'total_time_in_seconds': 0.9766696180449799, 'samples_per_second': 10.238876909079256, 'latency_in_seconds': 0.09766696180449798},
 'glue/sst2': {'accuracy': 0.5, 'total_time_in_seconds': 1.1422595420153812, 'samples_per_second': 8.754577775166744, 'latency_in_seconds': 0.11422595420153811},
 'glue/qqp': {'accuracy': 0.6, 'total_time_in_seconds': 1.3553926559980027, 'samples_per_second': 7.377935800188323, 'latency_in_seconds': 0.13553926559980026},
 'glue/mrpc': {'accuracy': 0.6, 'total_time_in_seconds': 2.021696529001929, 'samples_per_second': 4.946340786832532, 'latency_in_seconds': 0.2021696529001929},
 'glue/mnli': {'accuracy': 0.2, 'total_time_in_seconds': 2.0380110969999805, 'samples_per_second': 4.9067446270142145, 'latency_in_seconds': 0.20380110969999807},
 'glue/qnli': {'accuracy': 0.3, 'total_time_in_seconds': 2.082032073987648, 'samples_per_second': 4.802999975330509, 'latency_in_seconds': 0.20820320739876477},
 'glue/rte': {'accuracy': 0.7, 'total_time_in_seconds': 2.8592985830036923, 'samples_per_second': 3.4973612267855576, 'latency_in_seconds': 0.2859298583003692},
 'glue/wnli': {'accuracy': 0.5, 'total_time_in_seconds': 1.5406486629508436, 'samples_per_second': 6.490772517107661, 'latency_in_seconds': 0.15406486629508437}}
```
(nit) Would be nice to show it as a pandas DataFrame for readability
Good call, the result is now a list of dicts so it can be easily transformed into a dataframe. I've added that to the example 😄
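The conversion being discussed is direct, since a list of dicts is exactly the shape the `pandas.DataFrame` constructor accepts. A minimal sketch, with illustrative result values rather than numbers from a real run:

```python
import pandas as pd

# Illustrative results in the list-of-dicts shape discussed above:
# one dict per sub-task, with the metric and timing fields.
results = [
    {"task_name": "glue/cola", "accuracy": 0.0, "latency_in_seconds": 0.098},
    {"task_name": "glue/sst2", "accuracy": 0.5, "latency_in_seconds": 0.114},
    {"task_name": "glue/rte", "accuracy": 0.7, "latency_in_seconds": 0.286},
]

# Each dict becomes a row and each key becomes a column, so the
# suite's output renders as a readable table in docs and notebooks.
df = pd.DataFrame(results)
print(df.to_string(index=False))
```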
```python
self.preprocessor = lambda x: {"text": x["text"].lower()}
self.suite = [
    SubTask(
        task_type="text-classification",
```
Can you list the available task types, maybe? Or redirect to their docs?
I've added a link to the supported tasks on the Evaluator docs so we don't have to maintain the list in two places!
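For context, the truncated snippet above comes from a class-based suite definition. A fuller sketch, assuming the `evaluate` library's `EvaluationSuite`/`SubTask` pattern with illustrative dataset, subset, and column choices (not the exact contents of this PR):

```python
import evaluate
from evaluate.evaluation_suite import SubTask


class Suite(evaluate.EvaluationSuite):
    def __init__(self, name):
        super().__init__(name)
        # Preprocessing applied to every example before evaluation.
        self.preprocessor = lambda x: {"text": x["text"].lower()}
        # One SubTask per dataset; task_type must be one of the task
        # types supported by the Evaluator (see the Evaluator docs).
        self.suite = [
            SubTask(
                task_type="text-classification",
                data="glue",              # illustrative dataset name
                subset="sst2",            # illustrative subset
                split="validation[:10]",  # small slice for a quick run
                args_for_task={
                    "metric": "accuracy",
                    "input_column": "sentence",
                    "label_column": "label",
                },
            )
        ]
```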
lvwerra left a comment
Hi @mathemakitten, this is great, thanks for working on it. I left a few comments; happy to discuss further if you want.
Co-authored-by: Leandro von Werra <[email protected]>
Force-pushed from 0527ff4 to 7da77ed
lvwerra left a comment
Just a few minor comments, then we can merge 🚀
```python
>>> suite = EvaluationSuite.load('mathemakitten/glue-evaluation-suite')
>>> results = suite.run("gpt2")
| accuracy | total_time_in_seconds | samples_per_second | latency_in_seconds | task_name |
```
Would do the same here and remove the table from the codeblock so it's actually rendered as a nice table.
Co-authored-by: Leandro von Werra <[email protected]>
Adding docs for EvaluationSuite.