Model | AGIEval | GPT4All | TruthfulQA | Bigbench |
---|---|---|---|---|
recoilme-gemma-2-9B-v0.4 | 45.94 | Error: File does not exist | 59.03 | Error: File does not exist |
Task | Version | Metric | Value | Stderr | |
---|---|---|---|---|---|
agieval_aqua_rat | 0 | acc | 26.77 | ± | 2.78 |
acc_norm | 25.98 | ± | 2.76 | ||
agieval_logiqa_en | 0 | acc | 41.32 | ± | 1.93 |
acc_norm | 40.25 | ± | 1.92 | ||
agieval_lsat_ar | 0 | acc | 23.04 | ± | 2.78 |
acc_norm | 21.74 | ± | 2.73 | ||
agieval_lsat_lr | 0 | acc | 51.76 | ± | 2.21 |
acc_norm | 49.22 | ± | 2.22 | ||
agieval_lsat_rc | 0 | acc | 68.77 | ± | 2.83 |
acc_norm | 65.43 | ± | 2.91 | ||
agieval_sat_en | 0 | acc | 83.98 | ± | 2.56 |
acc_norm | 82.04 | ± | 2.68 | ||
agieval_sat_en_without_passage | 0 | acc | 40.29 | ± | 3.43 |
acc_norm | 38.35 | ± | 3.40 | ||
agieval_sat_math | 0 | acc | 49.09 | ± | 3.38 |
acc_norm | 44.55 | ± | 3.36 |
Average: 45.94%
Average: Error: File does not exist%
Task | Version | Metric | Value | Stderr | |
---|---|---|---|---|---|
truthfulqa_mc | 1 | mc1 | 41.37 | ± | 1.72 |
mc2 | 59.03 | ± | 1.56 |
Average: 59.03%
Average: Error: File does not exist%
Average score: Not available due to errors
Elapsed time: 03:29:12