Discrepancy in benchmark score (BFCL-v3)

#18
by mmrbulbul - opened

The model card says the model obtains a score of 61.9 on BFCL-v3 benchmark

But the benchmark website says otherwise. 33.04

I wonder why there is such a large gap in these two reports.

image

Sign up or log in to comment