We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
[Update] Add ESOV_S
[VLM] Support o1
[Dataset] Add AesBench VAL (#240) * Add files via upload * update aesbench * update init * update dataset config * update md5 * update --------- Co-authored-by: kennymckormick <[email protected]>
[Result] Update Evaluation Results (#60) * update MME, SEEDBench * update results * update LLaVABench * fix * update AI2D accuracy * update LLaVABench * update README * update teaser link