Skip to content

0.5.1.post1

Latest

Choose a tag to compare

@Myhs-phz Myhs-phz released this 17 Oct 10:17
· 6 commits to main since this release
ecc86a2

OpenCompass v0.5.1 Release Notes

🌟 Highlights

✨ A New Method to quickly Integrate and Evaluate Your Datasets: Added a fast dataset integration and evaluation method based on ChatML format, simplifying the previously complex dataset integration process.
✨ New Datasets: Integrated new benchmarks including SeedBench and BeyondAIME.
✨ Infrastructure & Enhancements: Fixed several bugs and updated CI.


πŸš€ New Features

πŸ”§ Introduced a new approach for dataset integration and evaluation based on ChatML Template with evaluation examples (#2277).
πŸ”§ Added SeedBench dataset (#2020).
πŸ”§ Added BeyondAIME dataset (#2192).


πŸ› Bug Fixes

πŸ”§ Fixed Module Registers(#2262, #2266)
πŸ”§ Fixed duplicate engine config update in TurboMindModelwithChatTemplate (#2276)
πŸ”§ Fixed torchrun to avoid unexpected PATH environment (#2269)
πŸ”§ Fixed the None return value case in cascade evaluator (#2211)


βš™ Enhancements and Refactors

βš™ Infrastructure Refactors:

  • Update rjob.py and subjective_eval.py (#2263)

βš™ CI/CD Improvements:

  • Updated testcase (#2257)
  • Updated pr_test (#2281)
  • Fixed pr_test installation (#2290)

πŸŽ‰ Welcome New Contributors

A warm welcome and special thanks to our newest contributors who made this release possible:


Full Changelog: 0.5.0...0.5.1.post1

Thank you for using OpenCompass! These updates empower deeper insights and more reliable evaluations. Keep exploring, and stay tuned for future innovations! 🌟