+17.72% on MuSR. +8.16% on MATH. Five out of six benchmarks improved, with only IFEval taking a small hit. The average put it at #1 on the leaderboard.