Еврокомиссия оказалась в курсе настоящей причины остановки «Дружбы»

2026年2月26日 · 张伟 · 来源：dev快讯

Burke confirms Trump has called Albanese

Sarvam 30B performs strongly on multi-step reasoning benchmarks, reflecting its ability to handle complex logical and mathematical problems. On AIME 25, it achieves 88.3 Pass@1, improving to 96.7 with tool use, indicating effective integration between reasoning and external tools. It scores 66.5 on GPQA Diamond and performs well on challenging mathematical benchmarks including HMMT Feb 2025 (73.3) and HMMT Nov 2025 (74.2). On Beyond AIME (58.3), the model remains competitive with larger models. Taken together, these results indicate that Sarvam 30B sustains deep reasoning chains and expert-level problem solving, significantly exceeding typical expectations for models with similar active compute.

06版，推荐阅读新收录的资料获取更多信息

美國總統特朗普表示，與伊朗的戰爭將「很快結束」，但同時稱美國「贏得還不夠」，目標是取得「終極勝利」；，详情可参考新收录的资料

Police say arrests have been made after Scottish Cup tie

Samsung’s Mario

网友评论