Home > Quick > Body

美团LongCat发布VitaBench 2.0

clock
2026-06-25 12:55:36
美团LongCat团队推出VitaBench 2.0,这是首个真实生活场景下面向长期动态用户建模的智能体评测基准,用于系统性评测大语言模型在长期、真实、动态用户互动中的个性化与主动性能力。据36氪报道,该团队去年10月已发布VitaBench 1.0。
Disclaimer:
1. The information provided does not constitute investment advice. Investors should make independent decisions and bear all risks themselves.
2. The copyright of this content belongs to the original author. The views expressed herein are solely those of the author and do not represent the stance or position of this website.
New Tab Page - Desk3 | Plugin
Stay ahead of the game in the cryptocurrency space.