OpenAI Open Source BrowseComp, Reinventing Agent Browser Reviews
2025-04-10 20:46:09
At 2 am today, OpenAI open-sourced a test benchmark dedicated to the function of the agent browser - BrowseComp. This test benchmark is very difficult. Even OpenAI's own GPT-4o and GPT-4.5 have an accuracy rate of only 0.6% and 0.9% almost 0, and even using GPT-4o with browser function is only 1.9%. But OpenAI's latest agent model Deep Research has an accuracy rate of 51.5%, which is excellent in autonomous search, information integration, and accuracy calibration. (AIGC Open Community)
Disclaimer:
1. The information provided does not constitute investment advice. Investors should make independent decisions and bear all risks themselves.
2. The copyright of this content belongs to the original author. The views expressed herein are solely those of the author and do not represent the stance or position of this website.
Previous article:
美联储柯林斯:关税可能使核心通胀在今年“远超”3%