OpenAI Open-Sources BrowseComp, Reinventing Agent Browser Evaluation

2025-04-10 20:46:09


At 2 a.m. today, OpenAI open-sourced BrowseComp, a benchmark dedicated to testing the browsing capabilities of AI agents. The benchmark is very difficult: OpenAI's own GPT-4o and GPT-4.5 reach accuracy rates of only 0.6% and 0.9% respectively, close to zero, and even GPT-4o with browsing enabled reaches only 1.9%. OpenAI's latest agent model, Deep Research, reaches an accuracy rate of 51.5% and performs well in autonomous search, information integration, and accuracy calibration. (AIGC Open Community)

