Home > Quick > Body

AI TRENDS | OpenAI Launches LifeSciBench to Evaluate AI Performance in Real Research Tasks

clock
2026-06-19 15:33:47
OpenAI has released a new evaluation benchmark, LifeSciBench, designed to measure AI systems’ capabilities in real scientific research settings. According to Odaily, LifeSciBench is built on 750 expert-written tasks spanning seven research workflows and seven biology domains.

The tasks were contributed by 173 researchers with PhDs and experience in biotechnology or the pharmaceutical industry. The benchmark focuses on assessing complex research abilities, including evidence integration, experimental design, data analysis, scientific reasoning, and research communication, rather than single fact-based questions.

More than 79% of the tasks involve multi-step reasoning, with an average of about four reasoning steps per task. The benchmark also includes 1,062 real research-related data attachments, such as papers, charts, sequence data, and structure files.
Disclaimer:
1. The information provided does not constitute investment advice. Investors should make independent decisions and bear all risks themselves.
2. The copyright of this content belongs to the original author. The views expressed herein are solely those of the author and do not represent the stance or position of this website.
New Tab Page - Desk3 | Plugin
Stay ahead of the game in the cryptocurrency space.