Loading...
アイコン

Discover AI

チャンネル登録者数 7.79万人

2838 回視聴 ・ 122いいね ・ 2025/08/31

Comparison of LLMs with tools (agents) to the DEEP Research mode by OPENai and GooGLE.

The authors assessed several state-of-the-art (SOTA) LLMs, originally lacking native Internet access, by augmenting them with an external search engine and link reader to enable the web-retrieval capabilities essential for completing our evaluation tasks. "In the evaluation of base models, we integrated search and link-reading tools using each model's native function call interface."

Additional info by the authors: "Specifically, we used SerpAPI for Google Search access and Firecrawl for retrieving web pages in Markdown format."

All rights w/ authors:
ReportBench: Evaluating Deep Research Agents via Academic Survey Tasks
Minghao Li, Ying Zeng, Zhihao Cheng, Cong Ma, Kai Jia
from
ByteDance BandAI

#aireasoning
#aiagents
#scienceexplained #trust

コメント

コメントを取得中...

コントロール
設定

使用したサーバー: directk