Computer Science > Computation and Language

arXiv:2512.20491 (cs)

[Submitted on 23 Dec 2025 (v1), last revised 29 Dec 2025 (this version, v4)]

Title: Step-DeepResearch Technical Report

Abstract: As LLMs shift toward autonomous agents, Deep Research has emerged as a pivotal metric. However, existing academic benchmarks like BrowseComp often fail to meet real-world demands for open-ended research, which requires robust skills in intent recognition, long-horizon decision-making, and cross-source verification. To address this, we introduce Step-DeepResearch, a cost-effective, end-to-end agent. We propose a Data Synthesis Strategy Based on Atomic Capabilities to reinforce planning and report writing, combined with a progressive training path from agentic mid-training to SFT and RL. Enhanced by a Checklist-style Judger, this approach significantly improves robustness. Furthermore, to bridge the evaluation gap in the Chinese domain, we establish ADR-Bench for realistic deep research scenarios. Experimental results show that Step-DeepResearch (32B) scores 61.4% on Scale AI Research Rubrics. On ADR-Bench, it significantly outperforms comparable models and rivals SOTA closed-source models like OpenAI and Gemini DeepResearch. These findings prove that refined training enables medium-sized models to achieve expert-level capabilities at industry-leading cost-efficiency.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2512.20491 [cs.CL]
	(or arXiv:2512.20491v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2512.20491

Submission history

From: Ruihang Miao
[v1] Tue, 23 Dec 2025 16:32:27 UTC (407 KB)
[v2] Wed, 24 Dec 2025 15:52:31 UTC (314 KB)
[v3] Thu, 25 Dec 2025 08:38:54 UTC (307 KB)
[v4] Mon, 29 Dec 2025 08:44:40 UTC (307 KB)
