ElliottYan

Jianhao Yan ElliottYan

Now PhD Student at WestlakeNLP/Zhejiang University. Former Researcher @ Wechat AI.

54 followers · 18 following

Zhejiang University

Achievements

x2 x2

Achievements

x2 x2

Stars

AgentR1 / Claw-R1

Claw-R1: Empowering OpenClaw with Advanced Agentic RL.

Python 163 8 Updated Apr 7, 2026

modelscope / AgentEvolver

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python 1,345 153 Updated Apr 1, 2026

danieldritter / OAPL

Python 24 3 Updated Feb 24, 2026

FloyedShen / VESPO

Python 30 4 Updated Feb 12, 2026

kanishkg / endless-terminals

Python 95 13 Updated Mar 31, 2026

PRIME-RL / P1-VL

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

15 2 Updated Feb 11, 2026

ssmisya / AdaReasoner

[ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning".

Jupyter Notebook 79 6 Updated Feb 27, 2026

Danau5tin / terminal-bench-rl

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 366 23 Updated Aug 24, 2025

camel-ai / seta

💻 SETA: Scaling Environments for Terminal Agents

Python 88 12 Updated Feb 16, 2026

OpenBMB / AgentCPM

An End-to-End Infrastructure for Training and Evaluating Various LLM Agents

Python 782 67 Updated Feb 9, 2026

THUDM / AgentRL

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 265 19 Updated Jan 17, 2026

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 5,000 453 Updated Apr 7, 2026

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,507 330 Updated Apr 7, 2026

Continual-Intelligence / SEAL

Self-Adapting Language Models

Python 1,734 305 Updated Aug 1, 2025

Joshua-Ren / Learning_dynamics_LLM

Jupyter Notebook 214 13 Updated Dec 23, 2025

tatsu-lab / alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,964 306 Updated Aug 9, 2025

SalesforceAIResearch / UserBench

Python 58 4 Updated Aug 5, 2025

axon-rl / gem

A Gym for Agentic LLMs

Python 476 31 Updated Jan 21, 2026

princeton-pli / RLMT

[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"

Python 127 6 Updated Oct 27, 2025

TIGER-AI-Lab / General-Reasoner

General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]

Python 224 14 Updated Nov 27, 2025

OpenBMB / RLPR

Extrapolating RLVR to General Domains without Verifiers

Python 200 11 Updated Aug 12, 2025

TrustJudge / TrustJudge

🎉 TrustJudge is accepted to ICLR 2026!

Python 46 2 Updated Sep 27, 2025

EvoAgentX / EvoAgentX

🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents

Python 2,710 227 Updated Apr 7, 2026

sierra-research / tau-bench

Code and Data for Tau-Bench

Python 1,168 189 Updated Mar 18, 2026

tokenbender / avataRL

rl from zero pretrain, can it be done? yes.

Python 291 21 Updated Sep 28, 2025

Chenluye99 / PROF

Process Consistency Filter: Improve Reasoning Quality for LLM Reinforcement Learning

11 Updated Sep 4, 2025

lyh6560new / implicit-user-feedback

This is the official github repo for paper "mplicit User Feedback in Human-LLM Dialogues: Informative to Understand Users yet Noisy as a Learning Signal"

2 1 Updated Sep 4, 2025

Osilly / Awesome-Interleaving-Reasoning

Interleaving Reasoning: Next-Generation Reasoning Systems for AGI

267 11 Updated Oct 17, 2025

XiaoYee / Awesome_Efficient_LRM_Reasoning

😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond

353 12 Updated Jan 22, 2026

ElliottYan / RobustKeyEdit

Official repository for ACL 2025 Main Conference Paper "Keys to Robust Edits: From Theoretical Insights to Practical Advances"

Python 3 Updated May 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jianhao Yan ElliottYan

Achievements

Achievements

Block or report ElliottYan

Stars

AgentR1 / Claw-R1

modelscope / AgentEvolver

danieldritter / OAPL

FloyedShen / VESPO

kanishkg / endless-terminals

PRIME-RL / P1-VL

ssmisya / AdaReasoner

Danau5tin / terminal-bench-rl

camel-ai / seta

OpenBMB / AgentCPM

THUDM / AgentRL

inclusionAI / AReaL

NVIDIA-NeMo / RL

Continual-Intelligence / SEAL

Joshua-Ren / Learning_dynamics_LLM

tatsu-lab / alpaca_eval

SalesforceAIResearch / UserBench

axon-rl / gem

princeton-pli / RLMT

TIGER-AI-Lab / General-Reasoner

OpenBMB / RLPR

TrustJudge / TrustJudge

EvoAgentX / EvoAgentX

sierra-research / tau-bench

tokenbender / avataRL

Chenluye99 / PROF

lyh6560new / implicit-user-feedback

Osilly / Awesome-Interleaving-Reasoning

XiaoYee / Awesome_Efficient_LRM_Reasoning

ElliottYan / RobustKeyEdit