Seungone Kim PRO

seungone

https://seungonekim.github.io/

AI & ML interests

Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment

Recent Activity

updated a dataset about 12 hours ago

prometheus-eval/peerreview-bench

published a dataset 6 days ago

prometheus-eval/peerreview-bench

upvoted a paper about 1 month ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Papers 38

arxiv:2511.22173

arxiv:2510.24684

arxiv:2509.21451

arxiv:2508.13141

Spaces 2

My Argilla

Test3

Models 1

seungone/skywork-reward-replicate

Text Classification • 8B • Updated Dec 11, 2024 • 6

Datasets 5

seungone/ablation1_math_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 5.56k • 22

seungone/ablation3_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 24.8k • 25

seungone/ablation2_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 5.99k • 22

seungone/ablation1_code_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 10k • 11

seungone/final-math-claude3.5_sonnet-10000

Viewer • Updated Sep 16, 2024 • 10k • 28 • 1