https://scholar.google.com/citations?user=vZPl_oQAAAAJ&hl=en
Stars
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
tmlr-group / Co-rewarding
Forked from resistzzz/Co-rewarding. [arXiv:2508.00410] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"
verl: Volcano Engine Reinforcement Learning for LLMs
[ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?"
[ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"
Python tool for converting files and office documents to Markdown.
An easy-to-use Python framework to generate adversarial jailbreak prompts.
A framework for the evaluation of autoregressive code generation language models.
[ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"
Paper list, datasets, and tools for radiology report generation
[ACL 2025] "SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities"
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
The Google Scholar PDF Reader browser extension, now with annotations!
[NeurIPS 2023] Combating Bilateral Edge Noise for Robust Link Prediction
[arXiv:2411.10023] "Model Inversion Attacks: A Survey of Approaches and Countermeasures"
A curated list of resources for activation engineering
Fully open data curation for reasoning models
A reading list on LLM based Synthetic Data Generation 🔥
Recipes to train reward models for RLHF.
GenRM-CoT: Data release for verification rationales
A bibliography and survey of the papers surrounding o1
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
Train transformer language models with reinforcement learning.
[TMI 2024] "SPIRiT-Diffusion: Self-Consistency Driven Diffusion Model for Accelerated MRI"
tmlr-group / CoPA
Forked from HongduanTian/CoPA. [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"