10000
Skip to content
View Aboriginer's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Aboriginer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,211 403 Updated Dec 15, 2025

[arXiv:2508.00410] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"

Python 45 1 Updated Oct 6, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,764 2,888 Updated Dec 24, 2025

[ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?"

Python 49 2 Updated Oct 8, 2025

[ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"

Python 31 4 Updated Nov 23, 2025

Python tool for converting files and office documents to Markdown.

Python 84,555 4,869 Updated Dec 1, 2025

AllenAI's post-training codebase

Python 3,472 478 Updated Dec 23, 2025

An easy-to-use Python framework to generate adversarial jailbreak prompts.

Python 790 72 Updated Mar 27, 2025

A framework for the evaluation of autoregressive code generation language models.

Python 1,009 253 Updated Jul 22, 2025

[ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"

Jupyter Notebook 44 6 Updated Aug 16, 2025

paper list, dataset, and tools for radiology report generation

329 33 Updated Dec 20, 2025

[ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

Python 27 3 Updated Apr 2, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,948 288 Updated May 15, 2025

The Google Scholar PDF Reader browser extension, now with annotations!

JavaScript 203 18 Updated Nov 25, 2025
Python 161 11 Updated Jan 21, 2025

s1: Simple test-time scaling

Python 6,621 764 Updated Jun 25, 2025

[NeurIPS 2023] Combating Bilateral Edge Noise for Robust Link Prediction

Python 13 2 Updated Nov 3, 2023

[arXiv:2411.10023] "Model Inversion Attacks: A Survey of Approaches and Countermeasures"

212 15 Updated May 30, 2025

A curated list of resources for activation engineering

119 5 Updated Oct 2, 2025

Fully open data curation for reasoning models

Python 2,174 182 Updated Dec 2, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,493 90 Updated Jun 5, 2025

Recipes to train reward model for RLHF.

Python 1,490 107 Updated Apr 24, 2025

GenRM-CoT: Data release for verification rationales

67 6 Updated Oct 16, 2024

A bibliography and survey of the papers surrounding o1

TeX 1,216 51 Updated Nov 16, 2024

[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"

Python 37 3 Updated Jul 18, 2025

Train transformer language models with reinforcement learning.

Python 16,763 2,374 Updated Dec 24, 2025

[TMI 2024] "SPIRiT-Diffusion: Self-Consistency Driven Diffusion Model for Accelerated MRI"

< 393D span itemprop="programmingLanguage">Python 12 1 Updated Aug 1, 2025

[NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"

Jupyter Notebook 10 Updated Nov 15, 2024
Next
0