Highlights
- Pro
Pinned Loading
-
behavior-edit
behavior-edit PublicModel Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm [AAAI'26 Oral]
Python 4
-
HalluEditBench
HalluEditBench PublicCan Knowledge Editing Really Correct Hallucinations? (ICLR 2025)
-
llm-editing/editing-attack
llm-editing/editing-attack PublicCode and dataset for the paper: "Can Editing LLMs Inject Harm?"
-
survey-authorship
survey-authorship PublicPaper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exploration)"
TeX 18
-
authorship-llm
authorship-llm PublicCan Large Language Models Identify Authorship? (EMNLP 2024 Findings)
Jupyter Notebook 11
If the problem persists, check the GitHub status page or contact support.