pillowsofwind

Rongwu Xu pillowsofwind

PhD student at UWNLP

Knowledge-Conflicts-Survey Public

[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"

148 stars, 7 forks

llms-believe-the-earth-is-flat Public

[ACL 2024] The official GitHub repo for the paper "The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation"

Python, 79 stars, 6 forks

Course-Correction Public

[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"

Python, 19 stars, 1 fork

LLM-CBRN-Risks Public

[ACL 2025 Findings] The official GitHub repo for the paper "Nuclear Deployed: Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents"

Python, 19 stars, 2 forks

DebateQA Public

The official GitHub repo for the paper "DebateQA: Evaluating Question Answering on Debatable Knowledge"

Python, 9 stars

CoT-Attack Public

[ACL 2024 Findings] The official GitHub repo for the paper "Preemptive Answer 'Attacks' on Chain-of-Thought Reasoning"

Python, 4 stars