Aidan Kierans - AI Safety Researcher

About

I am a Ph.D. student at the University of Connecticut (expected graduation 2027), advised by Prof. Shiri Dori-Hacohen in the Reducing Information Ecosystem Threats (RIET) Lab, and I work closely with MIT's Algorithmic Alignment Group. My research focuses on what it takes for AI systems to reason well about moral issues.

My earlier work on quantifying misalignment (AAAI 2025) addressed preference divergence between agents. My recent position papers argue that alignment requires more than matching preferences: AI systems also need to follow appropriate reasoning norms. I'm now building knowledge graph infrastructure for philosophical argumentation to support this broader vision of moral reasoning for AI.

Beyond research, I work on AI safety evaluation and policy. I have contributed to frontier model evaluations through the OpenAI Red Teaming Network and Nemesys Insights, and to governance discussions through the Wilson Center's Pathways to AI Policy program, a Google Policy Fellowship at the Center for Democracy & Technology (CDT), and NIST workshops. I also founded Beneficial and Ethical AI at UConn (BEACON) and led the 2025 Machine Ethics and Reasoning Workshop.

Publications

Position: Aligning AI Requires Automating Reasoning Norms

Kierans, A., Ghosh, A., Dori-Hacohen, S.
Submitted to ICML 2026
[Preprint]

Position: Why LLMs Should Be Reasonably Morally Inconsistent

Stenseke, J., Kierans, A., Pres, I., Hadfield-Menell, D.
Submitted to ICML 2026
[Preprint]

Catastrophic Liability: Managing Systemic Risks in Frontier AI Development

Kierans, A., Ritticher, K., Sonsayar, U., Ghosh, A.
TAIS 2025 and EAAMO 2025
[Preprint]

Quantifying Misalignment Between Agents: Towards a Sociotechnical Understanding of Alignment

Kierans, A., Ghosh, A., Hazan, H., Dori-Hacohen, S.
AAAI 2025, Special Track on AI Alignment
[Paper] [Preprint]

Benchmarked Ethics: A Roadmap to AI Alignment, Moral Knowledge, and Control

Kierans, A.
AIES 2023
[Paper]

Bootstrap percolation via automated conjecturing

Bushaw, N., Conka, B., Gupta, V., Kierans, A., Lafayette, H., Larson, C., et al.
Ars Mathematica Contemporanea, 2023
[Paper]

Selected Experience

Graduate Research Assistant, UConn RIET Lab (2022–present) — AI alignment research
Independent Contractor, OpenAI Red Teaming Network (2024–present) — Evaluated frontier models including computer-using agents
Founder & President, Beneficial and Ethical AI at UConn (BEACON) (2024–2025) — Student group running AI safety fellowship cohorts
Teaching Assistant, ML Alignment & Theory Scholars (MATS) (2024–2025) — Facilitated AI safety strategy sessions
Google Policy Fellow, Center for Democracy & Technology (2023) — Supported Senate testimony on AI and human rights

See CV for complete history.

Selected Recognition

Nemesys Insights Red Teaming Exercise — Biological Risk Category Winner (2025)
Wilson Center Pathways to AI Policy — Selected Participant (2024–2025)
ICLP Safe and Trustworthy AI Workshop — Best Poster Award (2023)
NeurIPS ML Safety Workshop — AI Risk Analysis Award (2022)

Media

The Conversation: "Getting AIs working toward human goals — study shows how to measure misalignment"
UConn Daily Campus: "Artificial Intelligence poses novel social threats, researchers prepare for the worst"
Future of Life Institute: Invited talk for AI Existential Safety Community
UConn Center for Excellence in Teaching and Learning: Led "mAI dAI" seminar series on AI alignment and malicious misuse for university instructional staff