Trustworthy AI · NLP ETH Zürich × University of Zürich

AI systems that know when to be trusted.

I am a PhD student mining the uncertainty hidden inside large language models and exposing it for human supervision.

A language model's reasoning runs as a steady signal until it hits a spike of uncertainty, where a human reviewer steps in to check it.

About · Research

From a model's hidden uncertainty to human oversight.

I am a dedicated AI researcher passionate about building trustworthy AI systems that can make the future better. I am currently pursuing my PhD (2023–2027, expected) under the joint supervision of Prof. Elliott Ash and Prof. Mrinmaya Sachan at ETH Zurich, and Prof. Markus Leippold at University of Zurich. I divide my time equally between both institutions.

I build AI systems that know when to be trusted: I extract the uncertainty a model already carries and expose it for human supervision. One requirement sits behind everything — a model should report low confidence whenever it is likely to be wrong — and that single reporting ability is what makes oversight possible. What changes is how tractable the problem is, and that turns on whether the uncertainty is objectively checkable.

These methods matter most where the stakes are high and people stay accountable for the answer — which is why I ground them in climate and law. From corporate climate disclosures to legal reasoning, an unverified claim can sway policy or judgment, so domain experts need to see exactly how far to trust a model. I like to aim high while keeping my feet firmly on the ground.

Where the signal is checkable

Verifiers that report confidence

Verifying a math reasoning step, or whether retrieved evidence is relevant: labels are cheap, models are strong, and uncertainty can be trained and rigorously tested. Here I build non-agentic verifiers like ReProbe, which reads a model's internal states mid-reasoning to tell when an answer has settled.

Where the labels conflict

Surfacing contested judgments

Subjective classification and conflicting human labels — signal, not noise. I build uncertainty-aware classifiers, evidence-grounded specialists whose every claim traces to its source, and interfaces that surface the edge cases people must adjudicate.

Projects & community service

Open models, climate NLP, and research communities.

Community collaboration

Apertus: Democratizing Open and Compliant LLMs

Community collaboration on democratizing open and compliant LLMs for global language environments, with contributions to trustworthiness post-training.

Technical report →

Community collaboration

When AI Benchmarks Plateau

Community collaboration accepted to ICML 2026 on benchmark saturation and how plateauing scores affect evaluation practice.

ICML 2026 →

Workshop

ClimateNLP at ACL 2025

Organizing the second ClimateNLP workshop at ACL 2025, Vienna.

Workshop

ClimateNLP at ACL 2024

Organized the first ClimateNLP workshop at ACL 2024, Bangkok.

Publications

Selected research

Mining uncertainty from language models — and the verifiers, annotations, and interfaces that surface it for people.

Supervised Master Thesis & Mentorship

02
2026 AI4Law @ ICML 2026

Unlocking LLM Legal Reasoning with IRAC-Constrained Chain-of-Thought

Master student · Adam Rahmoun

Full publication list

Accepted  In review / preprint

Education

Academic path

Mar 2023 – Present

PhD @ ETH D-GESS

ETH Zürich & University of Zürich

Sept 2021 – Sept 2022

MSc in Data Science & Machine Learning

University College London

Sep 2017 – Sep 2021

BEng in Computer Science

University of Hong Kong