Trustworthy AI · NLP ETH Zürich × University of Zürich

AI systems that know when to be trusted.

I am a PhD student mining the uncertainty hidden inside large language models and exposing it for human supervision.

A language model's reasoning runs as a steady signal until it hits a spike of uncertainty, where a human reviewer steps in to check it.

Email CV (PDF) Scholar

ReProbe: Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of Large Language Models

Jingwei Ni et al. · 2025

ACL 2026 Oral Read paper →

About · Research

From a model's hidden uncertainty to human oversight.

I am a dedicated AI researcher passionate about building trustworthy AI systems that can make the future better. I am currently pursuing my PhD (2023–2027, expected) under the joint supervision of Prof. Elliott Ash and Prof. Mrinmaya Sachan at ETH Zurich, and Prof. Markus Leippold at University of Zurich. I divide my time equally between both institutions.

I build AI systems that know when to be trusted: I extract the uncertainty a model already carries and expose it for human supervision. A trustworthy model should report low confidence whenever it is likely to be wrong — and that ability to flag its own unreliability is what makes oversight possible. What changes from setting to setting is how tractable the problem is, and that turns on whether the uncertainty is objectively checkable.

Where the signal is checkable

Verifiers that report confidence

Verifying a math reasoning step, or whether retrieved evidence is relevant: labels are cheap, models are strong, and uncertainty can be trained and rigorously tested. Here I build non-agentic verifiers like ReProbe, which reads a model's internal states mid-reasoning to tell when an answer has settled.

Where the labels conflict

Surfacing contested judgments

Subjective classification and conflicting human labels — signal, not noise. I build uncertainty-aware classifiers, evidence-grounded specialists whose every claim traces to its source, and interfaces that surface the edge cases people must adjudicate.

Read the full research statement →

Projects & community service

Open models, climate NLP, and research communities.

Community collaboration

Apertus: Democratizing Open and Compliant LLMs

Community collaboration on democratizing open and compliant LLMs for global language environments, with contributions to trustworthiness post-training.

Technical report →

Community collaboration

When AI Benchmarks Plateau

Community collaboration accepted to ICML 2026 on benchmark saturation and how plateauing scores affect evaluation practice.

ICML 2026 →

Workshop

ClimateNLP at ACL 2025

Organizing the second ClimateNLP workshop at ACL 2025, Vienna.

Proceedings Workshop site

Workshop

ClimateNLP at ACL 2024

Organized the first ClimateNLP workshop at ACL 2024, Bangkok.

Proceedings Workshop site

Publications

Selected research

Mining uncertainty from language models — and the verifiers, annotations, and interfaces that surface it for people.

2025 ACL 2026 Oral

ReProbe: Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of Large Language Models

Reads the model's own internal states mid-reasoning to tell when an answer has settled, scaling test-time reasoning only as far as the model actually needs.

Jingwei Ni et al.

2025 EACL 2026 Oral

Can Reasoning Help Large Language Models Capture Human Annotator Disagreement?

Tests whether reasoning lets a model recognize when a question is genuinely contested, instead of collapsing real human disagreement into one overconfident answer.

Jingwei Ni*, Yu Fan* et al.

2025 NAACL 2025 Long

DIRAS: Efficient LLM Annotation of Document Relevance for Retrieval Augmented Generation

Trains small, cheap models to judge whether retrieved documents are truly relevant, giving RAG an external check on its evidence before the generator ever sees it.

Jingwei Ni et al.

2025 EMNLP Demo

Co-DETECT: Collaborative Discovery of Edge Cases in Text Classification

Pairs human annotators with an LLM to surface the edge cases a classifier quietly gets wrong, pointing oversight straight at the model's blind spots.

Chenfei Xiong*, Jingwei Ni* et al.

2024 ACL 2024 Long

Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering

Trains open QA specialists to answer strictly from the evidence they are given and resist being misled when it is noisy or missing — keeping every answer traceable to its source.

Tobias Schimanski*, Jingwei Ni* et al.

2024 ACL 2024 Long

AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators

Cross-checks several LLM annotators against each other to reliably flag which sentences are checkable factual claims — the first filter any fact-verification pipeline needs.

Jingwei Ni et al.

Supervised Master Thesis & Mentorship

2026 ACL 2026

Tackling the Root of Misinformation by Teaching Laypeople about Logical Fallacies via Socratic Questioning and Critical Argumentation

Master student · Minjing Shi

2026 AI4Law @ ICML 2026

Unlocking LLM Legal Reasoning with IRAC-Constrained Chain-of-Thought

Master student · Adam Rahmoun

2025 Code & Data

From Dataset to Optimization: A Benchmarking Framework for Information Retrieval in the Particle Accelerator Domain

Master student · Qing Dai

Full publication list

2025 ACL 2026 Oral

Academic path

Mar 2023 – Present

PhD @ ETH D-GESS

ETH Zürich & University of Zürich

Sept 2021 – Sept 2022

MSc in Data Science & Machine Learning

University College London

Sep 2017 – Sep 2021

BEng in Computer Science

University of Hong Kong