Yik Siu Chan

Hi! My name is Yik Siu, and I am a Computer Science master’s student at Brown University. My research interests lie in building reliable and interpretable AI, which I believe requires both rigorous evaluation of models’ external behavior and deeper understanding of their internal states.

I have been fortunate to work with Atticus Geiger, Prof. Stephen Bach and Prof. Ellie Pavlick at Brown, Prof. Marzyeh Ghassemi at MIT, and the Personal Robots Group at MIT Media Lab. I did my B.A. in Computer Science and Economics at Wellesley College, and I am very grateful to the UWC Davis Scholarship and the Brown CS Department for supporting my studies.

news

Sep 20, 2025	Excited to serve as a reviewer for ICLR and for the WiML workshop at NeurIPS.
Jul 15, 2025	Attending ICML in Vancouver to present the Speak Easy and MIB papers.
Dec 11, 2024	Attending NeurIPS in Vancouver (my first conference!) to present our oral paper MDAgents.

papers

NeurIPS FoRLM

Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models

Yik Siu Chan^*, Zheng-Xin Yong^*, and Stephen H. Bach

NeurIPS Foundations of Reasoning in Language Models, 2025

arXiv Code

ICML 2025

Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions

Yik Siu Chan^*, Narutatsu Ri^*, Yuxin Xiao^*, and Marzyeh Ghassemi

The Forty-Second International Conference on Machine Learning, 2025

arXiv Code

ICML 2025

MIB: A Mechanistic Interpretability Benchmark

Aaron Mueller^*, Atticus Geiger^*, Sarah Wiegreffe, Dana Arad, Iván Arcuschin, Adam Belfki, Yik Siu Chan, Jaden Fiotto-Kaufman, Tal Haklay, Michael Hanna, Jing Huang, Rohan Gupta, Yaniv Nikankin, Hadas Orgad, Nikhil Prakash, Anja Reusch, Aruna Sankaranarayanan, Shun Shao, Alessandro Stolfo, Martin Tutek, Amir Zur, David Bau, and Yonatan Belinkov

The Forty-Second International Conference on Machine Learning, 2025

arXiv Website

NeurIPS 2024

MDAgents: An Adaptive Collaboration of LLMs for Medical Decision Making

Yubin Kim, Chanwoo Park, Hyewon Jeong, Yik Siu Chan, Xuhai Xu, Daniel McDuff, Hyeonhoon Lee, Marzyeh Ghassemi, Cynthia Breazeal, and Hae Won Park

The Thirty-eighth Annual Conference on Neural Information Processing Systems (Oral), 2024

arXiv Website