Yik Siu Chan

profile_pic.jpg

Hi! My name is Yik Siu, and I am a Computer Science master’s student at Brown University. My research interests lie in evaluating and interpreting deep learning models, with a focus on their potentially dangerous capabilities and alignment with human values.

I have been fortunate to work with Dr. Atticus Geiger, Prof. Stephen Bach at Brown, Prof. Marzyeh Ghassemi at MIT, and the Personal Robots Group at MIT Media Lab, where I began my research journey.

Previously, I graduated from Wellesley College in 2024 with a B.A. in Computer Science and Economics. I am grateful to the Brown CS Department and the UWC Davis Scholarship for supporting my studies.

news

Sep 20, 2025 Excited to serve as a reviewer for ICLR and for the WiML workshop at NeurIPS.
Jul 15, 2025 Attending ICML in Vancouver to present the Speak Easy and MIB papers.
Dec 11, 2024 Attending NeurIPS in Vancouver (my first conference!) to present our oral paper MDAgents.

papers

  1. NeurIPS FoRLM
    Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models
    Yik Siu Chan*, Zheng-Xin Yong*, and Stephen H. Bach
    NeurIPS Foundations of Reasoning in Language Models, 2025
  2. ICML 2025
    Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
    Yik Siu Chan*, Narutatsu Ri*, Yuxin Xiao*, and Marzyeh Ghassemi
    The Forty-Second International Conference on Machine Learning, 2025
  3. ICML 2025
    MIB: A Mechanistic Interpretability Benchmark
    Aaron Mueller*, Atticus Geiger*, Sarah Wiegreffe, Dana Arad, Iván Arcuschin, Adam Belfki, Yik Siu Chan, Jaden Fiotto-Kaufman, Tal Haklay, Michael Hanna, Jing Huang, Rohan Gupta, Yaniv Nikankin, Hadas Orgad, Nikhil Prakash, Anja Reusch, Aruna Sankaranarayanan, Shun Shao, Alessandro Stolfo, Martin Tutek, Amir Zur, David Bau, and Yonatan Belinkov
    The Forty-Second International Conference on Machine Learning, 2025
  4. NeurIPS 2024
    MDAgents: An Adaptive Collaboration of LLMs for Medical Decision Making
    Yubin Kim, Chanwoo Park, Hyewon Jeong, Yik Siu Chan, Xuhai Xu, Daniel McDuff, Hyeonhoon Lee, Marzyeh Ghassemi, Cynthia Breazeal, and Hae Won Park
    The Thirty-eighth Annual Conference on Neural Information Processing Systems (Oral), 2024