body { background: #6D8B74;/*TODO*/ }

Hi! I'm Xiaoyan(Elena) Bai

柏 晓琰

I am currently a first-year PhD student in CS at University of Chicago , advised by Chenhao Tan . I am also a member of Chicago Human+AI lab.

Before that, I earned B.E in Computer Science at CSE at Univeristy of Michigan, Ann Arbor. During my undergrad study, I was honored to be a member of Language and Information Technologies (LIT) lab group to work with Rada Mihalcea. I also worked on NLP inference time efficiency in Atul Prakash's group. I am interested in doing research in Natural Language Processing. At the same time, I like photography and Game Design & Development. Check the games I made!

Research Interest: As machine learning continues to advance, so does the potential for the digital divide and inequality to widen. These issues motivate my research in building transparent, efficient, and more accessible tools for social good. I believe this can be done by building interpretable models or building more cost-efficient methods

profile

Publications

Concept Incongruence: An Exploration of Time and Death in Role Playing (preprint, 2025)
Xiaoyan Bai, Ike Peng*, Aditya Singh*, Chenhao Tan

A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity (ICML, 2024) Oral
Andrew Lee, Xiaoyan Bai, Itamar Pres, Martin Wattenberg, Jonathan K. Kummerfeld, Rada Mihalcea

Learn To be Efficient: Build Structured Sparsity in Large Language Models (Neurips, 2024) Spotlight
Haizhong Zheng, Xiaoyan Bai, Xueshen Liu, Z. Morley Mao, Beidi Chen, Fan Lai, Atul Prakash

Blog

Concept Incongruence is Key to AI Safety and Creativity (July, 2025)
⚡️Ever asked an LLM-as-Marilyn Monroe about the 2020 election? Our paper calls this concept incongruence, common in AI and human creativity. Read my blog to learn what we found, why it matters for AI safety and creativity, and what's next.

Services

  • Reviewer for ICLR (SCOPE and DeLTa workshop) 2025
  • Reviewer for ACL ARR 2024

  • Teaching & Work Experience

  • TA for CMSC25300: Mathematical Foundation of Machine Learning (Fall 2024) -- University of Chicago
  • Grader for EECS 376: Foundations of Computer Science (Fall 2023) -- University of Michigan
  • TA for Serious Game and AI (Summer 2023) -- MIT Beaver Works Summer Institute
  • During the winter of 2021, I worked as an intern in Emogent to help develop human-machine interactive product, Irene, who are also considered as a hyper-realistic artificial intelligence.