Yichi Zhang
Never stop thinking :-)

Hello! My name is Yichi Zhang, and I am doing multimodal research at Bytedance Seed. I got my Ph.D. in Computer Science and Engineering at the University of Michigan, advised by Professor Joyce Chai as a member of the SLED lab. I am broadly interested in the intersection between conversational and perceptual AI research, with a particular focus on real-time interactive systems, language grounding to visual and physical contexts, multi-modal dialog, and embodied AI. I have won the 1st Amazon Alexa Prize SimBot Challenge in 2023 as the team leader of SEAGULL. Before joining UMich, I obtained my Master’s in Information and Communication Engineering at Tsinghua University in 2020, advised by Professor Zhijian Ou. In 2019, I worked with Professor Zhou Yu as a visiting scholar on task-oriented dialog systems. I got my Bachelor’s in Electronic Information Science and Technology from Tsinghua University in 2017.
News
May 23, 2025 | Excited to share that I’ve joined the Multimodal Interaction & World Model team at ByteDance! Looking forward to tackling new challenges ahead. I’m open to collaborations and mentoring interns — feel free to reach out! |
---|---|
Jun 17, 2024 | Agent-Eval-Refine won the best paper award at CVPR’24 MAR Workshop, and is accepted to COLM 2024! |
Feb 26, 2024 | GROUNDHOG is accepted to CVPR 2024! see you in Seattle! |
Feb 15, 2024 | I will join Meta Reality Labs to work with Dr. Shane Moon in Summer 2024 as a Research Scientist Intern. Hope to meet new friends in Seattle! |
Oct 07, 2023 | Two papers accepted to EMNLP 2023! |