meUCSD.jpg

Yi Yang (杨益)

Hi! Welcome to my homepage :wink:

I am currently a first-year PhD student in the Halıcıoğlu Data Science Institute at University of California San Diego. Before, I earned my Statistics Bachelor’s degree from the School of the Gifted Young, University of Science and Technology of China in June 2025. You can check my CV (PDF) for more.

I am currently working on the identification theory about causal structures in the presence of latent confounders w/o the long-used but inapplicable invertiable functional assumptions, supervised by Prof. Biwei Huang. During my undergrad, I am passionate about applying ML algorithms with statistical intuition to gain insights into interdisciplinary areas, with a primary focus on linguistics. Below is a summary.

Research Interest

I am interested in applying statistical principles from bayesian, information theory, and causality to better understand and improve complex systems, i.e., generative AI, cognition, languages.

  • Utilizing ML to build useful tools and act as test subjects in different areas
  • Building efficient, reliable, infant-like ML algorithms.

Miscellaneous

One of my biggest dreams is to create a full-length animated film, designed in the aesthetic of traditional Chinese calligraphy, painting, and literature. I hope to make it a reality someday🥹!

Playing badminton and running regularly.

I grew up surrounded by animals🥳. The Chinese Box Turtles and Border Collie are my favoriate.

Movies, poems, and novels always bring me something new amid the trivialities of daily life (my DouBan).

Chinese Calligraphy has been a part of my life since kindergarten. More Interestingly, I even got a chance to learn how to make a mixed-hair brush from scratch during my undergrad😄.

If anything interests you, I am happy to introduce them to you in detail!

Selected Publications [full list]

(*) denotes equal contribution

  1. COLINGOral
    Transformer-based Speech Model Learns Well as Infants and Encodes Abstractions through Exemplars in the Poverty of the Stimulus Environment
    Yi Yang*, Yiming Wang*, and Jiahong Yuan
    In Proceedings of the 31st International Conference on Computational Linguistics
    Oral Presentation
  2. EMNLPFindings
    Automated Tone Transcription and Clustering with Tone2Vec
    Yi Yang, Yiming Wang, and Jiahong Yuan
    In Findings of the Association for Computational Linguistics: EMNLP, 2024.
  3. TheWebConf
    GIF: A General Graph Unlearning Strategy via Influence Function
    Jiancan Wu*, Yi Yang*, Yuchun Qian, Yongduo Sui, Xiang Wang, and Xiangnan He
    In the International World Wide Web Conference, 2023.