Portrait
Yaxuan Wang
Ph.D. Candidate in Computer Science
University of California, Santa Cruz
About Me

I am a third-year Ph.D. Candidate in Computer Science and Engineering at the University of California, Santa Cruz, advised by Yang Liu. My research currently focuses on LLM alignment and safety, LLM Reasoning, and responsible LLM agents. I also work on LLM unlearning. I study how to adapt and control LLMs to ensure safe, robust, and reliable behavior. My long-term goal is to develop safe, robust, generalizable, and trustworthy LLM-based systems. I have been working as a Research Scientist Intern focusing on conducting high impact research in the Generative AI space at Accenture Data & AI Research Center since June 2024.

More about my research interests:
• LLM Alignment, Unlearning, Reasoning, and Post-training
• LLM Agents: Safe, scalable, and trustworthy LLM-based agentic systems
• Trustworthy Machine Learning: Fairness, Interpretability, Robustness
• Representation Learning & Learning from Dynamic and Noisy Data

I am interested in building intelligent agents or models that can learn from various data sources (e.g., images, audio, and text) and automate decision-making processes in uncertain environments to solve multiple real-world tasks realiably and efficiently. I love the process of giving machines the intelligence to address real-world problems.

Education
  • University of California, Santa Cruz
    University of California, Santa Cruz
    Computer Science and Engineering
    Ph.D. Candidate
    Advisor: Prof. Yang Liu
    GPA: 3.92/4.0
    Sep. 2023 - present
  • Sichuan University
    Sichuan University
    B.S. in Computer Science
    Top-Talent Training Program*
    Advisor: Prof. Qijun Zhao
    GPA: 3.85/4.0
    *A prestigious national program designed to cultivate outstanding students in fundamental disciplines through enhanced, research-oriented coursework.
    Sep. 2019 - Jun. 2023
Honors & Awards
  • Anti-Racist Research Fellowship, UCSC
    2024
  • Dean’s Fellowship, UCSC
    2023
  • Computer Science Polaris Star Award, SCU
    2023
  • Scholarship in Honor of Chinese Contemporary Scientists, SCU
    2023
  • Top 100 Outstanding Students Award, SCU
    2023
  • RoboMaster University AI Challenge - International Third Prize
    2022
  • National Scholarship
    2021
  • CK Power Scholarship
    2021
  • First-Class Scholarship, SCU
    2021
  • RoboMaster University Championship — South Division First Prize
    2021
  • RoboMaster University League — Provincial First Prize
    2021
  • Outstanding Student Award, SCU
    2020
  • Outstanding Cadre, SCU Library Volunteer Team
    2020
  • SCU Engineering Training Competition — First Prize
    2020
  • College-level English Speech Contest — Second Prize
    2019
News
2025
Advanced to Ph.D. Candidacy at UCSC! Featured
Dec 08
Our paper PromptBridge: Cross-Model Prompt Transfer for Large Language Models has been released to arXiv!
Dec 01
2024
Delivered an invited talk titled 'Unlearning in LLM: A New Exploration in AI Safety' in Sichuan University.
Nov 24
Our paper Noise-Resilient Point-wise Anomaly Detection in Time Series Using Weak Segment Labels has been accepted to KDD 2025!
Nov 17
Our paper Large Language Model Unlearning via Embedding-Corrupted Prompts has been accepted to NeurIPS 2024!
Sep 24
Joined Accenture as a Research Scientist Intern, working on LLM Alignment.
Jun 19
Awarded the Anti-Racist Research Fellowship at UCSC
Jun 01
2023
Embarked on my PhD journey, diving into advanced research and exploration in Responsible AI. Featured
Sep 18
Selected Publications (view all )
PromptBridge: Cross-Model Prompt Transfer for Large Language Models
PromptBridge: Cross-Model Prompt Transfer for Large Language Models

Yaxuan Wang, Quan Liu, Zhenting Wang, Zichao Li, Wei Wei, Yang Liu, Yujia Bao

Arxiv 2025

PromptBridge: Cross-Model Prompt Transfer for Large Language Models

Yaxuan Wang, Quan Liu, Zhenting Wang, Zichao Li, Wei Wei, Yang Liu, Yujia Bao

Arxiv 2025

DRAGON: Guard LLM Unlearning in Context via Negative Detection and Reasoning
DRAGON: Guard LLM Unlearning in Context via Negative Detection and Reasoning

Yaxuan Wang, Chris Yuhao Liu, Quan Liu, Jinlong Pang, Wei Wei, Yujia Bao, Yang Liu

Machine Unlearning for Generative AI Workshop for ICML2025; Under review. 2025

DRAGON: Guard LLM Unlearning in Context via Negative Detection and Reasoning

Yaxuan Wang, Chris Yuhao Liu, Quan Liu, Jinlong Pang, Wei Wei, Yujia Bao, Yang Liu

Machine Unlearning for Generative AI Workshop for ICML2025; Under review. 2025

LLM Unlearning via Loss Adjustment with Only Forget Data

Yaxuan Wang, Jiaheng Wei, Chris Yuhao Liu, Jinlong Pang, Quan Liu, Ankit Shah, Yujia Bao, Yang Liu, Wei Wei

The Thirteenth International Conference on Learning Representations (ICLR) 2025

LLM Unlearning via Loss Adjustment with Only Forget Data

Yaxuan Wang, Jiaheng Wei, Chris Yuhao Liu, Jinlong Pang, Quan Liu, Ankit Shah, Yujia Bao, Yang Liu, Wei Wei

The Thirteenth International Conference on Learning Representations (ICLR) 2025

Noise-Resilient Point-wise Anomaly Detection in Time Series Using Weak Segment Labels
Noise-Resilient Point-wise Anomaly Detection in Time Series Using Weak Segment Labels

Yaxuan Wang, Hao Cheng, Jing Xiong, Qingsong Wen, Han Jia, Ruixuan Song, Liyuan Zhang, Zhaowei Zhu, Yang Liu

The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2025

Noise-Resilient Point-wise Anomaly Detection in Time Series Using Weak Segment Labels

Yaxuan Wang, Hao Cheng, Jing Xiong, Qingsong Wen, Han Jia, Ruixuan Song, Liyuan Zhang, Zhaowei Zhu, Yang Liu

The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2025

Large Language Model Unlearning via Embedding-Corrupted Prompts
Large Language Model Unlearning via Embedding-Corrupted Prompts

Chris Yuhao Liu, Yaxuan Wang, Jeffrey Flanigan, Yang Liu

The 38th Annual Conference on Neural Information Processing Systems (NeurIPS) 2024

Large Language Model Unlearning via Embedding-Corrupted Prompts

Chris Yuhao Liu, Yaxuan Wang, Jeffrey Flanigan, Yang Liu

The 38th Annual Conference on Neural Information Processing Systems (NeurIPS) 2024

All publications