publications

Please see my full publication list at google scholar.
* presents equal contribution.

2025

  1. Usenix Security
    Yixin Wu, Ziqing Yang, Yun Shen, Michael Backes, and Yang Zhang
    In USENIX Security Symposium (USENIX Security), 2025
  2. Usenix Security
    On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
    Yixin Wu, Ning Yu, Michael Backes, Yun Shen, and Yang Zhang
    In USENIX Security Symposium (USENIX Security), 2025
  3. Usenix Security
    Xinyue Shen, Yixin Wu, Yiting Qu, Michael Backes, Savvas Zannettou, and Yang Zhang
    In USENIX Security Symposium (USENIX Security), 2025
  4. arxiv
    The Challenge of Identifying the Origin of Black-Box Large Language Models
    Ziqing Yang, Yixin Wu, Yun Shen, Wei Dai, Michael Backes, and Yang Zhang
    CoRR arXiv:2503.04332, 2025
  5. arxiv
    Peering Behind the Shield: Guardrail Identification in Large Language Models
    Ziqing Yang, Yixin Wu, Rui Wen, Michael Backes, and Yang Zhang
    CoRR arXiv:2502.01241, 2025

2024

  1. Usenix Security
    Yixin Wu, Rui Wen, Michael Backes, Pascal Berrang, Mathias Humbert, Yun Shen, and Yang Zhang
    In USENIX Security Symposium (USENIX Security), 2024
  2. CCS
    Yixin Wu, Yun Shen, Michael Backes, and Yang Zhang
    In ACM Conference on Computer and Communications Security (CCS), 2024
  3. PETS
    Yixin Wu, Xinlei He, Pascal Berrang, Mathias Humbert, Michael Backes, Neil Zhenqiang Gong, and Yang Zhang
    In Privacy Enhancing Technologies Symposium (PETS), 2024
  4. EMNLP
    Yihan Ma, Xinyue Shen, Yixin Wu, Boyang Zhang, Michael Backes, and Yang Zhang
    In Empirical Methods in Natural Language Processing (EMNLP), 2024
  5. arxiv
    Xinyue Shen*Yixin Wu*, Michael Backes, and Yang Zhang
    CoRR abs/2405.19103, 2024
  6. arxiv
    Yiting Qu, Xinyue Shen, Yixin Wu, Michael Backes, Savvas Zannettou, and Yang Zhang
    CoRR abs/2405.03486, 2024

2022

  1. arxiv
    Yixin Wu, Ning Yu, Zheng Li, Michael Backes, and Yang Zhang
    CoRR abs/2210.00968, 2022

2021

  1. arxiv
    Xinlei He, Rui Wen, Yixin Wu, Michael Backes, Yun Shen, and Yang Zhang
    CoRR abs/2102.05429, 2021