I am Haibo Jin, a PhD student in Information Sciences at the University of Illinois Urbana-Champaign, supervised by Prof. Haohan Wang.
My research interests include trustworthy machine learning and the robustness of deep learning systems. I am currently working on attacks and defenses for computer vision, diffusion models, and multi-modal models. If you are interested in any form of academic collaboration, please feel free to email me.
News
- 2025.10: Visit the NeurIPS'25 paper website.
- 2025.09: Evaluating the Inductive Abilities of Large Language Models: Why Chain-of-Thought Reasoning Sometimes Hurts More Than Helps is accepted to NeurIPS'25!
- 2025.08: Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation is accepted to EMNLP'25!
- 2025.07: Visit the Revolve website.
- 2025.06: Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization is accepted to ICML'25!
- 2024.12: Check out our survey paper: Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems.
- 2024.11: Visit the JAM website.
- 2024.09: Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters is accepted to NeurIPS'24!
- 2024.08: Fight Perturbations with Perturbations: Defending Adversarial Attacks via Neuron Influence is accepted to IEEE Transactions on Dependable and Secure Computing (TDSC)!
- 2024.07: CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing is accepted to ECCV'24!
- 2024.07: EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models is accepted to ECCV'24!
- 2024.06: Visit the JailbreakZoo website.
- 2024.05: Welcome to my new homepage!
- 2024.04: Received a TA/RA offer from the School of Information Sciences, University of Illinois Urbana-Champaign!
- 2024.03: Visit JailbreakZoo, a dedicated repository on the jailbreaking of large models (LMs), covering both large language models (LLMs) and vision language models (VLMs).
- 2024.01: Started at the University of Illinois Urbana-Champaign as a visiting scholar!
Selected Publications


Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation
Jun Zhuang, Haibo Jin, Ye Zhang, Zhengjian Kang, Wenbin Zhang, Gaby G Dagher, Haohan Wang
The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP'25)
Honors and Awards
- 2022.10: National Scholarship, Postgraduate Premium Scholarship.
Education
- 2025.09 - now: PhD Candidate, Information Sciences, University of Illinois Urbana-Champaign, Illinois, US.
Internships
- 2023.04 - 2025.08: Visiting scholar at DREAM Lab, University of Illinois Urbana-Champaign, US. (Supervisor: Haohan Wang)
Service
- 2025: ICLR 2025, AISTATS 2025
- 2024: NeurIPS 2024, ICML 2024, WWW 2024, SeT LLM @ ICLR 2024