I am a second-year PhD student at the Media Synthesis and Forensics Lab (formerly known as Multimedia Computing Group), Instutite of Computing Technology, Chinese Academy of Sciences, advised by professor Juan Cao and assistant professor (researcher) Qiang Sheng.

My research interest includes toxic content detection, jailbreaking, alignment, and fake news detection, in the era of large language models.

🔥 News

  • 2025.04  🎉🎉 One co-author paper got accepted by SIGIR 2025.
  • 2025.02  🎉🎉 My scholar profile reached 100 citations!

📖 Educations

  • 2023.09 - now, PhD Candidate
    • Institute of Computing Technology, Chinese Academy of Sciences.
  • 2019.09 - 2023.06, Bachelor of Engineering
    • School of Computer Science and Technology, Shandong University.

📝 Publications

Preprint
sym

From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring

Yang Li, Qiang Sheng, Yehan Yang, Xueyao Zhang, Juan Cao

  • Data: We construct FineHarm, a dataset consisting of 29K prompt-response pairs with fine-grained annotations to provide reasonable supervision for token-level training.
  • Model: we propose the Streaming Content Monitor (SCM), which is trained with dual supervision of response- and token-level labels and can follow the output stream of LLM to make a timely judgment of harmfulness.

🎖 Honors and Awards

  • 2025 Merit Student, University of Chinese Academy of Sciences.
  • 2024 Second-Class Academic Scholarship, University of Chinese Academy of Sciences.
  • 2023 Freshman Scholarship of E Fund Financial Technology.
  • 2023 First-Class Academic Scholarship, University of Chinese Academy of Sciences.
  • 2022 Second Prize of Shandong Province, China Undergraduate Mathematical Contest in Modeling.
  • 2022 Third-Class Academic Scholarship, Shandong University.

📚 Academic Services

  • Conf. Reviewer/PC Member
    • TheWebConf (WWW) 2025