* indicates corresponding author
2026
The Reasoning Trap: How Enhancing LLM Reasoning Amplifies Tool Hallucination
ACL Main 2026
Teach to Reason Safely: Policy-Guided Safety Tuning for MLRMs
ICLR 2026
2025
Single AI Agent Runtime Security Testing Standards
WDTA 2025
AI Agent Security Governance White Paper
China Telecom
Terminal Agent Security 2025
WAIC 2025
2024
Conversation Reconstruction Attack Against GPT Models
EMNLP Main 2024
ZeroFake: Zero-Shot Detection of Fake Images Generated and Edited by Text-to-Image Generation Models
CCS 2024
Games and Beyond: Analyzing the Bullet Chats of Esports Livestreaming
ICWSM 2024
2023
DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models
CCS 2023 Best Paper Finalist · CSAW Europe 2024
Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image Encoders
CVPR 2023