Gelei Deng ☕️

    About Me

    My research focuses on AI safety and AI security. I am particularly interested in leveraging AI and automated systems to attack AI and cyber systems autonomously, enabling scalable and intelligent security testing. I received my PhD from Nanyang Technological University, advised by Prof. Tianwei Zhang and Prof. Yang Liu.

    Download CV
    Interests
    • AI Security and Safety
    • Large Language Models
    • Penetration Testing
    • Blockchain Security
    • System Security
    Education
    • PhD in Computer Science

      Nanyang Technological University

    • B.E. Electrical Engineering

      Singapore University of Technology and Design

    Featured Research
    Selected Projects

    Research artifacts, open-source systems, and security testing frameworks from my recent work.

    Recent Publications
    (2026). What Makes a Good LLM Agent for Real-world Penetration Testing?. arXiv 2026.
    (2026). Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale. arXiv 2026.
    (2025). RSafe: Incentivizing Proactive Reasoning to Build Robust and Adaptive LLM Safeguards. NeurIPS 2025.
    (2025). Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models. NeurIPS 2025.
    (2025). Controllable Spoofing Attacks on Visual SLAM in Robotic Vehicles. ACSAC 2025.
    (2025). When Audio and Text Disagree: Revealing Text Bias in Large Audio-Language Models. EMNLP 2025.
    (2025). Oedipus: LLM-enchanced Reasoning CAPTCHA Solver. CCS 2025.
    (2025). IllusionCAPTCHA: A CAPTCHA based on Visual Illusion. WWW 2025.
    (2025). Source Code Summarization in the Era of Large Language Models. ICSE 2025.
    (2024). Efficient Detection of Toxic Prompts in Large Language Models. ASE 2024.
    (2024). GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models. CCS 2024.