Yichi Wang - Research Intern

About Me

Passionate research intern exploring the frontiers of multimodal intelligence and security

I am an undergraduate student from Beijing University of Technology & a research intern at Hong Kong University of Science and Technology (Guangzhou). My current research approach primarily employs interpretability methods to focus on innovative research in multimodal systems, embodied intelligence, and AI for Security. Through my work, I hope to enable us not only to drive AI models but also to understand why models make certain decisions, which helps us build more white-box AI systems.

Multimodal Large Language Models

Testing MLLMs from a security perspective, observing substantial biases in models through strategies such as Logit Lens and statistical metrics across various results.

Embodied Intelligence

Implementing numerous Vision Language Action Model baselines, discovering significant vulnerabilities to jailbreak strategies such as Prompt Injection.

AI for Security

Exploring various MLLM models, conducting safety performance testing and fine-tuning of MLLMs from different perspectives, and discovering inherent security issues during current model training processes.

Education

Academic foundation in artificial intelligence and computer science

Beijing University of Technology

Bachelor of Science in Artificial Intelligence

Twice awarded First-Class Scholarship at Beijing University of Technology, once received the annual Outstanding Student award, and achieved scores of 90+ in multiple theoretical and practical courses (Advanced Mathematics (99), Deep Learning (97), Analog Electronic Technology (92), Complex Functions (92)).

GPA: 88.36/100

Research Experience

Collaborative research at leading institutions

Research Intern

The Hong Kong University of Science and Technology (Guangzhou)

Supervisor: Prof. Renjing Xu

Engaged in advanced research projects focusing on AI security, multimodal systems, and adversarial machine learning. Contributing to cutting-edge publications and developing novel approaches to AI robustness and security challenges.

Publications

Participated in multiple CCF-A paper projects and gradually mastered practical and writing skills through this experience

IJCAI Workshop Best Paper Nomination

Exploring Typographic Visual Prompts Injection Threats

IJCAI 2025 Workshop

Investigates novel attack vectors in multimodal AI systems through typographic visual prompts, revealing critical security vulnerabilities and proposing defensive mechanisms.

Paper

ACM MM 2025

Transfer Attack for Bad and Good

ACM Multimedia Conference 2025

Explores the dual nature of transfer attacks in multimodal systems, analyzing both malicious applications and potential benefits for robustness testing and model improvement.

Paper

Under Review

JailbreakAudioBench

NeurIPS 2025 (Submitted)

Introduces a comprehensive benchmark for evaluating audio-based jailbreak attacks on multimodal AI systems, providing new insights into audio adversarial vulnerabilities.

Paper Dataset

JailbreakAudioBench Interactive Demo

Experience our cutting-edge audio-based jailbreak testing framework. This interactive demo showcases the capabilities of our benchmark system for evaluating multimodal AI security vulnerabilities through audio inputs.

Explore various audio attack scenarios, test different AI models, and understand the importance of audio security in multimodal systems.

Launch Interactive Demo

Skills & Expertise

Technical and professional skills developed through research and practice

Web Development

Proficient in modern web technologies including HTML5, CSS3, JavaScript, and various frameworks. Experience in building responsive, interactive web applications and research presentation platforms.

Python Programming

Advanced proficiency in Python for research intern, including deep learning frameworks like PyTorch and TensorFlow, data analysis with NumPy and Pandas, and scientific computing.

Technical Presentation

Demonstrating understanding and implementation of interpretability papers and algorithms to fellow students through Bilibili, receiving highly positive feedback from audiences.

Awards & Recognition

Recognition for academic excellence and research contributions

Best Paper Nomination

Received Best Paper Nomination at IJCAI 2025 Workshop for research on typographic visual prompt injection threats in multimodal AI systems.

Academic Excellence

Maintained outstanding academic performance with a GPA of 88.36/100 in the competitive AI program at Beijing University of Technology.

Research Intern

Selected for prestigious research Intern with HKUST(GZ) under the supervision of Prof. Renjing Xu, focusing on advanced AI security research.