I am a researcher at Youtu Lab, Tencent. I received my M.S. from the Department of Artificial Intelligence, School of Informatics, Xiamen University, where I was advised by Prof. Xiaoshuai Sun and Prof. Rongrong Ji.

My recent research focuses on 2D and 3D vision-and-language learning.

  • 01/2024 – Now: Researcher in Youtu Lab, Tencent
  • 09/2021 – 06/2024: M.S. in Artificial Intelligence, Xiamen University
  • 01/2023 – 07/2023: Multi-modal Research Intern, Netease Fuxi AI Lab
  • 09/2017 – 06/2021: B.S. in Intelligent Science and Technology, Xiamen University

Latest News

2025
  • 12/2025 Two papers accepted by AAAI 2026 (ZoomFakes, TripleFDS)
  • 09/2025 One paper accepted by TPAMI 2025 (NICE)
  • 09/2025 One paper accepted by TIFS 2025 (ME-FAS)
  • 07/2025 One paper accepted by TOMM 2025 (X-Dreamer)
2024
  • 09/2024 One paper accepted by NeurIPS 2024 (RG-SAN)
  • 07/2024 One paper accepted by ECCV 2024 (Exploring Phrase-Level Grounding)
  • 07/2024 One paper accepted by ACM MM 2024 (3D-GRES)
  • 05/2024 One paper accepted by ICML 2024 (SAM as the Guide)
  • 02/2024 One paper accepted by CVPR 2024 (RMSIN)
  • 02/2024 One paper accepted by TPAMI 2024 (JM3D & JM3D-LLM)
2023
  • 12/2023 Two papers accepted by AAAI 2024
  • 07/2023 Two papers accepted by ACM MM 2023
  • 07/2023 One paper accepted by ICCV 2023

Publications

2026
Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach
Lvpan Cai, Haowei Wang, Jiayi Ji, Yicong Zhoumen, Shen Chen, Taiping Yao, Xiaoshuai Sun
AAAI 2026
TripleFDS: Triple Feature Disentanglement and Synthesis for Scene Text Editing
Yusen Bao, Yiting Wang, Wenhui Huang, Haowei Wang, Shen Chen, Taiping Yao, Shouhong Ding, Jizhong Zhang
AAAI 2026
2025
NICE: Improving Panoptic Narrative Detection and Segmentation with Cascading Collaborative Learning
Haowei Wang, Jiayi Ji, Tianyu Guo, Yilong Yang, Xiaoshuai Sun, Rongrong Ji
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
ME-FAS: Multimodal Text Enhancement for Cross-Domain Face Anti-Spoofing
Lvpan Cai, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Rongrong Ji
IEEE Transactions on Information Forensics and Security (TIFS), 2025
Creating High-Quality 3D Content by Bridging the Gap between Text-to-2D and Text-to-3D Generation
Yiwei Ma, Yijun Fan, Jiayi Ji, Haowei Wang, Hao Yin, Xiaoshuai Sun, Rongrong Ji
ACM Transactions on Multimedia Computing, Communications and Applications (TOMM), 2025
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
Changli Wu, Qi Chen, Jiayi Ji, Haowei Wang, Yiwei Ma, You Huang, Haiwei Fei, Xiaoshuai Sun, Rongrong Ji
NeurIPS 2024
2024
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji
CVPR 2024
JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues
Jiayi Ji, Haowei Wang, Changli Wu, Yiwei Ma, Xiaoshuai Sun, Rongrong Ji
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun
AAAI 2024
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation
Tianyu Guo, Haowei Wang, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun
AAAI 2024
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation
Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji
ICML 2024
3D-GRES: Generalized 3D Referring Expression Segmentation
Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji
ACM MM 2024 (Oral)
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun, Rongrong Ji
ECCV 2024
2023
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji
ACM MM 2023
Semi-Supervised Panoptic Narrative Grounding
Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji
ACM MM 2023
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
Yiwei Ma, Xiaoqing Zhang, Xiaoshuai Sun, Jiayi Ji, Haowei Wang, Guannan Jiang, Weilin Zhuang, Rongrong Ji
ICCV 2023
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Haowei Wang, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Xiaoshuai Sun
AAAI 2023
Preprints
ForgeryVCR: Visual-Centric Reasoning via Efficient Forensic Tools in MLLMs for Image Forgery Detection and Localization
Youqi Wang, Shen Chen, Haowei Wang, Ruiyang Peng, Taiping Yao, Shouting Tan, Chao Chen, Bin Li, Shouhong Ding
arXiv 2026
X-Dreamer: Creating High-Quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
Yiwei Ma, Yijun Fan, Jiayi Ji, Haowei Wang, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji
arXiv 2023

Major Awards

  • πŸŽ–οΈ CSIG Master’s Thesis Award Program, 2025
  • πŸ… Outstanding Graduate of Xiamen University, China, 2024
  • πŸ… National Scholarship, China, 2023
  • πŸŽ–οΈ Merit Student of Xiamen University, China, 2023
  • πŸ† Outstanding Student Scholarship (Grade 1), Xiamen University, 2018–2020
