Zhedong Lin

About Me

Research Overview

Hi, I’m Zhedong Lin (林哲栋). I am currently preparing to begin my PhD studies at the University of Auckland, where I will continue my research under the supervision of Prof. Jiamou Liu. I hold an MSc in Artificial Intelligence from the University of Auckland and received my BSc in Computer Science and Technology from Southwest University in 2025. My research interests include large language models (LLMs), multimodal generative AI, and their applications in text-to-image synthesis, text-to-video generation, image and video editing, and controllable diffusion models. If you are interested in my research or would like to discuss potential collaborations, please feel free to contact me at zlin629@aucklanduni.ac.nz.

Research Interests

My research focuses on multimodal generative AI, with a particular interest in text-to-image and text-to-video generation, image and video editing, controllable diffusion models, and LLMs. I aim to develop foundation-model-based methods that generate coherent, high-quality, and controllable visual content across modalities.

Text-to-Image and Text-to-Video Generation: Generating high-quality visual content from textual descriptions with better consistency and semantic alignment.
Image and Video Editing: Enabling flexible, user-guided editing while preserving contextual and visual coherence.
Controllable Diffusion Models: Improving fine-grained control over generative processes for creative and practical applications.
Multimodal Foundation Models: Exploring how LLMs and multimodal models can support generation, editing, and cross-modal understanding.

My work aims to advance AI systems for creating consistent, controllable, and editable visual content.

News and Updates

2026.07: 🎉 One paper was accepted by ACM Multimedia 2026 (ACM MM 2026) (CCF-A).
2026.07: 🎤 Delivered a guest lecture on Agentic AI at WHMC.
2026.03: 🚀 Contributed to the open-source project Code Vibe Reading, a VS Code extension for AI-assisted code understanding and navigation.
2026.01: 🎉 One paper was accepted by Artificial Intelligence Review (SCI Q1 Top).
2025.12: 🎉 Won the Best Multimedia Award at ACM MM Asia 2025.
2025.10: 🎓 Received the China Scholarship Council (CSC) Full Scholarship for PhD studies at the University of Auckland, starting in 2026.
2025.10: 🎉 One paper was accepted by ACM MM Asia 2025 (CCF-C).
2025.06: 🎓 Graduated with a BSc in Computer Science and Technology from Southwest University and received the Outstanding Graduate honor.

Selected Publications

📄 Narratology Meets Text-to-Image: A Survey of Consistency in AI Generated Storybook Illustrations
Zhedong Lin, Zhongsheng Wang, Qian Liu, Xinyu Zhang, Jiamou Liu
Artificial Intelligence Review (JCR Q1, SCI Q1 Top), 2026

📄 CharCom: Composable Identity Control for Multi-Character Story Illustration
Zhongsheng Wang, Ming Lin, Zhedong Lin, Yaser Shakib, Qian Liu, Jiamou Liu
ACM Multimedia Asia 2025 (CCF-C, Best Multimedia Award)

Projects

🚀 Code Vibe Reading — VS Code extension for AI-assisted code understanding and navigation

Awards & Honors

China Scholarship Council (CSC) Full Scholarship
University of Auckland High Achiever Scholarship
University of Auckland First in Course Award
Academic Scholarship at Southwest University
Outstanding Graduate Award at Southwest University
Best Multimedia Paper Award – ACM Multimedia Asia 2025

Teaching

Teaching Assistant, COMPSCI 120: Mathematics for Computer Science