
CoT-VTM: Visual-to-Music Generation with Chain-of-Thought Reasoning
Findings of the Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2025
I am currently an Assistant Professor with the Visual Computing Research Center (VCC) (led by Prof. Hui Huang) at the College of Computer Science and Software Engineering, Shenzhen University.
I received my Ph.D. degree in 2024 from the Reasoning and Learning (RL) Group at the College of Computer, Nanjing University, advised by Prof. Yang Gao and Prof. Jing Huo. I also received a dual Ph.D. degree from the Department of Computer Science, City University of Hong Kong, advised by Prof. Jing Liao. I received my B.Sc. degree from Nanjing University in 2017.
My research focuses on machine learning, pattern analysis, and computer vision. Recently, I have been interested in enhancing the transferability and controllability of multi-modal generative models and exploring their applications in dynamic and open-world scenarios such as few-shot learning, in-context learning, and continual learning.
Address: L6-811, Shenzhen University Cang Hai Campus, Shenzhen, China
Email: guzheng@szu.edu.cn
I am looking for highly motivated graduate and undergraduate students for research opportunities in machine learning, computer vision, and generative models starting from Sep. 2025. Feel free to get in touch!
arXiv preprint, 2025
Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2024
ACM Transactions on Graphics (Proceedings of SIGGRAPH), 2024