Jingye Chen - Homepage

qwerty.chen [at] connect.ust.hk
Google Scholar
GitHub

About Me

Jingye Chen is a Ph.D. in HKUST supervised by Prof. Qifeng Chen. He was fortunate to have fruitful internship experience in Microsoft Research Asia, Adobe Research, and Canva Research. He enjoys doing things with soul.

I am currently in the job market. Would like to chat more if you are interested in my research background!

News

[Oct. 2025]

Deliver a talk in ICCV 2025 Graphic Design Understanding and Generation (GDUG) workshop.

[Jun. 2025]

Two paper accepted to ICCV2025. One additional paper accepted to ICCV HiGen Workshop Best Paper Runnerup.

[Mar. 2025]

An awesome repo about generative game is maintained at link. Welcome to any contributions!

[Mar. 2025]

A paper on the numerical and spatial consistency of generative games is released.

[Feb. 2025]

One paper accepted to CVPR2025.

[Nov. 2024]

We release Videotuna, an all-in-one video fine-tuning framework.

[Nov. 2024]

I pass the qualifying exam and become a Ph.D. candidate.

[Jul. 2024]

One paper accepted to ECCV2024 Oral.

[Jul. 2024]

One paper accepted to ACMMM2024.

[May. 2024]

We release a survey about llms for multimodal generation and editing.

[Nov. 2023]

TextDiffuser-2 is released. More flexible.

[Sept. 2023]

We published a multimodal literate model Kosmos-2.5.

[Sept. 2023]

One paper accepted to NeurIPS2023.

[Nov. 2022]

One paper accepted to AAAI2023.

[Oct. 2022]

One paper accepted to EMNLP2022-Findings.

[Jan. 2022]

We construct a benchmark for Chinese text recognition.

[Dec. 2021]

One paper accepted to AAAI2022.

[Apr. 2021]

One paper accepted to IJCAI2021.

[Mar. 2021]

One paper accepted to CVPR2021.

Publications

Model as a Game: On Numerical and Spatial Consistency for Generative Games

Jingye Chen, Yuzhong Zhao, Yupan Huang, Lei Cui, Li Dong, Tengchao Lv, Qifeng Chen, Furu Wei

International Conference on Computer Vision (ICCV HiGen Workshop), 2025, Best Paper Runnerup

[PDF] [Blog]

Large Motion Video Autoencoding with Cross-modal Video VAE

Yazhou Xing, Yang Fei, Yingqing He, Jingye Chen, Jiaxin Xie, Xiaowei Chi, Qifeng Chen

International Conference on Computer Vision (ICCV), 2025

[PDF] [Code]

Rethinking Layered Graphic Design Generation with a Top-Down Approach

Jingye Chen, Zhaowen Wang, Nanxuan Zhao, Li Zhang, Difan Liu, Jimei Yang, Qifeng Chen

International Conference on Computer Vision (ICCV), 2025

[PDF] [ProjectPage] [Video]

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering

Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

European Conference on Computer Vision (ECCV), 2024, Oral Presentation

✨ Top10 in the Hugging Face Space Trending List at Dec. 31st 2023; Featured as Space of the Week.

✨ Used by Recraft V3, the rank 1st image generation model in the global leaderboard.

[PDF] [Code] [ProjectPage] [HuggingFace] [Twitter] [PaperWeekly] [Discord] [?]

Kosmos-2.5: A Multimodal Literate Model

Tengchao Lv*, Yupan Huang*, Jingye Chen*, Lei Cui*, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei

Technical Report, 2023

[PDF] [Code] [HuggingFace]

TextDiffuser: Diffusion Models as Text Painters

Jingye Chen*, Yupan Huang*, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

Neural Information Processing Systems (NeurIPS), 2023

✨ Top10 in the Hugging Face Space Trending List at Jun. 29st 2023; Featured as Space of the Week.

[PDF] [Code] [ProjectPage] [HuggingFace] [GoogleColab] [Twitter] [Zhihu]

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

Minghao Li, Tengchao Lv, Jingye Chen, Lei Cui, Yijuan Lu, Dinei Florencio, Cha Zhang, Zhoujun Li, Furu Wei

AAAI Conference on Artificial Intelligence (AAAI), 2023

✨ Rank 4th in Most Influential AAAI 2023 Papers

[PDF] [Code] [HuggingFace]

XDoc: Unified Pre-training for Cross-Format Document Understanding

Jingye Chen, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei

Empirical Methods in Natural Language Processing (EMNLP-Findings), 2022

[PDF] [Code] [HuggingFace]

Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

Jingye Chen, Haiyang Yu, Jianqi Ma, Mengnan Guan, Xixi Xu, Xiaocong Wang, Shaobo Qu, Bin Li, Xiangyang Xue

Technical Report, 2022

[PDF] [Code] [Zhihu]

Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution

Jingye Chen, Haiyang Yu, Jianqi Ma, Bin Li, Xiangyang Xue

AAAI Conference on Artificial Intelligence (AAAI), 2022

[PDF] [Code]

Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition

Jingye Chen, Bin Li, Xiangyang Xue

International Joint Conference on Artifical intelligence (IJCAI), 2021

[PDF] [Code]

Scene Text Telescope: Text-Focused Scene Image Super-Resolution

Jingye Chen, Bin Li, Xiangyang Xue

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

[PDF] [Code]

Open-source Projects

VideoTuna: A Powerful Toolkit for Video Generation with Model Fine-Tuning and Post-Training

Yingqing He, Yazhou Xing, Zhefan Rao, Haoyu Wu, Zhaoyang Liu, Jingye Chen, Pengjun Fang, Jiajun Li, Liya Ji, Runtao Liu, Xiaowei Chi, Yang Fei, Guocheng Shao, Yue Ma, Qifeng Chen

Open-source Project, 2025

[Code]

LLMs Meet Multimodal Generation and Editing: A Survey

Yingqing He, Zhaoyang Liu, Jingye Chen, Zeyue Tian, Hongyu Liu, Xiaowei Chi, Runtao Liu, Ruibin Yuan, Yazhou Xing, Wenhai Wang, Jifeng Dai, Yong Zhang, Wei Xue, Qifeng Liu, Yike Guo, Qifeng Chen

Technical Report, 2024

[PDF] [Code]

Education

HKUST

Hong Kong

Sept. 2022 – Dec. 2025

PhD Candidate in Computer Science, supervised by Prof. Qifeng Chen

Fudan University

Shanghai

Sept. 2019 – Feb. 2022

Master in Computer Science, supervised by Prof. Bin Li and Prof. Xiangyang Xue

Fudan University

Shanghai

Sept. 2015 – Jun. 2019

BSc in Computer Science

Experiences

Canva Research

Beijing

Jul. 2025 – Nov. 2025

Research Intern, supervised by Dr. Yuhui Yuan

Microsoft Research Asia

Beijing

Feb. 2022 – Jul. 2022, Dec. 2022 – Feb. 2024, Nov. 2024 - Jul. 2025

Research Intern, supervised by Dr. Lei Cui and Dr. Furu Wei

Adobe Research

San Jose, U.S.A.

Apr. 2024 - Aug. 2024

Research Intern, supervised by Dr. Zhaowen Wang

Johns Hopkins University

U.S.A.

Apr. 2021 – Sept. 2021

Summer Intern, supervised by Dr. Yongyi Lu and Prof. Alan Yuille

University of Cambridge

Cambridge, U.K.

Jan. 2018 – Feb. 2018

Visiting student of winter program

Services

Conference Reviewer: CVPR, ICCV, NeurIPS, Siggraph Asia, ACL, EMNLP, AAAI, ACMMM

Journal Reviewer: TPAMI, TMM

Teaching

2023 Spring: COMP 2011 Programming with C++

2023 Fall: COMP 2011 Programming with C++

Awards & Scholarships

ICCV HiGen Workshop Best Paper Runnerup

2025

MSRA Honor Class of 2025

2025

Excellent Master Dissertation Award of Shanghai

2023

RedBird PhD Scholarship in HKUST

2022

Outstanding Graduate of Shanghai (top 5%)

2022

Excellent Student Award

2021

National Scholarship (top 1%)

2021

Outstanding Undergraduate of Shanghai (top 5%)

2019

Best Team Award in University of Cambridge as a Leader

2018

Third Class Undergraduate Scholarship

2016-2018

Jingye Chen (陈竞晔)

Ph.D. in HKUST