Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis

March 2, 2023

Any-scale image synthesis offers an efficient and scalable solution to synthesize photo-realistic images at any scale, even going beyond 2K resolution. However, existing GAN-based solutions depend excessively on convolutions and a hierarchical architecture, which introduce inconsistency and the ”texture sticking” issue when scaling the output resolution. From another perspective, INR-based generators are scale-equivariant by design, but their huge memory footprint and slow inference hinder these networks from being adopted in large-scale or real-time systems. In this work, we propose \textbf{C}olumn-\textbf{R}ow \textbf{E}ntangled \textbf{P}ixel \textbf{S}ynthesisthes (\textbf{CREPS}), a new generative model that is both efficient and scale-equivariant without using any spatial convolutions or coarse-to-fine design. To save memory footprint and make the system scalable, we employ a novel bi-line representation that decomposes layer-wise feature maps into separate ”thick” column and row encodings. Experiments on standard datasets, including FFHQ, LSUN-Church, and MetFaces, confirm CREPS’ ability to synthesize scale-consistent and alias-free images up to 4K resolution with proper training and inference speed.

Back to research

Overall

< 1 minute

Thuan Nguyen, Thanh Le, Anh Tran

CVPR 2023

Download PDF

Download Code

Related publications

GenAI

NLP

LREC-COLING

Improving Vietnamese-English Medical Machine Translation

June 28, 2024

Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi, Wray Buntine

GenAI

NLP

Findings of ACL

Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs

June 28, 2024

Minh-Vuong Nguyen, Linhao Luo, Fatemeh Shiri, Dinh Phung, Yuan-Fang Li, Thuy-Trang Vu, Gholamreza Haffari

GenAI

NLP

Findings of ACL

Realistic Evaluation of Toxicity in Large Language Models

June 28, 2024

Tinh Son Luong, Thanh-Thien Le, Linh Van Ngo, and Thien Huu Nguyen

GenAI

NLP

ACL Top Tier

UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages

June 28, 2024

Trinh Pham*, Khoi M. Le*, Luu Anh Tuan

Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis

Related publications

Thank you!