Image Generation
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
·3113 words·15 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Tsinghua University
๋จ์ผ ๋จ๊ณ ์ํ๋ง์ผ๋ก ์ด๋ฏธ์ง ์๋ ํ๊ท ๋ชจ๋ธ ์๋๋ฅผ ํ๊ธฐ์ ์ผ๋ก ํฅ์์ํจ ์ฆ๋ฅ ๋์ฝ๋ฉ(DD) ๊ธฐ๋ฒ ์ ์!
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
·3581 words·17 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข National University of Singapore
CLEAR: ์ ํํ๋ ์ดํ
์
์ผ๋ก ๊ณ ํด์๋ ์ด๋ฏธ์ง ์์ฑ ์๋๋ฅผ ํ๊ธฐ์ ์ผ๋ก ๋์ด๋ค!
UIP2P: Unsupervised Instruction-based Image Editing via Cycle Edit Consistency
·2616 words·13 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข ETH Zurich
๋น์ง๋ ํ์ต ๊ธฐ๋ฐ ์ํ ํธ์ง ์ผ๊ด์ฑ(CEC) ํ์ฉ, ์ง์์ด ๊ธฐ๋ฐ ์ด๋ฏธ์ง ํธ์ง์ ์๋ก์ด ์งํ์ ์ด๋ค!
Parallelized Autoregressive Visual Generation
·3557 words·17 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Peking University
๋ณธ ์ฐ๊ตฌ๋ ํ ํฐ ์์กด์ฑ์ ๊ณ ๋ คํ ๋ณ๋ ฌํ ์ ๋ต์ ํตํด ์๋ ํ๊ท ์๊ฐ์ ์์ฑ์ ์๋๋ฅผ ์ต๋ 9.5๋ฐฐ๊น์ง ํฅ์์์ผฐ์ต๋๋ค.
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
·2184 words·11 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Hong Kong University of Science and Technology
LeviTor: ์ฌ์ฉ์์ ๊ฐํธํ 3D ๊ถค์ ์
๋ ฅ๋ง์ผ๋ก ์ฌ์ค์ ์ธ ๋น๋์ค ํฉ์ฑ์ด ๊ฐ๋ฅํ ํ์ ์ ์ธ ๋ชจ๋ธ!
Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion
·3112 words·15 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Harvard University
Affordance-Aware Object Insertion: ๋ฐฐ๊ฒฝ๊ณผ ์ ๊ฒฝ์ ์ํธ์์ฉ์ ๊ณ ๋ คํ ํ์ค์ ์ธ ์ด๋ฏธ์ง ํฉ์ฑ ๊ธฐ์ !
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation
·3040 words·15 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Dept. ECE, University of Alberta
PixelMan์ ํฝ์
์กฐ์ ๋ฐ ์์ฑ์ ํตํด ํ๋ จ ์์ด๋ ์ผ๊ด์ฑ ์๋ ๊ฐ์ฒด ํธ์ง์ 16๋จ๊ณ ๋ง์ ๋ฌ์ฑํ๋ ํ์ ์ ์ธ ํ์ฐ ๋ชจ๋ธ ๊ธฐ๋ฐ ๋ฐฉ๋ฒ์
๋๋ค.
FashionComposer: Compositional Fashion Image Generation
·2170 words·11 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข University of Hong Kong
FashionComposer: ๋ค์ํ ์
๋ ฅ(ํ
์คํธ, ์์ ์ด๋ฏธ์ง, 3D ๋ชจ๋ธ)์ ํ์ฉํด ์ฌ์ค์ ์ธ ํจ์
์ด๋ฏธ์ง๋ฅผ ํฉ์ฑํ๋ ํ์ ์ ์ธ ํ๋ ์์ํฌ!
Autoregressive Video Generation without Vector Quantization
·3553 words·17 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข BAAI
๋ฒกํฐ ์์ํ ์์ด๋ ํจ์จ์ ์ด๊ณ ์ ์ฐํ ์๊ธฐํ๊ท ๋น๋์ค ์์ฑ ๋ชจ๋ธ, NOVA ๊ฐ๋ฐ!
ChatDiT: A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers
·1484 words·7 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Tongyi Lab
ChatDiT: ์ ๋ก์ท ๋ฐฉ์์ผ๋ก ์ฌ์ ํ๋ จ๋ ํ์ฐ ๋ณํ๊ธฐ๋ฅผ ํ์ฉ, ์์ฐ์ด๋ก ๋ค์ํ ์๊ฐ์ ๊ณผ์ ํด๊ฒฐ!
Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models
·3489 words·17 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Inha University
์ค์๊ฐ ์ด๋ฏธ์ง ๋ณดํธ, ๋ฅํ์ดํฌ ๋๋น์ฑ
.
ColorFlow: Retrieval-Augmented Image Sequence Colorization
·2273 words·11 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Tsinghua University
๋งํ ์ฑ์ ์๋ํ: ColorFlow๋ ID ์ผ๊ด์ฑ์ ์ ์งํ๋ฉด์ ํ๋ฐฑ ๋งํ ์ํ์ค๋ฅผ ์ฑ์ํฉ๋๋ค.
Causal Diffusion Transformers for Generative Modeling
·4953 words·24 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข ByteDance Research
CausalFusion์ ํ์ฐ ๋ฐ ์๊ธฐ ํ๊ท ๋ชจ๋ธ์ ๊ฒฐํฉํ์ฌ ์์ฑ ๋ชจ๋ธ๋ง์์ ์ต์ฒจ๋จ ๊ฒฐ๊ณผ๋ฅผ ๋ฌ์ฑํ๊ณ ์๋ก์ด ๊ธฐ๋ฅ์ ๊ฐ๋ฅํ๊ฒ ํฉ๋๋ค.
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping
·1707 words·9 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข CUHK MMLab
VividFace: ์ฒซ ๋ฒ์งธ ํ์ฐ ๊ธฐ๋ฐ ๋น๋์ค ์ผ๊ตด ๋ฐ๊พธ๊ธฐ ํ๋ ์์ํฌ๋ก ๊ณ ์ถฉ์ค๋ ๊ฒฐ๊ณผ ์ ๊ณต.
DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes
·1754 words·9 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Google DeepMind
DynamicScaler๋ ํ
์คํธ๋ ์ด๋ฏธ์ง์์ ๊ธด ๋๊น ์๋ ํ๋
ธ๋ผ๋ง ๋น๋์ค๋ฅผ ์์ฑํ๋ฉฐ, ํด์๋์ ์ข
ํก๋น์ ๊ด๊ณ์์ด ์ผ๊ด๋ ์์ง์์ ์ ์งํฉ๋๋ค.
Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images
·1580 words·8 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข University of British Columbia
P2P: ํ
์คํธ ๊ธฐ๋ฐ์ ์๋ก์ด ์ ๋์ ๊ณต๊ฒฉ์ผ๋ก ์๋ฃ ์์ DNN์ ์ทจ์ฝ์ฑ ๊ณต๋ต
BrushEdit: All-In-One Image Inpainting and Editing
·3188 words·15 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Peking University
BrushEdit: All-in-One Image Inpainting & Editing.
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
·1899 words·9 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Nanyang Technological University
FreeScale๋ก ํ๋ ์์ด 8K ์ด๋ฏธ์ง ์์ฑ!
FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers
·2291 words·11 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Virginia Tech
''
ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
·3512 words·17 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Google
๊ฐ์ฒด ํฉ์ฑ์ ์ ์๋: ObjectMate๋ก ํ๋ ์์ด ์ฌ์ค์ ์ธ ๊ฒฐ๊ณผ๋ฅผ ์ป์ผ์ธ์.