Skip to main content

Image Generation

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
·3113 words·15 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Tsinghua University
๋‹จ์ผ ๋‹จ๊ณ„ ์ƒ˜ํ”Œ๋ง์œผ๋กœ ์ด๋ฏธ์ง€ ์ž๋™ ํšŒ๊ท€ ๋ชจ๋ธ ์†๋„๋ฅผ ํš๊ธฐ์ ์œผ๋กœ ํ–ฅ์ƒ์‹œํ‚จ ์ฆ๋ฅ˜ ๋””์ฝ”๋”ฉ(DD) ๊ธฐ๋ฒ• ์ œ์•ˆ!
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
·3581 words·17 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข National University of Singapore
CLEAR: ์„ ํ˜•ํ™”๋œ ์–ดํ…์…˜์œผ๋กœ ๊ณ ํ•ด์ƒ๋„ ์ด๋ฏธ์ง€ ์ƒ์„ฑ ์†๋„๋ฅผ ํš๊ธฐ์ ์œผ๋กœ ๋†’์ด๋‹ค!
UIP2P: Unsupervised Instruction-based Image Editing via Cycle Edit Consistency
·2616 words·13 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข ETH Zurich
๋น„์ง€๋„ ํ•™์Šต ๊ธฐ๋ฐ˜ ์ˆœํ™˜ ํŽธ์ง‘ ์ผ๊ด€์„ฑ(CEC) ํ™œ์šฉ, ์ง€์‹œ์–ด ๊ธฐ๋ฐ˜ ์ด๋ฏธ์ง€ ํŽธ์ง‘์˜ ์ƒˆ๋กœ์šด ์ง€ํ‰์„ ์—ด๋‹ค!
Parallelized Autoregressive Visual Generation
·3557 words·17 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Peking University
๋ณธ ์—ฐ๊ตฌ๋Š” ํ† ํฐ ์˜์กด์„ฑ์„ ๊ณ ๋ คํ•œ ๋ณ‘๋ ฌํ™” ์ „๋žต์„ ํ†ตํ•ด ์ž๋™ ํšŒ๊ท€ ์‹œ๊ฐ์  ์ƒ์„ฑ์˜ ์†๋„๋ฅผ ์ตœ๋Œ€ 9.5๋ฐฐ๊นŒ์ง€ ํ–ฅ์ƒ์‹œ์ผฐ์Šต๋‹ˆ๋‹ค.
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
·2184 words·11 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Hong Kong University of Science and Technology
LeviTor: ์‚ฌ์šฉ์ž์˜ ๊ฐ„ํŽธํ•œ 3D ๊ถค์  ์ž…๋ ฅ๋งŒ์œผ๋กœ ์‚ฌ์‹ค์ ์ธ ๋น„๋””์˜ค ํ•ฉ์„ฑ์ด ๊ฐ€๋Šฅํ•œ ํ˜์‹ ์ ์ธ ๋ชจ๋ธ!
Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion
·3112 words·15 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Harvard University
Affordance-Aware Object Insertion: ๋ฐฐ๊ฒฝ๊ณผ ์ „๊ฒฝ์˜ ์ƒํ˜ธ์ž‘์šฉ์„ ๊ณ ๋ คํ•œ ํ˜„์‹ค์ ์ธ ์ด๋ฏธ์ง€ ํ•ฉ์„ฑ ๊ธฐ์ˆ !
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation
·3040 words·15 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Dept. ECE, University of Alberta
PixelMan์€ ํ”ฝ์…€ ์กฐ์ž‘ ๋ฐ ์ƒ์„ฑ์„ ํ†ตํ•ด ํ›ˆ๋ จ ์—†์ด๋„ ์ผ๊ด€์„ฑ ์žˆ๋Š” ๊ฐ์ฒด ํŽธ์ง‘์„ 16๋‹จ๊ณ„ ๋งŒ์— ๋‹ฌ์„ฑํ•˜๋Š” ํ˜์‹ ์ ์ธ ํ™•์‚ฐ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค.
FashionComposer: Compositional Fashion Image Generation
·2170 words·11 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข University of Hong Kong
FashionComposer: ๋‹ค์–‘ํ•œ ์ž…๋ ฅ(ํ…์ŠคํŠธ, ์˜์ƒ ์ด๋ฏธ์ง€, 3D ๋ชจ๋ธ)์„ ํ™œ์šฉํ•ด ์‚ฌ์‹ค์ ์ธ ํŒจ์…˜ ์ด๋ฏธ์ง€๋ฅผ ํ•ฉ์„ฑํ•˜๋Š” ํ˜์‹ ์ ์ธ ํ”„๋ ˆ์ž„์›Œํฌ!
Autoregressive Video Generation without Vector Quantization
·3553 words·17 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข BAAI
๋ฒกํ„ฐ ์–‘์žํ™” ์—†์ด๋„ ํšจ์œจ์ ์ด๊ณ  ์œ ์—ฐํ•œ ์ž๊ธฐํšŒ๊ท€ ๋น„๋””์˜ค ์ƒ์„ฑ ๋ชจ๋ธ, NOVA ๊ฐœ๋ฐœ!
ChatDiT: A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers
·1484 words·7 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Tongyi Lab
ChatDiT: ์ œ๋กœ์ƒท ๋ฐฉ์‹์œผ๋กœ ์‚ฌ์ „ ํ›ˆ๋ จ๋œ ํ™•์‚ฐ ๋ณ€ํ™˜๊ธฐ๋ฅผ ํ™œ์šฉ, ์ž์—ฐ์–ด๋กœ ๋‹ค์–‘ํ•œ ์‹œ๊ฐ์  ๊ณผ์ œ ํ•ด๊ฒฐ!
Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models
·3489 words·17 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Inha University
์‹ค์‹œ๊ฐ„ ์ด๋ฏธ์ง€ ๋ณดํ˜ธ, ๋”ฅํŽ˜์ดํฌ ๋Œ€๋น„์ฑ….
ColorFlow: Retrieval-Augmented Image Sequence Colorization
·2273 words·11 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Tsinghua University
๋งŒํ™” ์ฑ„์ƒ‰ ์ž๋™ํ™”: ColorFlow๋Š” ID ์ผ๊ด€์„ฑ์„ ์œ ์ง€ํ•˜๋ฉด์„œ ํ‘๋ฐฑ ๋งŒํ™” ์‹œํ€€์Šค๋ฅผ ์ฑ„์ƒ‰ํ•ฉ๋‹ˆ๋‹ค.
Causal Diffusion Transformers for Generative Modeling
·4953 words·24 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข ByteDance Research
CausalFusion์€ ํ™•์‚ฐ ๋ฐ ์ž๊ธฐ ํšŒ๊ท€ ๋ชจ๋ธ์„ ๊ฒฐํ•ฉํ•˜์—ฌ ์ƒ์„ฑ ๋ชจ๋ธ๋ง์—์„œ ์ตœ์ฒจ๋‹จ ๊ฒฐ๊ณผ๋ฅผ ๋‹ฌ์„ฑํ•˜๊ณ  ์ƒˆ๋กœ์šด ๊ธฐ๋Šฅ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping
·1707 words·9 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข CUHK MMLab
VividFace: ์ฒซ ๋ฒˆ์งธ ํ™•์‚ฐ ๊ธฐ๋ฐ˜ ๋น„๋””์˜ค ์–ผ๊ตด ๋ฐ”๊พธ๊ธฐ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ๊ณ ์ถฉ์‹ค๋„ ๊ฒฐ๊ณผ ์ œ๊ณต.
DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes
·1754 words·9 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Google DeepMind
DynamicScaler๋Š” ํ…์ŠคํŠธ๋‚˜ ์ด๋ฏธ์ง€์—์„œ ๊ธด ๋Š๊น€ ์—†๋Š” ํŒŒ๋…ธ๋ผ๋งˆ ๋น„๋””์˜ค๋ฅผ ์ƒ์„ฑํ•˜๋ฉฐ, ํ•ด์ƒ๋„์™€ ์ข…ํšก๋น„์— ๊ด€๊ณ„์—†์ด ์ผ๊ด€๋œ ์›€์ง์ž„์„ ์œ ์ง€ํ•ฉ๋‹ˆ๋‹ค.
Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images
·1580 words·8 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข University of British Columbia
P2P: ํ…์ŠคํŠธ ๊ธฐ๋ฐ˜์˜ ์ƒˆ๋กœ์šด ์ ๋Œ€์  ๊ณต๊ฒฉ์œผ๋กœ ์˜๋ฃŒ ์˜์ƒ DNN์˜ ์ทจ์•ฝ์„ฑ ๊ณต๋žต
BrushEdit: All-In-One Image Inpainting and Editing
·3188 words·15 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Peking University
BrushEdit: All-in-One Image Inpainting & Editing.
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
·1899 words·9 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Nanyang Technological University
FreeScale๋กœ ํŠœ๋‹ ์—†์ด 8K ์ด๋ฏธ์ง€ ์ƒ์„ฑ!
FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers
·2291 words·11 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Virginia Tech
''
ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
·3512 words·17 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Google
๊ฐ์ฒด ํ•ฉ์„ฑ์˜ ์ƒˆ ์‹œ๋Œ€: ObjectMate๋กœ ํŠœ๋‹ ์—†์ด ์‚ฌ์‹ค์ ์ธ ๊ฒฐ๊ณผ๋ฅผ ์–ป์œผ์„ธ์š”.