Skip to main content

๐Ÿข Tsinghua University

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
·1717 words·9 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Natural Language Processing Large Language Models ๐Ÿข Tsinghua University
FoPE: ์ฃผํŒŒ์ˆ˜ ์˜์—ญ ํŠน์ง• ๊ฐœ์„ ์œผ๋กœ ๊ธด ๋ฌธ๋งฅ ๊ธธ์ด ์ผ๋ฐ˜ํ™” ๋‹ฌ์„ฑ!
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
·3113 words·15 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Tsinghua University
๋‹จ์ผ ๋‹จ๊ณ„ ์ƒ˜ํ”Œ๋ง์œผ๋กœ ์ด๋ฏธ์ง€ ์ž๋™ ํšŒ๊ท€ ๋ชจ๋ธ ์†๋„๋ฅผ ํš๊ธฐ์ ์œผ๋กœ ํ–ฅ์ƒ์‹œํ‚จ ์ฆ๋ฅ˜ ๋””์ฝ”๋”ฉ(DD) ๊ธฐ๋ฒ• ์ œ์•ˆ!
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
·4863 words·23 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Natural Language Processing Large Language Models ๐Ÿข Tsinghua University
ReLU ๋ผ์šฐํŒ…์„ ์‚ฌ์šฉํ•˜๋Š” ์™„์ „ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ MoE ์•„ํ‚คํ…์ฒ˜ ReMoE๋ฅผ ํ†ตํ•ด ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ์˜ ํ™•์žฅ์„ฑ๊ณผ ํšจ์œจ์„ฑ์„ ํš๊ธฐ์ ์œผ๋กœ ๊ฐœ์„ ํ–ˆ์Šต๋‹ˆ๋‹ค!
How to Synthesize Text Data without Model Collapse?
·5005 words·24 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Natural Language Processing Large Language Models ๐Ÿข Tsinghua University
ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ์–ธ์–ด ๋ชจ๋ธ ํ•™์Šต์˜ ๋ถ•๊ดด ๋ฌธ์ œ ํ•ด๊ฒฐ: ํ† ํฐ ํŽธ์ง‘ ๊ธฐ๋ฒ• ์ œ์‹œ!
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer
·3363 words·16 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Multimodal Learning Vision-Language Models ๐Ÿข Tsinghua University
LLaVA-UHD v2๋Š” ๊ณ„์ธต์  ์œˆ๋„์šฐ ๋ณ€ํ™˜๊ธฐ๋ฅผ ์ด์šฉ, ๊ณ ํ•ด์ƒ๋„ ํŠน์ง• ํ”ผ๋ผ๋ฏธ๋“œ๋ฅผ ํ†ตํ•ฉํ•˜์—ฌ ๋‹ค์–‘ํ•œ ์‹œ๊ฐ์  ์„ธ๋ถ€ ์ •๋ณด๋ฅผ ํฌ์ฐฉํ•˜๋Š” ํ˜์‹ ์ ์ธ ๋‹ค์ค‘ ๋ชจ๋‹ฌ ์–ธ์–ด ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
·3260 words·16 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Natural Language Processing Large Language Models ๐Ÿข Tsinghua University
Self-play with refinement boosts instruction-following in LLMs.
ColorFlow: Retrieval-Augmented Image Sequence Colorization
·2273 words·11 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Computer Vision Image Generation ๐Ÿข Tsinghua University
๋งŒํ™” ์ฑ„์ƒ‰ ์ž๋™ํ™”: ColorFlow๋Š” ID ์ผ๊ด€์„ฑ์„ ์œ ์ง€ํ•˜๋ฉด์„œ ํ‘๋ฐฑ ๋งŒํ™” ์‹œํ€€์Šค๋ฅผ ์ฑ„์ƒ‰ํ•ฉ๋‹ˆ๋‹ค.
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding
·3268 words·16 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Multimodal Learning Vision-Language Models ๐Ÿข Tsinghua University
SynerGen-VL: ๊ฐ„๋‹จํ•œ ๊ตฌ์กฐ๋กœ ์ด๋ฏธ์ง€ ์ดํ•ด ๋ฐ ์ƒ์„ฑ์„ ๋™์‹œ์— ์ˆ˜ํ–‰ํ•˜๋Š” ๊ฐ•๋ ฅํ•œ MLLM.