๐ข Tsinghua University
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
·1717 words·9 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Natural Language Processing
Large Language Models
๐ข Tsinghua University
FoPE: ์ฃผํ์ ์์ญ ํน์ง ๊ฐ์ ์ผ๋ก ๊ธด ๋ฌธ๋งฅ ๊ธธ์ด ์ผ๋ฐํ ๋ฌ์ฑ!
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
·3113 words·15 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Tsinghua University
๋จ์ผ ๋จ๊ณ ์ํ๋ง์ผ๋ก ์ด๋ฏธ์ง ์๋ ํ๊ท ๋ชจ๋ธ ์๋๋ฅผ ํ๊ธฐ์ ์ผ๋ก ํฅ์์ํจ ์ฆ๋ฅ ๋์ฝ๋ฉ(DD) ๊ธฐ๋ฒ ์ ์!
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
·4863 words·23 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Natural Language Processing
Large Language Models
๐ข Tsinghua University
ReLU ๋ผ์ฐํ
์ ์ฌ์ฉํ๋ ์์ ๋ฏธ๋ถ ๊ฐ๋ฅํ MoE ์ํคํ
์ฒ ReMoE๋ฅผ ํตํด ๋๊ท๋ชจ ์ธ์ด ๋ชจ๋ธ์ ํ์ฅ์ฑ๊ณผ ํจ์จ์ฑ์ ํ๊ธฐ์ ์ผ๋ก ๊ฐ์ ํ์ต๋๋ค!
How to Synthesize Text Data without Model Collapse?
·5005 words·24 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Natural Language Processing
Large Language Models
๐ข Tsinghua University
ํฉ์ฑ ๋ฐ์ดํฐ ๊ธฐ๋ฐ ์ธ์ด ๋ชจ๋ธ ํ์ต์ ๋ถ๊ดด ๋ฌธ์ ํด๊ฒฐ: ํ ํฐ ํธ์ง ๊ธฐ๋ฒ ์ ์!
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer
·3363 words·16 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Multimodal Learning
Vision-Language Models
๐ข Tsinghua University
LLaVA-UHD v2๋ ๊ณ์ธต์ ์๋์ฐ ๋ณํ๊ธฐ๋ฅผ ์ด์ฉ, ๊ณ ํด์๋ ํน์ง ํผ๋ผ๋ฏธ๋๋ฅผ ํตํฉํ์ฌ ๋ค์ํ ์๊ฐ์ ์ธ๋ถ ์ ๋ณด๋ฅผ ํฌ์ฐฉํ๋ ํ์ ์ ์ธ ๋ค์ค ๋ชจ๋ฌ ์ธ์ด ๋ชจ๋ธ์
๋๋ค.
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
·3260 words·16 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Natural Language Processing
Large Language Models
๐ข Tsinghua University
Self-play with refinement boosts instruction-following in LLMs.
ColorFlow: Retrieval-Augmented Image Sequence Colorization
·2273 words·11 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Computer Vision
Image Generation
๐ข Tsinghua University
๋งํ ์ฑ์ ์๋ํ: ColorFlow๋ ID ์ผ๊ด์ฑ์ ์ ์งํ๋ฉด์ ํ๋ฐฑ ๋งํ ์ํ์ค๋ฅผ ์ฑ์ํฉ๋๋ค.
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding
·3268 words·16 mins·
loading
·
loading
AI Generated
๐ค Daily Papers
Multimodal Learning
Vision-Language Models
๐ข Tsinghua University
SynerGen-VL: ๊ฐ๋จํ ๊ตฌ์กฐ๋ก ์ด๋ฏธ์ง ์ดํด ๋ฐ ์์ฑ์ ๋์์ ์ํํ๋ ๊ฐ๋ ฅํ MLLM.