Skip to main content

🏒 University of Bonn

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
·3101 words·15 mins· loading · loading
AI Generated πŸ€— Daily Papers Multimodal Learning Vision-Language Models 🏒 University of Bonn
Video-Panda: μ΄ˆκ²½λŸ‰ 인코더 μ—†λŠ” λΉ„λ””μ˜€-μ–Έμ–΄ λͺ¨λΈλ‘œ, 계산 λΉ„μš©μ„ 획기적으둜 μ€„μ΄λ©΄μ„œ μ΅œμ²¨λ‹¨ μ„±λŠ₯을 달성!