π’ University of Bonn
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
·3101 words·15 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Multimodal Learning
Vision-Language Models
π’ University of Bonn
Video-Panda: μ΄κ²½λ μΈμ½λ μλ λΉλμ€-μΈμ΄ λͺ¨λΈλ‘, κ³μ° λΉμ©μ νκΈ°μ μΌλ‘ μ€μ΄λ©΄μ μ΅μ²¨λ¨ μ±λ₯μ λ¬μ±!