Snap Inc
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
2525 words · 12 mins
AI Generated
Daily Papers
Multimodal Learning
Multimodal Generation
AV-Link: A breakthrough in cross-modal audio-video generation through temporally aligned diffusion features!