π’ Snap Inc
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Β·2525 wordsΒ·12 minsΒ·
loading
Β·
loading
AI Generated
π€ Daily Papers
Multimodal Learning
Multimodal Generation
π’ Snap Inc
AV-Link: μκ° μ λ ¬ νμ° κΈ°λ₯μ ν΅ν ν¬λ‘μ€ λͺ¨λ¬ μ€λμ€-λΉλμ€ μμ±μ νκΈ°μ μΈ λ°μ !