π’ Chinese University of Hong Kong
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
·1981 words·10 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Multimodal Learning
Vision-Language Models
π’ Chinese University of Hong Kong
Dispider: μ€μκ° μνΈμμ©μ μν΄ λΆλ¦¬λ μΈμ, κ²°μ , λ°μμ μ¬μ©νλ λΉλμ€ LLMμ κ°λ₯νκ² ν©λλ€.
NILE: Internal Consistency Alignment in Large Language Models
·2709 words·13 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Chinese University of Hong Kong
NILE νλ μμν¬λ LLMμ λ΄λΆ μ§μκ³Ό IFT λ°μ΄ν°μ
μ μΈκ³ μ§μ κ° μΌκ΄μ±μ λμ¬ LLM μ±λ₯μ μ΅λ 68.5%κΉμ§ ν₯μμν΅λλ€.
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
·3273 words·16 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Computer Vision
3D Vision
π’ Chinese University of Hong Kong
IDArb: Decomposition under varied lights.