🏒 Chinese University of Hong Kong

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
·1981 words·10 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Chinese University of Hong Kong
Dispider: Enables video LLMs to interact in real time by disentangling perception, decision, and reaction.
NILE: Internal Consistency Alignment in Large Language Models
·2709 words·13 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Chinese University of Hong Kong
The NILE framework improves LLM performance by up to 68.5% by increasing the consistency between the LLM's internal knowledge and the world knowledge in IFT datasets.
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
·3273 words·16 mins
AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 Chinese University of Hong Kong
IDArb: Decomposition under varied lights.