Multimodal Reasoning

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM
·3242 words·16 mins
AI Generated 🤗 Daily Papers Multimodal Learning Multimodal Reasoning 🏢 Gaoling School of Artificial Intelligence, Renmin University of China
Virgo: Leveraging text-based long-thought data to achieve state-of-the-art performance on a range of multimodal benchmarks!
Progressive Multimodal Reasoning via Active Retrieval
·2635 words·13 mins
AI Generated 🤗 Daily Papers Multimodal Learning Multimodal Reasoning 🏢 Gaoling School of Artificial Intelligence, Renmin University of China
AR-MCTS: Improving multimodal reasoning with active retrieval and Monte Carlo Tree Search