Paper Reviews by AI
2024
Multi-LLM Text Summarization
·2623 words·13 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Text Summarization
🏢 UC Santa Cruz
An innovative long-document summarization framework that uses multiple large language models (LLMs) improves summary quality by up to 3x!
MotiF: Making Text Count in Image Animation with Motion Focal Loss
·2819 words·14 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Video Understanding
🏢 Brown University
MotiF: improving text-guided image animation with a motion-focused loss function
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
·4085 words·20 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Microsoft Research
Presents LE-MCTS, a new framework that solves complex reasoning problems more effectively by ensembling large language models!
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
·3581 words·17 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 National University of Singapore
CLEAR: linearized attention dramatically speeds up high-resolution image generation!
UIP2P: Unsupervised Instruction-based Image Editing via Cycle Edit Consistency
·2616 words·13 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 ETH Zurich
Using unsupervised Cycle Edit Consistency (CEC), UIP2P opens a new horizon for instruction-based image editing!
TOMG-Bench: Evaluating LLMs on Text-based Open Molecule Generation
·3930 words·19 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Hong Kong Polytechnic University
TOMG-Bench: a benchmark for LLM-based open molecule generation! It evaluates 25 LLMs and releases OpenMolIns, a new instruction-tuning dataset, pointing to better open-source LLM performance and new possibilities for molecule discovery!
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
·1340 words·7 mins·
AI Generated
🤗 Daily Papers
Multimodal Learning
Multimodal Generation
🏢 University of Illinois Urbana-Champaign
Presents MMAudio, an innovative multimodal joint training framework for high-quality video-to-audio synthesis!
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
·2295 words·11 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Peking University
RobustFT is a framework for robust supervised fine-tuning of large language models under noisy responses; it improves downstream task performance through noise detection and relabeling.
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
·4863 words·23 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Tsinghua University
ReMoE, a fully differentiable MoE architecture that uses ReLU routing, markedly improves the scalability and efficiency of large language models!
Progressive Multimodal Reasoning via Active Retrieval
·2635 words·13 mins·
AI Generated
🤗 Daily Papers
Multimodal Learning
Multimodal Reasoning
🏢 Gaoling School of Artificial Intelligence, Renmin University of China
AR-MCTS: improving multimodal reasoning with active retrieval and Monte Carlo tree search
Parallelized Autoregressive Visual Generation
·3557 words·17 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Peking University
This work speeds up autoregressive visual generation by up to 9.5x through a parallelization strategy that accounts for token dependencies.
Outcome-Refining Process Supervision for Code Generation
·2498 words·12 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Peking University
Presents Outcome-Refining Process Supervision (ORPS), a new methodology that overcomes existing limitations in code generation tasks requiring complex algorithmic reasoning.
MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design
·2237 words·11 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Microsoft Research
MixLLM: a quantization method that improves LLM accuracy and efficiency at the same time through global mixed-precision quantization across output features and a highly efficient system design
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
·2165 words·11 mins·
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 Hong Kong University of Science and Technology
MegaPairs uses VLMs and open-domain images to generate more than 26 million high-quality multimodal training examples, substantially improving universal multimodal retrieval performance.
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps
·7524 words·36 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 TU Darmstadt
M-ALERT is a new benchmark for evaluating the safety of multilingual LLMs. It contains 75,000 prompts across five languages (English, French, German, Italian, and Spanish) and reveals safety inconsistencies in LLMs across languages and categories.
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
·2184 words·11 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Hong Kong University of Science and Technology
LeviTor: an innovative model that synthesizes realistic video from nothing more than a simple user-provided 3D trajectory!
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
·2450 words·12 mins·
AI Generated
🤗 Daily Papers
Computer Vision
3D Vision
🏢 Tencent
Presents IDOL, a model that generates high-quality, animatable 3D avatars from a single image at very high speed!
How to Synthesize Text Data without Model Collapse?
·5005 words·24 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 Tsinghua University
Tackling model collapse in language-model training on synthetic data: a token-editing technique!
Flowing from Words to Pixels: A Framework for Cross-Modality Evolution
·2904 words·14 mins·
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 GenAI, Meta
CrossFlow: an innovative framework that maps directly from one modality to another!
Fietje: An open, efficient LLM for Dutch
·2556 words·12 mins·
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
🏢 KU Leuven
Fietje: an open-source, small Dutch LLM released!