Paper Reviews by AI
2024
MapQaTor: A System for Efficient Annotation of Map Query Datasets
·2879 words·14 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Question Answering
π’ Department of Computer Science and Engineering
MAPQATOR: νλ¬κ·Έμ€νλ μ΄ λ°©μμ μ§λ¦¬κ³΅κ° μ§μμλ΅ λ°μ΄ν°μ
μμ± μμ€ν
LTX-Video: Realtime Video Latent Diffusion
·2625 words·13 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Computer Vision
Video Understanding
π’ Lightricks
LTX-Video: μ΄κ³ μ μ€μκ° κ³ ν΄μλ λΉλμ€ μμ± λͺ¨λΈ
HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving
·1341 words·7 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tencent AI Lab
HunyuanProver: λκ·λͺ¨ μΈμ΄ λͺ¨λΈ κΈ°λ°μ νμ₯ κ°λ₯ν λ°μ΄ν° ν©μ± νλ μμν¬μ μλ΄ νΈλ¦¬ νμμ ν΅ν΄ μ΅μ²¨λ¨ μλ μ 리 μ¦λͺ
μ±λ₯ λ¬μ±!
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation
·3353 words·16 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tsinghua University
LLMμ μ μ§μ μΆλ‘ λ° λ¬Έμ ν΄κ²° λ₯λ ₯μ νκ°νκΈ° μν μλ‘μ΄ λ²€μΉλ§ν¬ HumanEval Pro, MBPP Pro, BigCodeBench-Lite Pro μ μ!
Facilitating large language model Russian adaptation with Learned Embedding Propagation
·1947 words·10 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Lomonosov Moscow State University
LEP(Learned Embedding Propagation)λ μ μ μμ νμ΅ λ°μ΄ν°λ§μΌλ‘λ λ€κ΅μ΄ λκ·λͺ¨ μΈμ΄ λͺ¨λΈμ ν¨μ¨μ μΌλ‘ μ μμν€λ μλ‘μ΄ κΈ°λ²μ
λλ€.
Efficiently Serving LLM Reasoning Programs with Certaindex
·3238 words·16 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ UC San Diego
Dynasorμ LLM μΆλ‘ νλ‘κ·Έλ¨μ μμ μ¬μ©μ μ΅μ ννλ μμ€ν
μΌλ‘, certaindexλΌλ μλ‘μ΄ μ§νλ₯Ό νμ©νμ¬ μ΄λ €μ΄ μ§μμλ λ λ§μ μ°μ°μ, κ°λ¨ν μ§μμλ μ μ μ°μ°μ ν λΉνκ³ , μ λ§μ΄ μλ μ§μλ μ‘°κΈ°μ μ’
λ£ν¨μΌλ‘μ¨ μ νλ, μ§μ° μκ° λ° λΉμ©μ κ· ν μκ² λ§μΆ₯λλ€.
Edicho: Consistent Image Editing in the Wild
·2213 words·11 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Computer Vision
Image Generation
π’ Hong Kong University of Science and Technology
Edicho: μ΄λ―Έμ§ κ° μΌκ΄μ± μ μ§νλ©° μ λ‘μ· μ΄λ―Έμ§ νΈμ§ κ°λ₯!
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
·2075 words·10 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tencent AI Lab
λκ·λͺ¨ μΈμ΄ λͺ¨λΈμ κ³Όλν μ°μ° λ¬Έμ ν΄κ²°: ν¨μ¨μ μΈ μΆλ‘ μ μν μλ‘μ΄ μ§ν λ° μκΈ° νμ΅ μ λ΅ μ μ
Are Vision-Language Models Truly Understanding Multi-vision Sensor?
·3155 words·15 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Multimodal Learning
Vision-Language Models
π’ Integrated Vision Language Lab, KAIST
λ©ν° λΉμ μΌμ λ°μ΄ν°μ λν VLMsμ μ΄ν΄λ ν₯μμ μν μλ‘μ΄ λ²€μΉλ§ν¬(MS-PR)μ DNA μ΅μ ν κΈ°λ² μ μ
Bringing Objects to Life: 4D generation from 3D objects
·2224 words·11 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Computer Vision
Image Generation
π’ NVIDIA
3to4D: ν
μ€νΈ ν둬ννΈλ‘ μ¬μ©μ μ 곡 3D κ°μ²΄λ₯Ό μ€κ°λκ² μ λλ©μ΄μ
ν!
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System
·304 words·2 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Information Extraction
π’ Zhejiang University
OneKE: λ컀 κΈ°λ°, λ€μ€ μμ΄μ νΈ LLM μ§μ μΆμΆ μμ€ν
μΌλ‘ μΉ, PDFμμ λ€μν λλ©μΈ μ§μ μΆμΆ κ°λ₯
On the Compositional Generalization of Multimodal LLMs for Medical Imaging
·4972 words·24 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Computer Vision
Visual Question Answering
π’ Chinese University of Hong Kong, Shenzhen
μλ£ μμμ λν λ€μ€ λͺ¨λ κ±°λ μΈμ΄ λͺ¨λΈμ μΌλ°ν λ₯λ ₯ ν₯μμ ꡬμ±μ μΌλ°ν(CG)κ° ν΅μ¬ μν μ μννλ©°, μ νλ λ°μ΄ν°μμλ ν¨κ³Όμ μμ λ°ν.
Xmodel-2 Technical Report
·2136 words·11 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Xiaoduo AI Lab
Xmodel-2: 12μ΅ λ§€κ°λ³μμ μΆλ‘ μ λ¬Έ λκ·λͺ¨ μΈμ΄ λͺ¨λΈλ‘, ν¨μ¨μ μΈ μ€κ³μ νλ ¨ μ λ΅μ ν΅ν΄ μ΅μ²¨λ¨ μ±λ₯ λ¬μ±!
VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models
·3812 words·18 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Computer Vision
Image Generation
π’ Tencent AI Lab
VideoMaker: μμ νμ° λͺ¨λΈμ κ³ μ ν νμ μ΄μ©ν μ λ‘μ· λ§μΆ€ν μμ μμ±
Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging
·177 words·1 min·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Intel Labs
λ―ΈμΈ μ‘°μ μΌλ‘ μμ μ±μ΄ μ νλ LLMμ μ±λ₯μ ν₯μμν€λ λμμ μμ μ±μ μ μ§νλ κ°νΈνκ³ ν¨κ³Όμ μΈ λͺ¨λΈ κ²°ν© λ°©λ² μ μ!
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
·2961 words·14 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Multimodal Learning
Vision-Language Models
π’ Hong Kong University of Science and Technology
OS-Genesisλ μλ°©ν₯ μμ
ν©μ±μ ν΅ν΄ GUI μμ΄μ νΈ κΆ€μ μμ± μλν λ¬Έμ λ₯Ό ν΄κ²°νλ νμ μ μΈ νμ΄νλΌμΈμ
λλ€.
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition
·2870 words·14 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Multimodal Learning
Vision-Language Models
π’ Xi'an Jiaotong University
LaDeCo: κ³μΈ΅μ μ κ·Ό λ°©μμ μ¬μ©ν μλ κ·Έλν½ λμμΈ ν©μ±
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
·3029 words·15 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Multimodal Learning
Vision-Language Models
π’ Shanghai AI Laboratory
μκ°μ κ³Όμ μ λ ¬μ ν΅ν μμ
μ νΈλ μ΅μ ν(TPO)λ‘ λ©ν°λͺ¨λ¬ λκ·λͺ¨ μΈμ΄ λͺ¨λΈμ μ±λ₯μ νκΈ°μ μΌλ‘ ν₯μμμΌ°μ΅λλ€.
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
·3101 words·15 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Multimodal Learning
Vision-Language Models
π’ University of Bonn
Video-Panda: μ΄κ²½λ μΈμ½λ μλ λΉλμ€-μΈμ΄ λͺ¨λΈλ‘, κ³μ° λΉμ©μ νκΈ°μ μΌλ‘ μ€μ΄λ©΄μ μ΅μ²¨λ¨ μ±λ₯μ λ¬μ±!
Token-Budget-Aware LLM Reasoning
·2417 words·12 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Nanjing University
ν ν° μμ° μΈμ LLM μΆλ‘ νλ μμν¬(TALE)λ₯Ό ν΅ν΄ LLM μΆλ‘ μ ν ν° λΉμ©μ ν¬κ² μ€μ΄λ©΄μ μ±λ₯ μ νλ₯Ό μ΅μννμ΅λλ€!