π’ Tencent AI Lab
Scaling Laws for Floating Point Quantization Training
·5642 words·27 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tencent AI Lab
λΆλ μμμ μμν νλ ¨μ μλ‘μ΄ scaling law λ°κ²¬: μ§μ, 맨ν°μ¬ λΉνΈ λ° μ€μΌμΌλ§ μΈμ κ³μ° μ λ°λκ° LLM μ±λ₯μ λ―ΈμΉλ μν₯μ μ λμ μΌλ‘ κ·λͺ
HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving
·1341 words·7 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tencent AI Lab
HunyuanProver: λκ·λͺ¨ μΈμ΄ λͺ¨λΈ κΈ°λ°μ νμ₯ κ°λ₯ν λ°μ΄ν° ν©μ± νλ μμν¬μ μλ΄ νΈλ¦¬ νμμ ν΅ν΄ μ΅μ²¨λ¨ μλ μ 리 μ¦λͺ
μ±λ₯ λ¬μ±!
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
·2075 words·10 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tencent AI Lab
λκ·λͺ¨ μΈμ΄ λͺ¨λΈμ κ³Όλν μ°μ° λ¬Έμ ν΄κ²°: ν¨μ¨μ μΈ μΆλ‘ μ μν μλ‘μ΄ μ§ν λ° μκΈ° νμ΅ μ λ΅ μ μ
VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models
·3812 words·18 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Computer Vision
Image Generation
π’ Tencent AI Lab
VideoMaker: μμ νμ° λͺ¨λΈμ κ³ μ ν νμ μ΄μ©ν μ λ‘μ· λ§μΆ€ν μμ μμ±
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
·3181 words·15 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Computer Vision
Video Understanding
π’ Tencent AI Lab
DiTCtrl: νλ μμ΄ λ€μ€ ν둬ννΈλ‘ 맀λλ¬μ΄ μ₯μκ° λΉλμ€ μμ±
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
·366 words·2 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Machine Translation
π’ Tencent AI Lab
DRT-01 λͺ¨λΈμ μ₯λ¬Έμ μ¬κ³ κ³Όμ μ νμ©νμ¬ λ¬Έν λ²μμ μ νλμ μ μ°½μ±μ ν¬κ² ν₯μμμΌ°μ΅λλ€.