Skip to main content

🏒 Tencent AI Lab

Scaling Laws for Floating Point Quantization Training
·5642 words·27 mins· loading · loading
AI Generated πŸ€— Daily Papers Natural Language Processing Large Language Models 🏒 Tencent AI Lab
뢀동 μ†Œμˆ˜μ  μ–‘μžν™” ν›ˆλ ¨μ˜ μƒˆλ‘œμš΄ scaling law 발견: μ§€μˆ˜, 맨티사 λΉ„νŠΈ 및 μŠ€μΌ€μΌλ§ 인자 계산 정밀도가 LLM μ„±λŠ₯에 λ―ΈμΉ˜λŠ” 영ν–₯을 μ •λŸ‰μ μœΌλ‘œ 규λͺ…
HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving
·1341 words·7 mins· loading · loading
AI Generated πŸ€— Daily Papers Natural Language Processing Large Language Models 🏒 Tencent AI Lab
HunyuanProver: λŒ€κ·œλͺ¨ μ–Έμ–΄ λͺ¨λΈ 기반의 ν™•μž₯ κ°€λŠ₯ν•œ 데이터 ν•©μ„± ν”„λ ˆμž„μ›Œν¬μ™€ μ•ˆλ‚΄ 트리 탐색을 톡해 μ΅œμ²¨λ‹¨ μžλ™ 정리 증λͺ… μ„±λŠ₯ 달성!
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
·2075 words·10 mins· loading · loading
AI Generated πŸ€— Daily Papers Natural Language Processing Large Language Models 🏒 Tencent AI Lab
λŒ€κ·œλͺ¨ μ–Έμ–΄ λͺ¨λΈμ˜ κ³Όλ„ν•œ μ—°μ‚° 문제 ν•΄κ²°: 효율적인 좔둠을 μœ„ν•œ μƒˆλ‘œμš΄ μ§€ν‘œ 및 자기 ν•™μŠ΅ μ „λž΅ μ œμ‹œ
VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models
·3812 words·18 mins· loading · loading
AI Generated πŸ€— Daily Papers Computer Vision Image Generation 🏒 Tencent AI Lab
VideoMaker: μ˜μƒ ν™•μ‚° λͺ¨λΈμ˜ κ³ μœ ν•œ νž˜μ„ μ΄μš©ν•œ μ œλ‘œμƒ· λ§žμΆ€ν˜• μ˜μƒ 생성
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
·3181 words·15 mins· loading · loading
AI Generated πŸ€— Daily Papers Computer Vision Video Understanding 🏒 Tencent AI Lab
DiTCtrl: νŠœλ‹ 없이 닀쀑 ν”„λ‘¬ν”„νŠΈλ‘œ λ§€λ„λŸ¬μš΄ μž₯μ‹œκ°„ λΉ„λ””μ˜€ 생성
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
·366 words·2 mins· loading · loading
AI Generated πŸ€— Daily Papers Natural Language Processing Machine Translation 🏒 Tencent AI Lab
DRT-01 λͺ¨λΈμ€ μž₯문의 사고 과정을 ν™œμš©ν•˜μ—¬ λ¬Έν•™ λ²ˆμ—­μ˜ 정확도와 μœ μ°½μ„±μ„ 크게 ν–₯μƒμ‹œμΌ°μŠ΅λ‹ˆλ‹€.