Skip to main content

🏒 Carnegie Mellon University

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
·2422 words·12 mins· loading · loading
AI Generated πŸ€— Daily Papers Natural Language Processing Large Language Models 🏒 Carnegie Mellon University
TheAgentCompany λ²€μΉ˜λ§ˆν¬λŠ” μ‹€μ œ μ†Œν”„νŠΈμ›¨μ–΄ νšŒμ‚¬ ν™˜κ²½μ„ λͺ¨λ°©ν•˜μ—¬ LLM μ—μ΄μ „νŠΈμ˜ μ‹€μ œ 업무 μˆ˜ν–‰ λŠ₯λ ₯을 ν‰κ°€ν•˜λ©°, AI μ—μ΄μ „νŠΈμ˜ ν˜„μ‹€ 세계 적용 κ°€λŠ₯μ„±κ³Ό ν•œκ³„λ₯Ό λ³΄μ—¬μ€λ‹ˆλ‹€.