π’ Carnegie Mellon University
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
·2422 words·12 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Carnegie Mellon University
TheAgentCompany λ²€μΉλ§ν¬λ μ€μ μννΈμ¨μ΄ νμ¬ νκ²½μ λͺ¨λ°©νμ¬ LLM μμ΄μ νΈμ μ€μ μ
무 μν λ₯λ ₯μ νκ°νλ©°, AI μμ΄μ νΈμ νμ€ μΈκ³ μ μ© κ°λ₯μ±κ³Ό νκ³λ₯Ό 보μ¬μ€λλ€.