Skip to main content

🏒 Alibaba Group

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
·1888 words·9 mins· loading · loading
AI Generated πŸ€— Daily Papers Natural Language Processing Large Language Models 🏒 Alibaba Group
CODEELO 벀치마크: 인간 μˆ˜μ€€μ˜ Elo λ“±κΈ‰μœΌλ‘œ LLM의 경쟁적 μ½”λ“œ 생성 λŠ₯λ ₯ 평가