π’ Alibaba Group
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
·1888 words·9 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Alibaba Group
CODEELO λ²€μΉλ§ν¬: μΈκ° μμ€μ Elo λ±κΈμΌλ‘ LLMμ κ²½μμ μ½λ μμ± λ₯λ ₯ νκ°