๐ข Alibaba Group
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
ยท1888 wordsยท9 minsยท
loading
ยท
loading
AI Generated
๐ค Daily Papers
Natural Language Processing
Large Language Models
๐ข Alibaba Group
CODEELO ๋ฒค์น๋งํฌ: ์ธ๊ฐ ์์ค์ Elo ๋ฑ๊ธ์ผ๋ก LLM์ ๊ฒฝ์์ ์ฝ๋ ์์ฑ ๋ฅ๋ ฅ ํ๊ฐ