🏢 Alibaba Group

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
AI Generated · 🤗 Daily Papers · Natural Language Processing · Large Language Models · 🏢 Alibaba Group
CODEELO benchmark: evaluating the competitive code generation ability of LLMs with human-level Elo ratings