โ†“Skip to main content

๐Ÿข University of Aberdeen

DateLogicQA: Benchmarking Temporal Biases in Large Language Models
ยท2927 wordsยท14 minsยท loading ยท loading
AI Generated ๐Ÿค— Daily Papers Natural Language Processing Large Language Models ๐Ÿข University of Aberdeen
DateLogicQA: LLM์˜ ์‹œ๊ฐ„์  ์ถ”๋ก  ํŽธํ–ฅ ๋ฒค์น˜๋งˆํฌ ์ œ์‹œ! ํ† ํฐํ™”, ํ‘œ์ƒ ๋ฐ ๋…ผ๋ฆฌ ์ˆ˜์ค€ ํŽธํ–ฅ ๋ถ„์„์œผ๋กœ ์‹œ๊ฐ„์  ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ๊ฐœ์„  ๋ฐฉ์•ˆ ์ œ์‹œ!