π’ University of Washington
Byte Latent Transformer: Patches Scale Better Than Tokens
·3839 words·19 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ University of Washington
BLT: λ°μ΄νΈ κΈ°λ° LLM, ν ν°λ³΄λ€ ν¨μΉ μ°μ .