Natural Language Processing
Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models
·1134 words·6 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Speech Recognition
π’ SandLogic Technologies Pvt Ltd.
Mamba μν€ν
μ² κΈ°λ°μ Samba-ASRμ ν¨μ¨μ μΈ μν κ³΅κ° λͺ¨λΈμ μ΄μ©, κΈ°μ‘΄ Transformer λͺ¨λΈμ νκ³λ₯Ό 극볡νκ³ μμ± μΈμ λΆμΌμμ μ΅μ²¨λ¨ μ±λ₯μ λ¬μ±νμ΅λλ€.
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning
·2104 words·10 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Shanghai AI Laboratory
BoostStep: λ¨κ³λ³ μΆλ‘ μΌλ‘ LLMsμ μνμ λ₯λ ₯ ν₯μ!
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
·4797 words·23 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Question Answering
π’ Stanford University
AutoConverterλ μ€νμλ λ°©μμ VQA μ§λ¬Έμ λ€μ§μ λ€ν μ§λ¬ΈμΌλ‘ μλ λ³ννλ μμ€ν
μ
λλ€. μ΄λ₯Ό ν΅ν΄ VLM(Vision Language Model) νκ°μ κ°κ΄μ±κ³Ό μ¬νμ±μ λμΌ μ μμ΅λλ€. μ°κ΅¬μ§μ AutoConverterλ₯Ό μ¬μ©νμ¬ 20κ°μ κΈ°μ‘΄ VQA λ°μ΄ν°μ
μ ν΅ν©ν VMCBenchλΌλ μλ‘μ΄ λ²€μΉλ§ν¬λ₯Ό ꡬμΆνμ΅λλ€. VMCBen…
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
·3178 words·15 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ ByteDance
ToolHop: λκ·λͺ¨ μΈμ΄ λͺ¨λΈμ λ€μ€ λ¨κ³ λꡬ μ¬μ© λ₯λ ₯μ μ격ν νκ°νλ μλ‘μ΄ λ²€μΉλ§ν¬
Test-time Computing: from System-1 Thinking to System-2 Thinking
·699 words·4 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Soochow University
ν
μ€νΈ μκ° μ»΄ν¨ν
μ νμ©νμ¬ λκ·λͺ¨ μΈμ΄ λͺ¨λΈμ μΆλ‘ λ₯λ ₯μ μμ€ν
1 μ¬κ³ μμ μμ€ν
2 μ¬κ³ μμ€μΌλ‘ ν₯μμν€λ λ°©λ²μ μ μνλ νκΈ°μ μΈ μ°κ΅¬!
Scaling Laws for Floating Point Quantization Training
·5642 words·27 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tencent AI Lab
λΆλ μμμ μμν νλ ¨μ μλ‘μ΄ scaling law λ°κ²¬: μ§μ, 맨ν°μ¬ λΉνΈ λ° μ€μΌμΌλ§ μΈμ κ³μ° μ λ°λκ° LLM μ±λ₯μ λ―ΈμΉλ μν₯μ μ λμ μΌλ‘ κ·λͺ
Personalized Graph-Based Retrieval for Large Language Models
·3060 words·15 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ UC Santa Cruz
κ°μΈνλ κ·Έλν κΈ°λ° κ²μ μ¦κ° μμ±(PGraphRAG) νλ μμν¬λ₯Ό ν΅ν΄ ν¬μ λ°μ΄ν° λ¬Έμ λ₯Ό ν΄κ²°νκ³ , LLMμ κ°μΈν μ±λ₯μ ν¬κ² ν₯μμμΌ°μ΅λλ€.
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring
·2684 words·13 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ University of Southern California
70μ΅ κ° λ§€κ°λ³μλ₯Ό κ°μ§ λ©νμ μ 체 κΈ°λ° λκ·λͺ¨ μΈμ΄ λͺ¨λΈ(METAGENE-1)μ΄ νμ λ°μ΄ν°λ‘ νλ ¨λμ΄ λ³μκ· νμ§ λ° μ μ 체 μμ΄ μλ² λ© μμ
μμ μ΅μ²¨λ¨ μ±λ₯μ λ¬μ±νμ΅λλ€.
Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
·3175 words·15 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Ant Group
AUTO-RT: μλνλ μ¬λ° μ λ΅ νμμΌλ‘ LLM μ·¨μ½μ ν¨μ¨μ μΌλ‘ λ°κ²¬!
Dynamic Scaling of Unit Tests for Code Reward Modeling
·2368 words·12 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tsinghua University
λ¨μ ν
μ€νΈμ μλ₯Ό λλ € μ½λ 보μ λͺ¨λΈμ μ νμ±μ λμ΄λ λ°©λ²μ μ μνλ μ°κ΅¬!
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
·1888 words·9 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Alibaba Group
CODEELO λ²€μΉλ§ν¬: μΈκ° μμ€μ Elo λ±κΈμΌλ‘ LLMμ κ²½μμ μ½λ μμ± λ₯λ ₯ νκ°
BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery
·3521 words·17 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Stanford University
BoxingGym: LLM κΈ°λ° κ³Όνμ μμ΄μ νΈμ μ€ν μ€κ³ λ° λͺ¨λΈ λ°κ²¬ λ₯λ ₯ μ’
ν© νκ° λ²€μΉλ§ν¬
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
·3211 words·16 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ University of Texas at Austin
TAPE(conTextualized equivAriant Position Embedding) νλ μμν¬λ₯Ό ν΅ν΄ λ¬Έλ§₯ μ 보λ₯Ό νμ©ν λμ μμΉ μΈμ½λ©μΌλ‘ νΈλμ€ν¬λ¨Έμ μμΉ κΈ°λ° μ£Όμ μ§μ μ±λ₯μ ν₯μμμΌ°μ΅λλ€.
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
·2638 words·13 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ University of Texas at Austin
μ¬μΈ΅ μ κ²½λ§μ μ₯κΈ° μμ‘΄μ±μ λͺ¨λΈλ§νλ ꡬ쑰μ μν κ³΅κ° λͺ¨λΈ(SSM)μ νκ³λ₯Ό 극볡! μ΅μ μ°κ΅¬μμ SSMμ μ΅κ·Ό νΈν₯(recency bias) λ° κ³Όλν ννν(over-smoothing) λ¬Έμ λ₯Ό κ·λͺ
νκ³ , μ΄λ₯Ό ν΄κ²°νλ **κ·Ήμ±ν κΈ°λ²(polarization)**μ μ μνμ¬ μ₯κΈ° ν ν° μκ΄κ΄κ³ μ νλλ₯Ό λμμ΅λλ€.
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
·2183 words·11 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Text Generation
π’ Singapore University of Technology and Design
TANGOFLUX: μ μ 맀κ°λ³μλ‘ μ΄κ³ μ, κ³ νμ§ ν
μ€νΈ μμ± λ³ν
MapQaTor: A System for Efficient Annotation of Map Query Datasets
·2879 words·14 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Question Answering
π’ Department of Computer Science and Engineering
MAPQATOR: νλ¬κ·Έμ€νλ μ΄ λ°©μμ μ§λ¦¬κ³΅κ° μ§μμλ΅ λ°μ΄ν°μ
μμ± μμ€ν
HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving
·1341 words·7 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tencent AI Lab
HunyuanProver: λκ·λͺ¨ μΈμ΄ λͺ¨λΈ κΈ°λ°μ νμ₯ κ°λ₯ν λ°μ΄ν° ν©μ± νλ μμν¬μ μλ΄ νΈλ¦¬ νμμ ν΅ν΄ μ΅μ²¨λ¨ μλ μ 리 μ¦λͺ
μ±λ₯ λ¬μ±!
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation
·3353 words·16 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Tsinghua University
LLMμ μ μ§μ μΆλ‘ λ° λ¬Έμ ν΄κ²° λ₯λ ₯μ νκ°νκΈ° μν μλ‘μ΄ λ²€μΉλ§ν¬ HumanEval Pro, MBPP Pro, BigCodeBench-Lite Pro μ μ!
Facilitating large language model Russian adaptation with Learned Embedding Propagation
·1947 words·10 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ Lomonosov Moscow State University
LEP(Learned Embedding Propagation)λ μ μ μμ νμ΅ λ°μ΄ν°λ§μΌλ‘λ λ€κ΅μ΄ λκ·λͺ¨ μΈμ΄ λͺ¨λΈμ ν¨μ¨μ μΌλ‘ μ μμν€λ μλ‘μ΄ κΈ°λ²μ
λλ€.
Efficiently Serving LLM Reasoning Programs with Certaindex
·3238 words·16 mins·
loading
·
loading
AI Generated
π€ Daily Papers
Natural Language Processing
Large Language Models
π’ UC San Diego
Dynasorμ LLM μΆλ‘ νλ‘κ·Έλ¨μ μμ μ¬μ©μ μ΅μ ννλ μμ€ν
μΌλ‘, certaindexλΌλ μλ‘μ΄ μ§νλ₯Ό νμ©νμ¬ μ΄λ €μ΄ μ§μμλ λ λ§μ μ°μ°μ, κ°λ¨ν μ§μμλ μ μ μ°μ°μ ν λΉνκ³ , μ λ§μ΄ μλ μ§μλ μ‘°κΈ°μ μ’
λ£ν¨μΌλ‘μ¨ μ νλ, μ§μ° μκ° λ° λΉμ©μ κ· ν μκ² λ§μΆ₯λλ€.