Paper Reviews by AI

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

16 December 2024·3747 words·18 mins

AI Generated 🤗 Daily Papers Natural Language Processing Question Answering 🏢 Renmin University of China

RetroLLM: a RAG system that unifies retrieval and generation.

Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models

16 December 2024·3489 words·17 mins

AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Inha University

Real-time image protection, a safeguard against deepfakes.

MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes

16 December 2024·2949 words·14 mins

AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 Peking University

MOVIS improves structural awareness in multi-object novel view synthesis for indoor scenes, producing consistent and realistic novel views.

MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

16 December 2024·2262 words·11 mins

AI Generated 🤗 Daily Papers Machine Learning Reinforcement Learning 🏢 ETH Zurich

Boosting reinforcement learning exploration through information gain.

Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning

16 December 2024·2263 words·11 mins

AI Generated 🤗 Daily Papers Machine Learning Federated Learning 🏢 MIPT

A simple transformation is enough to protect data in vertical federated learning.

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations

16 December 2024·3273 words·16 mins

AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 Chinese University of Hong Kong

IDArb: Decomposition under varied lights.

GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

16 December 2024·2232 words·11 mins

AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Shanghai Jiao Tong University

GeoX: a geometric problem solver that outperforms MLLMs!

ColorFlow: Retrieval-Augmented Image Sequence Colorization

16 December 2024·2273 words·11 mins

AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Tsinghua University

Automated comic colorization: ColorFlow colorizes black-and-white comic sequences while preserving ID consistency.

Causal Diffusion Transformers for Generative Modeling

16 December 2024·4953 words·24 mins

AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 ByteDance Research

CausalFusion combines diffusion and autoregressive models to achieve state-of-the-art results in generative modeling and enable new capabilities.

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

15 December 2024·1707 words·9 mins

AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 CUHK MMLab

VividFace: the first diffusion-based video face-swapping framework, delivering high-fidelity results.

Smaller Language Models Are Better Instruction Evolvers

15 December 2024·4310 words·21 mins

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Beijing University of Posts and Telecommunications

Smaller language models are better instruction generators!

Reliable, Reproducible, and Really Fast Leaderboards with Evalica

15 December 2024·1243 words·6 mins

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 JetBrains

Evalica: a toolkit that makes benchmarking easy, fast, and reliable.

GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs

15 December 2024·2657 words·13 mins

AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 Hong Kong University of Science and Technology

GaussianProperty is a training-free framework that uses LMMs to integrate physical properties into 3D Gaussians, enabling downstream tasks such as physics-based simulation and robotic grasping.

DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes

15 December 2024·1754 words·9 mins

AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Google DeepMind

DynamicScaler generates long, seamless panoramic videos from text or images, maintaining consistent motion regardless of resolution and aspect ratio.

TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies

13 December 2024·2744 words·13 mins

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Microsoft Research

TraceVLA: improves a robot's spatial-temporal awareness by visually showing its past movements.

SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video

13 December 2024·3662 words·18 mins

AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 KAIST

SplineGS: a robust motion-adaptive spline for real-time dynamic 3D scenes.

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

13 December 2024·4642 words·22 mins

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Microsoft Corporation

SCBench is a new benchmark that evaluates long-context methods in multi-turn and multi-request scenarios.

RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning

13 December 2024·1911 words·9 mins

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 UC Berkeley

RLDG is a groundbreaking method that improves the performance of generalist robot policies using high-quality data generated through reinforcement learning.

Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images

13 December 2024·1580 words·8 mins

AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 University of British Columbia

P2P: a new text-guided adversarial attack that targets vulnerabilities of medical imaging DNNs.

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity

13 December 2024·3571 words·17 mins

AI Generated 🤗 Daily Papers Computer Vision Video Understanding 🏢 Princeton University

LinGen: minute-length, high-resolution text-to-video generation, maximizing efficiency with linear computational complexity.