Computer Vision
BrushEdit: All-In-One Image Inpainting and Editing
·3188 words·15 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Peking University
BrushEdit: All-in-One Image Inpainting & Editing.
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
·3493 words·17 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Video Understanding
🏢 Nanjing University
InstanceCap: Improving text-to-video generation through instance-aware structured captions.
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
·1899 words·9 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Nanyang Technological University
Generate 8K images without tuning using FreeScale!
FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers
·2291 words·11 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Virginia Tech
ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
·3512 words·17 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Google
A new era of object compositing: get realistic results without tuning using ObjectMate.
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
·3977 words·19 mins·
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 Shanghai Artificial Intelligence Laboratory
Evaluation Agent: A faster, more flexible, and explainable evaluation framework for visual generative models.
Background-aware Moment Detection for Video Moment Retrieval
·2175 words·11 mins·
AI Generated
Computer Vision
Video Understanding
🏢 Seoul National University
BM-DETR: Solving the weak alignment problem in video moment retrieval by leveraging background information!