Ai pages
updated
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing
Paper
• 2404.09990
• Published
• 14
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through
Direct Preference Optimization
Paper
• 2404.09956
• Published
• 12
TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal
Large Language Models
Paper
• 2404.09204
• Published
• 11
Taming Latent Diffusion Model for Neural Radiance Field Inpainting
Paper
• 2404.09995
• Published
• 7
A Picture is Worth a Thousand Words: Principled Recaptioning Improves
Image Generation
Paper
• 2310.16656
• Published
• 53
Fabricator: An Open Source Toolkit for Generating Labeled Training Data
with Teacher LLMs
Paper
• 2309.09582
• Published
• 4
ComputeGPT: A computational chat model for numerical problems
Paper
• 2305.06223
• Published
• 1
GPT4Tools: Teaching Large Language Model to Use Tools via
Self-instruction
Paper
• 2305.18752
• Published
• 5
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit
Paper
• 2312.09911
• Published
• 55