TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control Paper • 2409.15977 • Published Sep 24, 2024 • 2
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching Paper • 2502.12572 • Published Feb 18 • 2
TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis Paper • 2505.14910 • Published May 20 • 1
Granite 2.0 Code Models Collection A series of code models trained by IBM and released under the Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated Nov 17 • 202
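Sora Reference Papers Collection The papers cited at the end of OpenAI's "Video generation models as world simulators" technical report, 32 in total. OpenAI's ImageGPT and Dalle3 papers are missing; links to them have been added in the notes. • 32 items • Updated Feb 18, 2024 • 54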
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Paper • 2310.15308 • Published Oct 23, 2023 • 23