Unifying Vision-and-Language Tasks via Text Generation
Paper
• 2102.02779 • Published
This is a VL-T5 (Unifying Vision-and-Language Tasks via Text Generation) model pretrained on Japanese corpus.
日本語コーパスを用いて事前学習を行ったVL-T5 (Unifying Vision-and-Language Tasks via Text Generation) モデルです。