Skip to content

ERNIE-ViLG

BaiduVision-language generationImage generationText-to-imageImage captioningLanguage modeling/generationVisual question answering

ERNIE-ViLG is vision-language generation model published by Baidu in 2021 featuring 10000000000.0 parameters.

About ERNIE-ViLG

Conventional methods for the image-text generation tasks mainly tackle the naturally bidirectional generation tasks separately, focusing on designing task-specific frameworks to improve the quality and fidelity of the generated samples. Recently, Vis

Details

Provider
Baidu
Task
Vision-language generation,Image generation,Text-to-image,Image captioning,Language modeling/generation,Visual question answering
Parameters
10000000000.0
Released
2021-12-31
Open weights
No
View model source

Explore

FAQ