DVA (Data Validation Agent) is Gata's first DataAgent. It evaluates the quality of image-caption pairs drawn from across the internet, assigning each pair a score between -1 and 1. DVA scores are used to identify and select the highest-quality data points from the internet pool, which can then be used to pre-train vision-language models such as Stable Diffusion, DALL-E, and GPT-4o.
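
To make the selection step concrete, here is a minimal sketch of score-based filtering. It is not DVA's actual implementation: the `ScoredPair` type, the `select_top_fraction` helper, and the `keep_fraction` parameter are all hypothetical names introduced for illustration, and the sketch assumes that a higher score means higher quality.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ScoredPair:
    """Hypothetical record: one image-caption pair plus its quality score."""
    image_url: str
    caption: str
    score: float  # assumed to lie in [-1, 1], higher meaning better


def select_top_fraction(pairs, keep_fraction=0.5):
    """Return the highest-scoring fraction of pairs, best first.

    This is an illustrative filter, not DVA's real selection rule.
    """
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    for pair in pairs:
        if not -1.0 <= pair.score <= 1.0:
            raise ValueError(f"score out of range: {pair.score}")
    # Round up so that a non-empty pool always keeps at least one pair.
    keep_count = math.ceil(len(pairs) * keep_fraction)
    ranked = sorted(pairs, key=lambda pair: pair.score, reverse=True)
    return ranked[:keep_count]


if __name__ == "__main__":
    # Made-up example data; the URLs and scores are placeholders.
    pool = [
        ScoredPair("https://example.com/a.jpg", "a dog on a beach", 0.9),
        ScoredPair("https://example.com/b.jpg", "click here to buy", -0.3),
        ScoredPair("https://example.com/c.jpg", "red bicycle by a wall", 0.4),
        ScoredPair("https://example.com/d.jpg", "image", 0.1),
    ]
    for pair in select_top_fraction(pool, keep_fraction=0.5):
        print(f"{pair.score:+.1f}  {pair.caption}")
```

Keeping a fixed fraction of the pool is only one possible policy; an absolute score threshold would work just as well in this sketch and may be closer to how a given training pipeline consumes the scores.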