Methods to improve quality and diversity of language-vision models