Learning Transferable Visual Models From Natural Language Supervision, Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever, 2021Proceedings of the 38th International Conference on Machine Learning (ICML), Vol. 139 - 介绍了CLIP模型,这是文本到图像和图像到文本任务的基础,也是衡量文本到图像生成中条件一致性的CLIP分数的基础。