Learning Transferable Visual Models From Natural Language Supervision

Radford, Alec; Wook Kim, Jong; Hallacy, Chris; Ramesh, Aditya; Goh, Gabriel; Agarwal, Sandhini; Sastry, Girish; Askell, Amanda; Mishkin, Pamela; Clark, Jack; Krueger, Gretchen; Sutskever, Ilya

doi:10.48550/arxiv.2103.00020

Public

Learning Transferable Visual Models From Natural Language Supervision

Shared by NobleBlocks on Feb 26, 2021 • 12:00 AM UTC

Authors:

Alec Radford

Jong Wook Kim

Chris Hallacy

Abstract

State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories. This restricted form of supervision limits their generality and usability since additional labeled data is needed to specify any other visual concept. Learning directly from raw text about...

Subject

Computer science

Artificial intelligence

Generality

Research Assistant

AI chat, annotations, notes & similar papers

Finding related papers...

Discussions

(0)

No comments yet

Be the first to share your thoughts!