State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories. This restricted form of supervision limits their generality and usability since additional labeled data is needed to specify any other visual concept. Learning directly from raw text about...
Research Assistant
AI chat, annotations, notes & similar papers
No comments yet
Be the first to share your thoughts!