Ask Your Images Anything
This week's FiftyOne plugin is the Visual Question Answering (VQA) Plugin, which lets users ask open-ended questions about their images from within the FiftyOne App without writing any code. It supports two models: a Vision-and-Language Transformer (ViLT) and BLIP2 from Salesforce. Installation uses pip, and the plugin itself is downloaded from GitHub. The VQA Plugin is a Python plugin consisting of four files; it does not write anything to the samples themselves and instead returns the model's answer as output.
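To illustrate the question-in, answer-out behavior described above, here is a minimal Python sketch of visual question answering with a ViLT model. It is not the plugin's actual code. It assumes the Hugging Face transformers library and the dandelin/vilt-b32-finetuned-vqa checkpoint, and the function name answer_question and its vqa_model parameter are hypothetical names chosen for this example.

```python
def answer_question(image_path, question, vqa_model=None):
    """Return the model's top answer to `question` about one image.

    Nothing is stored or modified; the answer is simply returned,
    mirroring the behavior described in the summary.
    `vqa_model` is a hypothetical hook so a different model (or a stub)
    can be swapped in.
    """
    if vqa_model is None:
        # Assumption: Hugging Face transformers is installed and the
        # ViLT VQA checkpoint can be downloaded on first use.
        from transformers import pipeline

        vqa_model = pipeline(
            "visual-question-answering",
            model="dandelin/vilt-b32-finetuned-vqa",
        )

    # The pipeline returns a list of {"answer": ..., "score": ...} dicts,
    # sorted by score; keep only the best one.
    results = vqa_model(image=image_path, question=question, top_k=1)
    return results[0]["answer"]
```

Called as answer_question("photo.jpg", "What is the animal doing?"), the function would load the model on first use and return a short text answer; the image path here is a placeholder.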
Company
Voxel51
Date published
Aug. 31, 2023
Author(s)
Jacob Marks
Word count
1069
Language
English
Hacker News points
None found.