Computer Vision and FiftyOne Community Year in Review 2023
Voxel51 celebrated its fifth anniversary in 2023, marking a significant milestone for the company that focuses on open-source computer vision tools. The year was marked by several key developments in the field of computer vision, including GPT-4V, which enables users to instruct GPT-4 to analyze image inputs; SAM, a promptable segmentation system with zero-shot generalization capabilities; and DINOv2, a self-supervised system that can learn from any collection of images. Other notable developments included YOLO-NAS, an object detection model that outperforms competitors in terms of accuracy-speed performance; SDXL and SDXL Turbo, which enable near real-time generation of high-quality images; LoRA, a technique for parameter-efficient fine-tuning; DALL-E 3, an image generation model integrated with ChatGPT; Runway Gen-2, a multimodal AI system that can generate novel videos from text, images or video clips; Pika Labs, a startup developing an AI-powered platform for video editing and generation; and Emu Video and Emu Edit, which streamline the processes of training T2V models and precisely editing images via text prompts.
Company
Voxel51
Date published
Dec. 21, 2023
Author(s)
Jimmy Guerrero
Word count
1879
Language
English
Hacker News points
None found.