Why 2024 Was the Best Year for Visual AI (So Far)

Company

Voxel51

Date Published

Dec. 20, 2024

Author

Dan Gural

Word count

2291

Language

English

Hacker News points

None

URL

voxel51.com/blog/why-2024-was-the-best-year-for-visual-ai-so-far

Summary

Visual AI made significant advancements in 2024, redefining how machines perceive and transform the world around us. Groundbreaking open source contributions, such as Meta's SAM2 model, enabled faster annotation times and improved video analysis capabilities. The rise of YOLOv9-11 models and Ultralytics' open-source library has made Visual AI more accessible to developers, while libraries like MedSAM2 and Rerun have empowered new tasks in the field. The 2D → 3D revolution saw significant progress in both academia and industry, with NerFs and Gaussian Splatting becoming prominent alternatives for 3D reconstruction. Autonomous vehicles continued to push the boundaries of Visual AI, with Waymo, Tesla, and Wayve making notable advancements. As we look ahead to 2025, predictions include a departure from 2D to 3D, VLMs achieving their "ChatGPT moment," open source driving innovation, and annotation becoming obsolete. The future of Visual AI holds much promise, with the potential to revolutionize industries such as self-driving cars, medical imaging, and more.