/plushcap/analysis/voxel51/finding-outliers-in-your-vision-datasets

Finding Outliers in Your Vision Datasets

What's this blog post about?

The text discusses the importance of high-quality datasets in AI, as poor samples can negatively impact model performance. It introduces FiftyOne Plugins, a tool that helps identify and remove outliers from datasets. Outlier detection is demonstrated using embeddings and sklearn, with examples including finding classification and detection mistakes, removing duplicates, addressing image quality issues, and visualizing embeddings. The text also highlights the usefulness of the Outlier Detection Plugin in discovering unique samples that can be used for data curation decisions.

Company
Voxel51

Date published
March 7, 2024

Author(s)
Dan Gural

Word count
601

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.