/plushcap/analysis/voxel51/eliminate-image-duplicates-with-fiftyone

Double Trouble: Eliminate Image Duplicates with FiftyOne

What's this blog post about?

This week's FiftyOne Plugin focuses on Image Deduplication, a task that removes exact and approximate duplicate images from a dataset. Duplicate data can lead to longer training times, higher costs, and lower performance in machine learning models. The plugin allows users to find both exact and approximate duplicate images, visualize these groups of duplicates, and delete all duplicates or keep a representative from each set of duplicates without writing any code. It includes eight operators for deduplication workflows and supports native integrations with Pinecone, Qdrant, Milvus, and LanceDB for large datasets.

Company
Voxel51

Date published
Sept. 14, 2023

Author(s)
Jacob Marks

Word count
1419

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.