/plushcap/analysis/arize/arize-t-sne-vs-umap

SNE vs. t-SNE vs. UMAP: An Evolutionary Guide

What's this blog post about?

Dimension reduction techniques are crucial in data science for visualization and pre-processing in machine learning. Three popular dimensionality reduction techniques are SNE (Stochastic Neighbor Embedding), t-SNE (t-distributed Stochastic Neighbor Embedding), and UMAP (Uniform Manifold Approximation and Projection). These neighbor graph algorithms follow a similar process, starting with computing high-dimensional probabilities p, then low-dimensional probabilities q. The cost function C(p,q) is calculated by comparing the differences between probabilities, which is then minimized to obtain human-interpretable information from the embedding space.

Company
Arize

Date published
July 15, 2022

Author(s)
Francisco Castillo

Word count
452

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.