ABI Bioinformatics Guide 2024

CtrlK

t-SNE (t-Distributed Stochastic Neighbor Embedding)

Overview:

t-SNE is a non-linear dimensionality reduction technique primarily used for visualization.
It converts high-dimensional Euclidean distances into conditional probabilities that represent similarities.
The algorithm minimizes the Kullback-Leibler divergence between these probability distributions in the high-dimensional and low-dimensional space.

Key Characteristics:

Effective at creating a visual representation of complex data, revealing clusters and patterns.
Sensitive to parameters such as perplexity and learning rate.
Computationally intensive, especially for large datasets.

Applications:

Visualizing high-dimensional data such as images, text, and gene expression data.
Exploring and understanding the structure of the data.

PreviousPCA (Principal Component Analysis)NextUMAP (Uniform Manifold Approximation and Projection)

Last updated 1 year ago

Was this helpful?